I agree that this site is using cookies. You can find further informations
here
.
X
Login
Merkliste (
0
)
Home
About us
Home About us
Our history
Profile
Press & public relations
Friends
The library in figures
Exhibitions
Projects
Training, internships, careers
Films
Services & Information
Home Services & Information
Lending and interlibrary loans
Returns and renewals
Training and library tours
My Account
Library cards
New to the library?
Download Information
Opening hours
Learning spaces
PC, WLAN, copy, scan and print
Catalogs and collections
Home Catalogs and Collections
Rare books and manuscripts
Digital collections
Subject Areas
Our sites
Home Our sites
Central Library
Law Library (Juridicum)
BB Business and Economics (BB11)
BB Physics and Electrical Engineering
TB Engineering and Social Sciences
TB Economics and Nautical Sciences
TB Music
TB Art & Design
TB Bremerhaven
Contact the library
Home Contact the library
Staff Directory
Open access & publishing
Home Open access & publishing
Reference management: Citavi & RefWorks
Publishing documents
Open Access in Bremen
zur Desktop-Version
Toggle navigation
Merkliste
1 Ergebnisse
1
META-Learning State-based Eligibility Traces for More Sampl..:
, In:
Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems
,
Zhao, Mingde
;
Luan, Sitao
;
Porada, Ian
.. - p. 1647-1655 , 2020
Link:
https://dl.acm.org/doi/10.5555/3398761.3398950
RT T1
Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems
: T1
META-Learning State-based Eligibility Traces for More Sample-Efficient Policy Evaluation
UL https://suche.suub.uni-bremen.de/peid=acm-3398950&Exemplar=1&LAN=DE A1 Zhao, Mingde A1 Luan, Sitao A1 Porada, Ian A1 Chang, Xiao-Wen A1 Precup, Doina PB International Foundation for Autonomous Agents and Multiagent Systems YR 2020 K1 auxiliary task K1 function approximation K1 hyperparameter adaptation K1 meta-learning K1 reinforcement learning K1 temporal difference learning K1 Theory of computation K1 Design and analysis of algorithms K1 Online algorithms K1 Online learning algorithms K1 Computing methodologies K1 Machine learning K1 Machine learning algorithms K1 Dynamic programming for Markov decision processes K1 Temporal difference learning K1 Machine learning approaches K1 Markov decision processes K1 Learning paradigms K1 Reinforcement learning SP 1647 OP 1655 LK http://dx.doi.org/https://dl.acm.org/doi/10.5555/3398761.3398950 DO https://dl.acm.org/doi/10.5555/3398761.3398950 SF ELIB - SuUB Bremen
Export
RefWorks (nur Desktop-Version!)
Flow
(Zuerst in
Flow
einloggen, dann importieren)