E-LIB Suche - Ergebnisse für: Czarnecki, WM

Sorted by: Relevance

Sorted by: Year

Distral: robust multitask reinforcement learning:

Teh, YW ; Bapst, V ; Czarnecki, WM...
https://ora.ox.ac.uk/objects/uuid:0cfdde8d-8b0b-440a-97b7-7d2a185d1ad6. , 2020

Information asymmetry in KL-regularized RL:

Galashov, A ; Jayakumar, SM ; Hasenclever, L...
https://ora.ox.ac.uk/objects/uuid:d340b1fd-429b-49aa-9940-e1b69d7c03b3. , 2020

Link: https://ora.ox.ac.uk/obj..

Smooth markets: A basic mechanism for organizing gradient-b..:

Balduzzi, D ; Czarnecki, WM ; Anthony, T...
https://discovery.ucl.ac.uk/id/eprint/10109590/1/smooth_markets_a_basic_mechanism_for_organizing_gradient_based_learners.pdf. , 2020

Link: https://discovery.ucl.ac..

Negotiating team formation using deep reinforcement learnin:

Bachrach, Y ; Everett, R ; Hughes, E...
https://discovery.ucl.ac.uk/id/eprint/10109604/1/RLNego_AIJ_CR.pdf. , 2020

Link: https://discovery.ucl.ac..

Human-level performance in 3D multiplayer games with popula..:

Jaderberg, M ; Czarnecki, WM ; Dunning, I...
https://discovery.ucl.ac.uk/id/eprint/10076318/7/Graepel_Human-level%20performance%20in%203D%20multiplayer%20games%20with%20population-based%20reinforcement%20learning_AAM.pdf. , 2019

Link: https://discovery.ucl.ac..

1-5