Czarnecki, WM
5  results:
Search for persons X
?
1

Distral: robust multitask reinforcement learning:

Teh, YW ; Bapst, V ; Czarnecki, WM...
https://ora.ox.ac.uk/objects/uuid:0cfdde8d-8b0b-440a-97b7-7d2a185d1ad6.  , 2020
 
?
2

Information asymmetry in KL-regularized RL:

Galashov, A ; Jayakumar, SM ; Hasenclever, L...
https://ora.ox.ac.uk/objects/uuid:d340b1fd-429b-49aa-9940-e1b69d7c03b3.  , 2020
 
?
3

Smooth markets: A basic mechanism for organizing gradient-b..:

Balduzzi, D ; Czarnecki, WM ; Anthony, T...
https://discovery.ucl.ac.uk/id/eprint/10109590/1/smooth_markets_a_basic_mechanism_for_organizing_gradient_based_learners.pdf.  , 2020
 
?
4

Negotiating team formation using deep reinforcement learnin:

Bachrach, Y ; Everett, R ; Hughes, E...
https://discovery.ucl.ac.uk/id/eprint/10109604/1/RLNego_AIJ_CR.pdf.  , 2020
 
?
5

Human-level performance in 3D multiplayer games with popula..:

Jaderberg, M ; Czarnecki, WM ; Dunning, I...
https://discovery.ucl.ac.uk/id/eprint/10076318/7/Graepel_Human-level%20performance%20in%203D%20multiplayer%20games%20with%20population-based%20reinforcement%20learning_AAM.pdf.  , 2019
 
1-5