Lazaric, Alessandro
280 results:
1

System-2 Recommenders: Disentangling Utility and Engagement..:

In: Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency
 
2

Sketched Newton-Raphson:

Yuan, Rui ; Lazaric, Alessandro ; Gower, Robert M.
SIAM Journal on Optimization. 32 (2022) 3, p. 1555-1583
 
4

A truthful learning mechanism for contextual multi-slot spo..:

In: Proceedings of the 13th ACM Conference on Electronic Commerce
 
5

Learning with stochastic inputs and adversarial outputs:

Lazaric, Alessandro ; Munos, Rémi
Journal of Computer and System Sciences. 78 (2012) 5, p. 1516-1537
 
6

A truthful learning mechanism for multi-slot sponsored sear..:

In: Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
 
7

Workshop summary: On-line learning with limited feedback:

In: Proceedings of the 26th Annual International Conference on Machine Learning
 
9

On the usefulness of opponent modeling : the Kuhn Poker ..:

In: Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
 
10

Transfer of task representation in reinforcement learning u..:

In: Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
 
11

Transfer of samples in batch reinforcement learning:

In: Proceedings of the 25th international conference on Machine learning
 
12

Reinforcement learning in extensive form games with incompl..:

In: Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
 
13

Learning to cooperate in multi-agent social dilemmas:

In: Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
 
14

Batch Reinforcement Learning for Controlling a Mobile Wheel..:

In: Artificial Intelligence in Theory and Practice II; IFIP – The International Federation for Information Processing
 