I agree that this site is using cookies. You can find further informations
here
.
X
Login
Merkliste (
0
)
Home
About us
Home About us
Our history
Profile
Press & public relations
Friends
The library in figures
Exhibitions
Projects
Training, internships, careers
Films
Services & Information
Home Services & Information
Lending and interlibrary loans
Returns and renewals
Training and library tours
My Account
Library cards
New to the library?
Download Information
Opening hours
Learning spaces
PC, WLAN, copy, scan and print
Catalogs and collections
Home Catalogs and Collections
Rare books and manuscripts
Digital collections
Subject Areas
Our sites
Home Our sites
Central Library
Law Library (Juridicum)
BB Business and Economics (BB11)
BB Physics and Electrical Engineering
TB Engineering and Social Sciences
TB Economics and Nautical Sciences
TB Music
TB Art & Design
TB Bremerhaven
Contact the library
Home Contact the library
Staff Directory
Open access & publishing
Home Open access & publishing
Reference management: Citavi & RefWorks
Publishing documents
Open Access in Bremen
zur Desktop-Version
Toggle navigation
Merkliste
1 Ergebnisse
1
Sample Efficient Reinforcement Learning from Human Feedback..:
Mehta, Viraj
;
Das, Vikramjeet
;
Neopane, Ojash
...
http://arxiv.org/abs/2312.00267. , 2023
Link:
http://arxiv.org/abs/2312.00267
RT Journal T1
Sample Efficient Reinforcement Learning from Human Feedback via Active Exploration
UL https://suche.suub.uni-bremen.de/peid=base-ftarxivpreprints:oai:arXiv.org:2312.00267&Exemplar=1&LAN=DE A1 Mehta, Viraj A1 Das, Vikramjeet A1 Neopane, Ojash A1 Dai, Yijia A1 Bogunovic, Ilija A1 Schneider, Jeff A1 Neiswanger, Willie YR 2023 K1 Computer Science - Machine Learning K1 Computer Science - Artificial Intelligence K1 Statistics - Machine Learning JF http://arxiv.org/abs/2312.00267 LK http://arxiv.org/abs/2312.00267 DO http://arxiv.org/abs/2312.00267 SF ELIB - SuUB Bremen
Export
RefWorks (nur Desktop-Version!)
Flow
(Zuerst in
Flow
einloggen, dann importieren)