1 result
1
DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference
Holmes, Connor; Tanaka, Masahiro; Wyatt, Michael; ...
http://arxiv.org/abs/2401.08671, 2024
Link: http://arxiv.org/abs/2401.08671
RT Journal
T1 DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference
UL https://suche.suub.uni-bremen.de/peid=base-ftarxivpreprints:oai:arXiv.org:2401.08671&Exemplar=1&LAN=DE
A1 Holmes, Connor
A1 Tanaka, Masahiro
A1 Wyatt, Michael
A1 Awan, Ammar Ahmad
A1 Rasley, Jeff
A1 Rajbhandari, Samyam
A1 Aminabadi, Reza Yazdani
A1 Qin, Heyang
A1 Bakhtiari, Arash
A1 Kurilenko, Lev
A1 He, Yuxiong
YR 2024
K1 Computer Science - Performance
K1 Computer Science - Machine Learning
JF http://arxiv.org/abs/2401.08671
LK http://arxiv.org/abs/2401.08671
DO http://arxiv.org/abs/2401.08671
SF ELIB - SuUB Bremen