Merkliste 
 1 Ergebnisse 
 
1

Vid2Seq: Large-Scale Pretraining of a Visual Language Model..:

, In: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR),
Yang, Antoine ; Nagrani, Arsha ; Seo, Paul Hongsuck... - p. 10714-10726 , 2023