Peng, Yifan
150  results:
Search for persons X
?
1

VoxtLM: Unified Decoder-Only Models for Consolidating Speec..:

, In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
Maiti, Soumi ; Peng, Yifan ; Choi, Shukjae... - p. 13326-13330 , 2024
 
?
2

Neural Bokeh: Learning Lens Blur for Computational Videogra..:

, In: 2024 IEEE Conference Virtual Reality and 3D User Interfaces (VR),
Mandl, David ; Mori, Shohei ; Mohr, Peter... - p. 870-880 , 2024
 
?
3

CAM-GUI: A Conversational Assistant on Mobile GUI:

, In: Communications in Computer and Information Science; Man-Machine Speech Communication,
Zhu, Zichen ; Sun, Liangtai ; Yang, Jingkai... - p. 302-315 , 2024
 
?
4

Contextualized Automatic Speech Recognition With Attention-..:

, In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
Sudo, Yui ; Shakeel, Muhammad ; Fukumoto, Yosuke.. - p. 10896-10900 , 2024
 
?
5

Dynamic-Superb: Towards a Dynamic, Collaborative, and Compr..:

, In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
Huang, Chien-Yu ; Lu, Ke-Han ; Wang, Shih-Heng... - p. 12136-12140 , 2024
 
?
6

Joint Prediction and Denoising for Large-Scale Multilingual..:

, In: 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU),
Chen, William ; Shi, Jiatong ; Yan, Brian... - p. 1-8 , 2023
 
?
7

E-Branchformer: Branchformer with Enhanced Merging for Spee..:

, In: 2022 IEEE Spoken Language Technology Workshop (SLT),
Kim, Kwangyoun ; Wu, Felix ; Peng, Yifan... - p. 84-91 , 2023
 
?
8

E-Branchformer-Based E2E SLU Toward Stop on-Device Challeng:

, In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
 
?
9

Improving Massively Multilingual ASR with Auxiliary CTC Obj..:

, In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
Chen, William ; Yan, Brian ; Shi, Jiatong... - p. 1-5 , 2023
 
?
10

A Study on the Integration of Pre-Trained SSL, ASR, LM and ..:

, In: 2022 IEEE Spoken Language Technology Workshop (SLT),
Peng, Yifan ; Arora, Siddhant ; Higuchi, Yosuke... - p. 406-413 , 2023
 
?
11

I3D: Transformer Architectures with Input-Dependent Dynamic..:

, In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
 
?
12

The Pipeline System of ASR and NLU with MLM-based data Augm..:

, In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
 
?
13

Reproducing Whisper-Style Training Using An Open-Source Too..:

, In: 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU),
Peng, Yifan ; Tian, Jinchuan ; Yan, Brian... - p. 1-8 , 2023
 
?
14

How Does Pruning Impact Long-Tailed Multi-label Medical Ima..:

, In: Lecture Notes in Computer Science; Medical Image Computing and Computer Assisted Intervention – MICCAI 2023,
Holste, Gregory ; Jiang, Ziyu ; Jaiswal, Ajay... - p. 663-673 , 2023
 
?
15

Learning A Room with the Occ-SDF Hybrid: Signed Distance Fu..:

, In: 2023 IEEE/CVF International Conference on Computer Vision (ICCV),
Lyu, Xiaoyang ; Dai, Peng ; Li, Zizhang... - p. 8906-8916 , 2023
 
1-15