E-LIB Suche - Ergebnisse für: Peng, Yifan

1

VoxtLM: Unified Decoder-Only Models for Consolidating Speec..:

, In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),

Maiti, Soumi ; Peng, Yifan ; Choi, Shukjae... - p. 13326-13330 , 2024

Link: https://doi.org/10.1109/..

2

Neural Bokeh: Learning Lens Blur for Computational Videogra..:

, In: 2024 IEEE Conference Virtual Reality and 3D User Interfaces (VR),

Mandl, David ; Mori, Shohei ; Mohr, Peter... - p. 870-880 , 2024

Link: https://doi.org/10.1109/..

3

CAM-GUI: A Conversational Assistant on Mobile GUI:

, In: Communications in Computer and Information Science; Man-Machine Speech Communication,

Zhu, Zichen ; Sun, Liangtai ; Yang, Jingkai... - p. 302-315 , 2024

Link: https://doi.org/10.1007/..

4

Contextualized Automatic Speech Recognition With Attention-..:

, In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),

Sudo, Yui ; Shakeel, Muhammad ; Fukumoto, Yosuke.. - p. 10896-10900 , 2024

Link: https://doi.org/10.1109/..

5

Dynamic-Superb: Towards a Dynamic, Collaborative, and Compr..:

, In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),

Huang, Chien-Yu ; Lu, Ke-Han ; Wang, Shih-Heng... - p. 12136-12140 , 2024

Link: https://doi.org/10.1109/..

6

Joint Prediction and Denoising for Large-Scale Multilingual..:

, In: 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU),

Chen, William ; Shi, Jiatong ; Yan, Brian... - p. 1-8 , 2023

Link: https://doi.org/10.1109/..

7

E-Branchformer: Branchformer with Enhanced Merging for Spee..:

, In: 2022 IEEE Spoken Language Technology Workshop (SLT),

Kim, Kwangyoun ; Wu, Felix ; Peng, Yifan... - p. 84-91 , 2023

Link: https://doi.org/10.1109/..

8

E-Branchformer-Based E2E SLU Toward Stop on-Device Challeng:

, In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),

Kashiwagi, Yosuke ; Arora, Siddhant ; Futami, Hayato... - p. 1-2 , 2023

Link: https://doi.org/10.1109/..

9

Improving Massively Multilingual ASR with Auxiliary CTC Obj..:

, In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),

Chen, William ; Yan, Brian ; Shi, Jiatong... - p. 1-5 , 2023

Link: https://doi.org/10.1109/..

10

A Study on the Integration of Pre-Trained SSL, ASR, LM and ..:

, In: 2022 IEEE Spoken Language Technology Workshop (SLT),

Peng, Yifan ; Arora, Siddhant ; Higuchi, Yosuke... - p. 406-413 , 2023

Link: https://doi.org/10.1109/..

11

I3D: Transformer Architectures with Input-Dependent Dynamic..:

, In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),

Peng, Yifan ; Lee, Jaesong ; Watanabe, Shinji - p. 1-5 , 2023

Link: https://doi.org/10.1109/..

12

The Pipeline System of ASR and NLU with MLM-based data Augm..:

, In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),

Futami, Hayato ; Huynh, Jessica ; Arora, Siddhant... - p. 1-2 , 2023

Link: https://doi.org/10.1109/..

13

Reproducing Whisper-Style Training Using An Open-Source Too..:

, In: 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU),

Peng, Yifan ; Tian, Jinchuan ; Yan, Brian... - p. 1-8 , 2023

Link: https://doi.org/10.1109/..

14

How Does Pruning Impact Long-Tailed Multi-label Medical Ima..:

, In: Lecture Notes in Computer Science; Medical Image Computing and Computer Assisted Intervention – MICCAI 2023,

Holste, Gregory ; Jiang, Ziyu ; Jaiswal, Ajay... - p. 663-673 , 2023

Link: https://doi.org/10.1007/..

15

Learning A Room with the Occ-SDF Hybrid: Signed Distance Fu..:

, In: 2023 IEEE/CVF International Conference on Computer Vision (ICCV),

Lyu, Xiaoyang ; Dai, Peng ; Li, Zizhang... - p. 8906-8916 , 2023

Link: https://doi.org/10.1109/..