E-LIB Search - Results for: Kanda, Naoyuki

1

SpeechX: Neural Codec Language Model as a Versatile Speech ..:

Wang, Xiaofei ; Thakker, Manthan ; Chen, Zhuo...
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024

Link: https://doi.org/10.1109/..

2

Self-Supervised Learning with Bi-Label Masked Speech Predic..:

In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Huang, Zili ; Chen, Zhuo ; Kanda, Naoyuki... - p. 1-5 , 2023

Link: https://doi.org/10.1109/..

3

Simulating Realistic Speech Overlaps Improves Multi-Talker ..:

In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Yang, Muqiao ; Kanda, Naoyuki ; Wang, Xiaofei... - p. 1-5 , 2023

Link: https://doi.org/10.1109/..

4

VarArray Meets t-SOT: Advancing the State of the Art of Str..:

In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Kanda, Naoyuki ; Wu, Jian ; Wang, Xiaofei... - p. 1-5 , 2023

Link: https://doi.org/10.1109/..

5

Speech Separation with Large-Scale Self-Supervised Learning:

In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Chen, Zhuo ; Kanda, Naoyuki ; Wu, Jian... - p. 1-5 , 2023

Link: https://doi.org/10.1109/..

6

Target Speaker Voice Activity Detection with Transformers a..:

In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Wang, Dongmei ; Xiao, Xiong ; Kanda, Naoyuki... - p. 1-5 , 2023

Link: https://doi.org/10.1109/..

7

WavLM: Large-Scale Self-Supervised Pre-Training for Full St..:

Chen, Sanyuan ; Wang, Chengyi ; Chen, Zhengyang...
IEEE Journal of Selected Topics in Signal Processing. 16 (2022) 6 - p. 1505-1518 , 2022

Link: https://doi.org/10.1109/..

8

A review of speaker diarization: Recent advances with deep ..:

Park, Tae Jin ; Kanda, Naoyuki ; Dimitriadis, Dimitrios...
Computer Speech & Language. 72 (2022) - p. 101317 , 2022

Link: https://doi.org/10.1016/..

9

Transcribe-to-Diarize: Neural Speaker Diarization for Unlim..:

In: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Kanda, Naoyuki ; Xiao, Xiong ; Gaur, Yashesh... - p. 8082-8086 , 2022

Link: https://doi.org/10.1109/..

10

VarArray: Array-Geometry-Agnostic Continuous Speech Separat..:

In: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Yoshioka, Takuya ; Wang, Xiaofei ; Wang, Dongmei... - p. 6027-6031 , 2022

Link: https://doi.org/10.1109/..

11

All-Neural Beamformer for Continuous Speech Separation:

In: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Zhang, Zhuohuang ; Yoshioka, Takuya ; Kanda, Naoyuki... - p. 6032-6036 , 2022

Link: https://doi.org/10.1109/..

12

Microsoft Speaker Diarization System for the VoxCeleb Speak..:

In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Xiao, Xiong ; Kanda, Naoyuki ; Chen, Zhuo... - p. 5824-5828 , 2021

Link: https://doi.org/10.1109/..

13

Hypothesis Stitcher for End-to-End Speaker-Attributed ASR o..:

In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Chang, Xuankai ; Kanda, Naoyuki ; Gaur, Yashesh... - p. 6763-6767 , 2021

Link: https://doi.org/10.1109/..

14

Efficient growth and characterization of one-dimensional tr..:

Kanda, Naoyuki ; Nakanishi, Yusuke ; Liu, Dan...
Nanoscale. 12 (2020) 33 - p. 17185-17190 , 2020

Link: https://doi.org/10.1039/..

15

End-to-End Neural Speaker Diarization with Self-Attention:

In: 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)

Fujita, Yusuke ; Kanda, Naoyuki ; Horiguchi, Shota... - p. 296-303 , 2019

Link: https://doi.org/10.1109/..