Kanda, Naoyuki
453 results:
1

SpeechX: Neural Codec Language Model as a Versatile Speech ..:

Wang, Xiaofei ; Thakker, Manthan ; Chen, Zhuo...
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
 
2

Self-Supervised Learning with Bi-Label Masked Speech Predic..:

Huang, Zili ; Chen, Zhuo ; Kanda, Naoyuki...
In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), p. 1-5, 2023
 
3

Simulating Realistic Speech Overlaps Improves Multi-Talker ..:

Yang, Muqiao ; Kanda, Naoyuki ; Wang, Xiaofei...
In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), p. 1-5, 2023
 
4

VarArray Meets t-SOT: Advancing the State of the Art of Str..:

Kanda, Naoyuki ; Wu, Jian ; Wang, Xiaofei...
In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), p. 1-5, 2023
 
5

Speech Separation with Large-Scale Self-Supervised Learning:

Chen, Zhuo ; Kanda, Naoyuki ; Wu, Jian...
In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), p. 1-5, 2023
 
6

Target Speaker Voice Activity Detection with Transformers a..:

Wang, Dongmei ; Xiao, Xiong ; Kanda, Naoyuki...
In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), p. 1-5, 2023
 
7

WavLM: Large-Scale Self-Supervised Pre-Training for Full St..:

Chen, Sanyuan ; Wang, Chengyi ; Chen, Zhengyang...
IEEE Journal of Selected Topics in Signal Processing 16 (2022) 6, p. 1505-1518, 2022
 
9

Transcribe-to-Diarize: Neural Speaker Diarization for Unlim..:

Kanda, Naoyuki ; Xiao, Xiong ; Gaur, Yashesh...
In: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), p. 8082-8086, 2022
 
10

VarArray: Array-Geometry-Agnostic Continuous Speech Separat..:

Yoshioka, Takuya ; Wang, Xiaofei ; Wang, Dongmei...
In: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), p. 6027-6031, 2022
 
11

All-Neural Beamformer for Continuous Speech Separation:

Zhang, Zhuohuang ; Yoshioka, Takuya ; Kanda, Naoyuki...
In: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), p. 6032-6036, 2022
 
12

Microsoft Speaker Diarization System for the VoxCeleb Speak..:

Xiao, Xiong ; Kanda, Naoyuki ; Chen, Zhuo...
In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), p. 5824-5828, 2021
 
13

Hypothesis Stitcher for End-to-End Speaker-Attributed ASR o..:

Chang, Xuankai ; Kanda, Naoyuki ; Gaur, Yashesh...
In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), p. 6763-6767, 2021
 
15

End-to-End Neural Speaker Diarization with Self-Attention:

Fujita, Yusuke ; Kanda, Naoyuki ; Horiguchi, Shota...
In: 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), p. 296-303, 2019
 