TODA, Tomoki
312  results:
Search for persons X
?
1

Dual-Channel Target Speaker Extraction Based on Conditional..:

Wang, Rui ; Li, Li ; Toda, Tomoki
IEEE/ACM Transactions on Audio, Speech, and Language Processing.  32 (2024)  - p. 1968-1979 , 2024
 
?
2

Electrolaryngeal Speech Intelligibility Enhancement through..:

, In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
Violeta, Lester Phillip ; Huang, Wen-Chin ; Ma, Ding... - p. 10961-10965 , 2024
 
?
4

Unequally Spaced Sound Field Interpolation for Rotation-Rob..:

Luan, Shuming ; Wakabayashi, Yukoh ; Toda, Tomoki
IEEE/ACM Transactions on Audio, Speech, and Language Processing.  32 (2024)  - p. 3185-3199 , 2024
 
?
5

MF-AED-AEC: Speech Emotion Recognition by Leveraging Multim..:

, In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
He, Jiajun ; Shi, Xiaohan ; Li, Xingfeng. - p. 11066-11070 , 2024
 
?
7

Audio Difference Learning for Audio Captioning:

, In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
Komatsu, Tatsuya ; Fujita, Yusuke ; Takeda, Kazuya. - p. 1456-1460 , 2024
 
?
8

A review on subjective and objective evaluation of syntheti..:

Cooper, Erica ; Huang, Wen-Chin ; Tsao, Yu...
Acoustical Science and Technology.  45 (2024)  4 - p. 161-183 , 2024
 
?
9

FIRNet: Fundamental Frequency Controllable Fast Neural Voco..:

, In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
Ohtani, Yamato ; Okamoto, Takuma ; Toda, Tomoki. - p. 10871-10875 , 2024
 
?
10

Pretraining and Adaptation Techniques for Electrolaryngeal ..:

Violeta, Lester Phillip ; Ma, Ding ; Huang, Wen-Chin.
IEEE/ACM Transactions on Audio, Speech, and Language Processing.  32 (2024)  - p. 2777-2789 , 2024
 
?
11

Convnext-TTS And Convnext-VC: Convnext-Based Fast End-To-En..:

, In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
Okamoto, Takuma ; Ohtani, Yamato ; Toda, Tomoki. - p. 12456-12460 , 2024
 
?
12

Noisy-to-Noisy Voice Conversion Under Variations of Noisy C..:

Xie, Chao ; Toda, Tomoki
IEEE/ACM Transactions on Audio, Speech, and Language Processing.  31 (2023)  - p. 3871-3882 , 2023
 
?
13

Text-To-Speech Synthesis Based on Latent Variable Conversio..:

, In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
Yasuda, Yusuke ; Toda, Tomoki - p. 1-5 , 2023
 
?
14

WaveNeXt: ConvNeXt-Based Fast Neural Vocoder Without ISTFT ..:

, In: 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU),
 
?
15

Semi-supervised Multimodal Emotion Recognition with Consens..:

, In: Proceedings of the 1st International Workshop on Multimodal and Responsible Affective Computing,
Tian, Jingguang ; Hu, Desheng ; Shi, Xiaohan... - p. 67-73 , 2023
 
1-15