E-LIB Suche - Ergebnisse für: TODA, Tomoki

1

Dual-Channel Target Speaker Extraction Based on Conditional..:

Wang, Rui ; Li, Li ; Toda, Tomoki
IEEE/ACM Transactions on Audio, Speech, and Language Processing. 32 (2024) - p. 1968-1979 , 2024

Link: https://doi.org/10.1109/..

2

Electrolaryngeal Speech Intelligibility Enhancement through..:

, In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),

Violeta, Lester Phillip ; Huang, Wen-Chin ; Ma, Ding... - p. 10961-10965 , 2024

Link: https://doi.org/10.1109/..

3

An Investigation of Fundamental Frequency Pattern Predictio..:

Eshghi, Mohammad ; Toda, Tomoki
IEEE Access. 12 (2024) - p. 50137-50153 , 2024

Link: https://doi.org/10.1109/..

4

Unequally Spaced Sound Field Interpolation for Rotation-Rob..:

Luan, Shuming ; Wakabayashi, Yukoh ; Toda, Tomoki
IEEE/ACM Transactions on Audio, Speech, and Language Processing. 32 (2024) - p. 3185-3199 , 2024

Link: https://doi.org/10.1109/..

5

MF-AED-AEC: Speech Emotion Recognition by Leveraging Multim..:

, In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),

He, Jiajun ; Shi, Xiaohan ; Li, Xingfeng. - p. 11066-11070 , 2024

Link: https://doi.org/10.1109/..

6

Fast Neural Speech Waveform Generative Models With Fully-Co..:

Yamashita, Haruki ; Okamoto, Takuma ; Takashima, Ryoichi...
IEEE Access. 12 (2024) - p. 31409-31421 , 2024

Link: https://doi.org/10.1109/..

7

Audio Difference Learning for Audio Captioning:

, In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),

Komatsu, Tatsuya ; Fujita, Yusuke ; Takeda, Kazuya. - p. 1456-1460 , 2024

Link: https://doi.org/10.1109/..

8

A review on subjective and objective evaluation of syntheti..:

Cooper, Erica ; Huang, Wen-Chin ; Tsao, Yu...
Acoustical Science and Technology. 45 (2024) 4 - p. 161-183 , 2024

Link: https://doi.org/10.1250/..

9

FIRNet: Fundamental Frequency Controllable Fast Neural Voco..:

, In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),

Ohtani, Yamato ; Okamoto, Takuma ; Toda, Tomoki. - p. 10871-10875 , 2024

Link: https://doi.org/10.1109/..

10

Pretraining and Adaptation Techniques for Electrolaryngeal ..:

Violeta, Lester Phillip ; Ma, Ding ; Huang, Wen-Chin.
IEEE/ACM Transactions on Audio, Speech, and Language Processing. 32 (2024) - p. 2777-2789 , 2024

Link: https://doi.org/10.1109/..

11

Convnext-TTS And Convnext-VC: Convnext-Based Fast End-To-En..:

, In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),

Okamoto, Takuma ; Ohtani, Yamato ; Toda, Tomoki. - p. 12456-12460 , 2024

Link: https://doi.org/10.1109/..

12

Noisy-to-Noisy Voice Conversion Under Variations of Noisy C..:

Xie, Chao ; Toda, Tomoki
IEEE/ACM Transactions on Audio, Speech, and Language Processing. 31 (2023) - p. 3871-3882 , 2023

Link: https://doi.org/10.1109/..

13

Text-To-Speech Synthesis Based on Latent Variable Conversio..:

, In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),

Yasuda, Yusuke ; Toda, Tomoki - p. 1-5 , 2023

Link: https://doi.org/10.1109/..

14

WaveNeXt: ConvNeXt-Based Fast Neural Vocoder Without ISTFT ..:

, In: 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU),

Okamoto, Takuma ; Yamashita, Haruki ; Ohtani, Yamato.. - p. 1-8 , 2023

Link: https://doi.org/10.1109/..

15

Semi-supervised Multimodal Emotion Recognition with Consens..:

, In: Proceedings of the 1st International Workshop on Multimodal and Responsible Affective Computing,

Tian, Jingguang ; Hu, Desheng ; Shi, Xiaohan... - p. 67-73 , 2023

Link: https://dl.acm.org/doi/1..