Skip to main content
Till KTH:s startsida Till KTH:s startsida

Publications by Shivam Mehta

Refereegranskade

Konferensbidrag

[3]
A. Deichler et al., "Difusion-Based Co-Speech Gesture Generation Using Joint Text and Audio Representation," in PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2023, 2023, pp. 755-762.
[4]
S. Mehta et al., "OverFlow : Putting flows on top of neural transducers for better TTS," in Interspeech 2023, 2023, pp. 4279-4283.
[5]
H. Lameris et al., "Prosody-Controllable Spontaneous TTS with Neural HMMs," in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023.
[6]
S. Mehta et al., "Neural HMMs are all you need (for high-quality attention-free TTS)," in 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, pp. 7457-7461.
[7]
B. Moell et al., "Speech Data Augmentation for Improving Phoneme Transcriptions of Aphasic Speech Using Wav2Vec 2.0 for the PSST Challenge," in The RaPID4 Workshop : Resources and ProcessIng of linguistic, para-linguistic and extra-linguistic Data from people with various forms of cognitive/psychiatric/developmental impairments, 2022, pp. 62-70.

Icke refereegranskade

Konferensbidrag

[8]
H. Lameris et al., "Spontaneous Neural HMM TTS with Prosodic Feature Modification," in Proceedings of Fonetik 2022, 2022.
Senaste synkning med DiVA:
2024-05-12 02:02:30