Cross-lingual speaker adaptation for statistical speech synthesis using limited data | Kütüphane.osmanlica.com

Cross-lingual speaker adaptation for statistical speech synthesis using limited data

İsim Cross-lingual speaker adaptation for statistical speech synthesis using limited data
Yazar Sarfjoo, Seyyed Saeed, Demiroğlu, Cenk
Basım Tarihi: 2016
Basım Yeri - Interspeech
Konu Cross lingual speaker adaptation, Eigenvoice adaptation, Nearest-neighbor, Speaker adaptation, Statistical speech synthesis
Tür Belge
Dil İngilizce
Dijital Evet
Yazma Hayır
Kütüphane: Özyeğin Üniversitesi
Demirbaş Numarası 2308-457X
Kayıt Numarası 04054eaf-40fd-4992-a4e7-53910fe3588d
Lokasyon Electrical & Electronics Engineering
Tarih 2016
Örnek Metin Cross-lingual speaker adaptation with limited adaptation data has many applications such as use in speech-to-speech translation systems. Here, we focus on cross-lingual adaptation for statistical speech synthesis (SSS) systems using limited adaptation data. To that end, we propose two techniques exploiting a bilingual Turkish-English speech database that we collected. In one approach, speaker-specific state-mapping is proposed for cross-lingual adaptation which performed significantly better than the baseline state-mapping algorithm in adapting the excitation parameter both in objective and subjective tests. In the second approach, eigenvoice adaptation is done in the input language which is then used to estimate the eigenvoice weights in the output language using weighted linear regression. The second approach performed significantly better than the baseline system in adapting the spectral envelope parameters both in objective and subjective tests.
DOI 10.21437/Interspeech.2016-345
Kaynağa git Özyeğin Üniversitesi Özyeğin Üniversitesi
Özyeğin Üniversitesi Özyeğin Üniversitesi
Kaynağa git

Cross-lingual speaker adaptation for statistical speech synthesis using limited data

Yazar Sarfjoo, Seyyed Saeed, Demiroğlu, Cenk
Basım Tarihi 2016
Basım Yeri - Interspeech
Konu Cross lingual speaker adaptation, Eigenvoice adaptation, Nearest-neighbor, Speaker adaptation, Statistical speech synthesis
Tür Belge
Dil İngilizce
Dijital Evet
Yazma Hayır
Kütüphane Özyeğin Üniversitesi
Demirbaş Numarası 2308-457X
Kayıt Numarası 04054eaf-40fd-4992-a4e7-53910fe3588d
Lokasyon Electrical & Electronics Engineering
Tarih 2016
Örnek Metin Cross-lingual speaker adaptation with limited adaptation data has many applications such as use in speech-to-speech translation systems. Here, we focus on cross-lingual adaptation for statistical speech synthesis (SSS) systems using limited adaptation data. To that end, we propose two techniques exploiting a bilingual Turkish-English speech database that we collected. In one approach, speaker-specific state-mapping is proposed for cross-lingual adaptation which performed significantly better than the baseline state-mapping algorithm in adapting the excitation parameter both in objective and subjective tests. In the second approach, eigenvoice adaptation is done in the input language which is then used to estimate the eigenvoice weights in the output language using weighted linear regression. The second approach performed significantly better than the baseline system in adapting the spectral envelope parameters both in objective and subjective tests.
DOI 10.21437/Interspeech.2016-345
Özyeğin Üniversitesi
Özyeğin Üniversitesi yönlendiriliyorsunuz...

Lütfen bekleyiniz.