A small footprint hybrid statistical/unit selection text-to-speech synthesis system for agglutinative languages | Kütüphane.osmanlica.com

A small footprint hybrid statistical/unit selection text-to-speech synthesis system for agglutinative languages

İsim A small footprint hybrid statistical/unit selection text-to-speech synthesis system for agglutinative languages
Yazar Güner, Ekrem, Demiroğlu, Cenk
Basım Tarihi: 2012
Basım Yeri - IEEE
Konu Hidden Markov models, Natural language processing, Speech intelligibility, Speech synthesis, Statistical analysis
Tür Belge
Dil İngilizce
Dijital Evet
Yazma Hayır
Kütüphane: Özyeğin Üniversitesi
Demirbaş Numarası 978-1-4673-0044-5
Kayıt Numarası 519c7f1b-fd96-47cb-89c3-be7fdf3f926c
Lokasyon Electrical & Electronics Engineering
Tarih 2012
Notlar Due to copyright restrictions, the access to the full text of this article is only available via subscription.
Örnek Metin Despite its success, unit selection based text-to-speech synthesis (TTS) has has some disadvantages such as sudden discontinuities in speech that distract the listeners. The HMM-based TTS (HTS) approach has been increasingly getting more attention from the TTS research community. One of the advantage is the lack of spurious errors that are observed in the unit selection scheme. Another advantage of the HTS system is the small memory footprint requirement which makes it attractive for embedded devices. Here, we propose a novel hybrid statistical unit selection TTS system for agglutinative languages that aims at improving the quality of the baseline HTS system while keeping the memory footprint small. The intelligibility and quality scores of the baseline system are comparable to the MOS scores of English reported in the Blizzard Challenge tests. Listeners preferred the hybrid system over the baseline system in the A/B preference tests.
DOI 10.1109/ICASSP.2012.6288927
Kaynağa git Özyeğin Üniversitesi Özyeğin Üniversitesi
Özyeğin Üniversitesi Özyeğin Üniversitesi
Kaynağa git

A small footprint hybrid statistical/unit selection text-to-speech synthesis system for agglutinative languages

Yazar Güner, Ekrem, Demiroğlu, Cenk
Basım Tarihi 2012
Basım Yeri - IEEE
Konu Hidden Markov models, Natural language processing, Speech intelligibility, Speech synthesis, Statistical analysis
Tür Belge
Dil İngilizce
Dijital Evet
Yazma Hayır
Kütüphane Özyeğin Üniversitesi
Demirbaş Numarası 978-1-4673-0044-5
Kayıt Numarası 519c7f1b-fd96-47cb-89c3-be7fdf3f926c
Lokasyon Electrical & Electronics Engineering
Tarih 2012
Notlar Due to copyright restrictions, the access to the full text of this article is only available via subscription.
Örnek Metin Despite its success, unit selection based text-to-speech synthesis (TTS) has has some disadvantages such as sudden discontinuities in speech that distract the listeners. The HMM-based TTS (HTS) approach has been increasingly getting more attention from the TTS research community. One of the advantage is the lack of spurious errors that are observed in the unit selection scheme. Another advantage of the HTS system is the small memory footprint requirement which makes it attractive for embedded devices. Here, we propose a novel hybrid statistical unit selection TTS system for agglutinative languages that aims at improving the quality of the baseline HTS system while keeping the memory footprint small. The intelligibility and quality scores of the baseline system are comparable to the MOS scores of English reported in the Blizzard Challenge tests. Listeners preferred the hybrid system over the baseline system in the A/B preference tests.
DOI 10.1109/ICASSP.2012.6288927
Özyeğin Üniversitesi
Özyeğin Üniversitesi yönlendiriliyorsunuz...

Lütfen bekleyiniz.