Postprocessing synthetic speech with a complex cepstrum vocoder for spoofing phase-based synthetic speech detectors | Kütüphane.osmanlica.com

Postprocessing synthetic speech with a complex cepstrum vocoder for spoofing phase-based synthetic speech detectors

İsim Postprocessing synthetic speech with a complex cepstrum vocoder for spoofing phase-based synthetic speech detectors
Yazar Demiroğlu, Cenk, Buyuk, O., Khodabakhsh, Ali, Maia, R.
Basım Tarihi: 2017-06
Basım Yeri - IEEE
Konu Spoofing, Speaker verification, Synthetic speech detection, Complex cepstrum, Speech synthesis, Voice conversion
Tür Süreli Yayın
Dil İngilizce
Dijital Evet
Yazma Hayır
Kütüphane: Özyeğin Üniversitesi
Demirbaş Numarası 1932-4553
Kayıt Numarası c1d66c1e-bdf9-42ed-9e49-7cdec5d5a96e
Lokasyon Electrical & Electronics Engineering
Tarih 2017-06
Notlar Due to copyright restrictions, the access to the full text of this article is only available via subscription.
Örnek Metin State-of-the-art speaker verification systems are vulnerable to spoofing attacks. To address the issue, high-performance synthetic speech detectors (SSDs) for existing spoofing methods have been proposed. Phase-based SSDs that exploit the fact that most of the parametric speech coders use minimum-phase filters are particularly successful when synthetic speech is generated with a parametric vocoder. Here, we propose a new attack strategy to spoof phase-based SSDs with the objective of increasing the security of voice verification systems by enabling the development of more generalized SSDs. As opposed to other parametric vocoders, the complex cepstrum approach uses mixed-phase filters, which makes it an ideal candidate for spoofing the phase-based SSDs. We propose using a complex cepstrum vocoder as a postprocessor to existing techniques to spoof the speaker verification system as well as the phase-based SSDs. Once synthetic speech is generated with a speech synthesis or a voice conversion technique, for each synthetic speech frame, a natural frame is selected from a training database using a spectral distance measure. Then, complex cepstrum parameters of the natural frame are used for resynthesizing the synthetic frame. In the proposed method, complex cepstrum-based resynthesis is used as a postprocessor. Hence, it can be used in tandem with any synthetic speech generator. Experimental results showed that the approach is successful at spoofing four phase-based SSDs across nine parametric attack algorithms. Moreover, performance at spoofing the speaker verification system did not substantially degrade compared to the case when no postprocessor is employed.
DOI 10.1109/JSTSP.2017.2673807
Cilt 11
Kaynağa git Özyeğin Üniversitesi Özyeğin Üniversitesi
Özyeğin Üniversitesi Özyeğin Üniversitesi
Kaynağa git

Postprocessing synthetic speech with a complex cepstrum vocoder for spoofing phase-based synthetic speech detectors

Yazar Demiroğlu, Cenk, Buyuk, O., Khodabakhsh, Ali, Maia, R.
Basım Tarihi 2017-06
Basım Yeri - IEEE
Konu Spoofing, Speaker verification, Synthetic speech detection, Complex cepstrum, Speech synthesis, Voice conversion
Tür Süreli Yayın
Dil İngilizce
Dijital Evet
Yazma Hayır
Kütüphane Özyeğin Üniversitesi
Demirbaş Numarası 1932-4553
Kayıt Numarası c1d66c1e-bdf9-42ed-9e49-7cdec5d5a96e
Lokasyon Electrical & Electronics Engineering
Tarih 2017-06
Notlar Due to copyright restrictions, the access to the full text of this article is only available via subscription.
Örnek Metin State-of-the-art speaker verification systems are vulnerable to spoofing attacks. To address the issue, high-performance synthetic speech detectors (SSDs) for existing spoofing methods have been proposed. Phase-based SSDs that exploit the fact that most of the parametric speech coders use minimum-phase filters are particularly successful when synthetic speech is generated with a parametric vocoder. Here, we propose a new attack strategy to spoof phase-based SSDs with the objective of increasing the security of voice verification systems by enabling the development of more generalized SSDs. As opposed to other parametric vocoders, the complex cepstrum approach uses mixed-phase filters, which makes it an ideal candidate for spoofing the phase-based SSDs. We propose using a complex cepstrum vocoder as a postprocessor to existing techniques to spoof the speaker verification system as well as the phase-based SSDs. Once synthetic speech is generated with a speech synthesis or a voice conversion technique, for each synthetic speech frame, a natural frame is selected from a training database using a spectral distance measure. Then, complex cepstrum parameters of the natural frame are used for resynthesizing the synthetic frame. In the proposed method, complex cepstrum-based resynthesis is used as a postprocessor. Hence, it can be used in tandem with any synthetic speech generator. Experimental results showed that the approach is successful at spoofing four phase-based SSDs across nine parametric attack algorithms. Moreover, performance at spoofing the speaker verification system did not substantially degrade compared to the case when no postprocessor is employed.
DOI 10.1109/JSTSP.2017.2673807
Cilt 11
Özyeğin Üniversitesi
Özyeğin Üniversitesi yönlendiriliyorsunuz...

Lütfen bekleyiniz.