Multi-language conversational telephone speech 2011. Central Asian / Linguistic Data Consortium. | Kütüphane.osmanlica.com

Multi-language conversational telephone speech 2011. Central Asian / Linguistic Data Consortium.

İsim Multi-language conversational telephone speech 2011. Central Asian / Linguistic Data Consortium.
Basım Tarihi: 2018
Konu Deori language > Spoken Deori > Data processing. Persian language > Spoken Persian > Data processing. Pushto language > Spoken Pushto > Data processing. Deori language > Discourse analysis. Persian language > Discourse analysis. Pushto language > Discourse analysis. Conversation > Data processing. Automatic speech recognition. Computational linguistics. Audio data mining. Corpora (Linguistics)
Tür Diğer
Dil fas,pus
Dijital Evet
Yazma Hayır
Fiziksel Boyutlar 1 online resource
Kütüphane: Yale Üniversitesi
Demirbaş Numarası 1585638331
Kayıt Numarası 99155737183408651
Tarih 2018
Notlar Authors: Karen Jones, David Graff, Kevin Walker, Stephanie Strassel. Data source: telephone conversations. Applications: language identification. LDC number: LDC2018S03. In Dari, Persian, and Pushto. Title from resource home page (LDC website, viewed September 23, 2020).
Örnek Metin "Multi-Language Conversational Telephone Speech 2011 -- Central Asian was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 37 hours of telephone speech in three distinct language varieties of Central Asia: Dari, Farsi and Pashto. The data were collected primarily to support research and technology evaluation in automatic language identification, and portions of these telephone calls were used in the NIST 2011 Language Recognition Evaluation (LRE). LRE 2011 focused on language pair discrimination for 24 languages/dialects, some of which could be considered mutually intelligible or closely related. Participants were recruited by native speakers who contacted acquaintances in their social network. Those native speakers made one call, up to 15 minutes, to each acquaintance. The data was collected using LDC's telephone collection infrastructure, comprised of three computer telephony systems. Human auditors labeled calls for callee gender, dialect type and noise. Demographic information about the participants was not collected. All audio data are presented in FLAC-compressed MS-WAV (RIFF) file format (*.flac); when uncompressed, each file is 2 channels, recorded at 8000 samples/second with samples stored as 16-bit signed integers, representing a lossless conversion from the original mu-law sample data as captured digitally from the public telephone network. The following table summarizes the total number of calls, total number of hours of recorded audio, and the total size of compressed data" --LDC online catalog.
Kataloğa Eklendi September 23, 2020
Diğer Sorumlular Jones, Karen, creator Linguistic Data Consortium, issuing body
Yerel Notlar Access is available to the Yale community.
Kaynağa git Yale Üniversitesi Yale University

Multi-language conversational telephone speech 2011. Central Asian / Linguistic Data Consortium.

Basım Tarihi 2018
Konu Deori language > Spoken Deori > Data processing. Persian language > Spoken Persian > Data processing. Pushto language > Spoken Pushto > Data processing. Deori language > Discourse analysis. Persian language > Discourse analysis. Pushto language > Discourse analysis. Conversation > Data processing. Automatic speech recognition. Computational linguistics. Audio data mining. Corpora (Linguistics)
Tür Diğer
Dil fas,pus
Dijital Evet
Yazma Hayır
Fiziksel Boyutlar 1 online resource
Kütüphane Yale Üniversitesi
Demirbaş Numarası 1585638331
Kayıt Numarası 99155737183408651
Tarih 2018
Notlar Authors: Karen Jones, David Graff, Kevin Walker, Stephanie Strassel. Data source: telephone conversations. Applications: language identification. LDC number: LDC2018S03. In Dari, Persian, and Pushto. Title from resource home page (LDC website, viewed September 23, 2020).
Örnek Metin "Multi-Language Conversational Telephone Speech 2011 -- Central Asian was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 37 hours of telephone speech in three distinct language varieties of Central Asia: Dari, Farsi and Pashto. The data were collected primarily to support research and technology evaluation in automatic language identification, and portions of these telephone calls were used in the NIST 2011 Language Recognition Evaluation (LRE). LRE 2011 focused on language pair discrimination for 24 languages/dialects, some of which could be considered mutually intelligible or closely related. Participants were recruited by native speakers who contacted acquaintances in their social network. Those native speakers made one call, up to 15 minutes, to each acquaintance. The data was collected using LDC's telephone collection infrastructure, comprised of three computer telephony systems. Human auditors labeled calls for callee gender, dialect type and noise. Demographic information about the participants was not collected. All audio data are presented in FLAC-compressed MS-WAV (RIFF) file format (*.flac); when uncompressed, each file is 2 channels, recorded at 8000 samples/second with samples stored as 16-bit signed integers, representing a lossless conversion from the original mu-law sample data as captured digitally from the public telephone network. The following table summarizes the total number of calls, total number of hours of recorded audio, and the total size of compressed data" --LDC online catalog.
Kataloğa Eklendi September 23, 2020
Diğer Sorumlular Jones, Karen, creator Linguistic Data Consortium, issuing body
Yerel Notlar Access is available to the Yale community.
Yale University
Yale Üniversitesi yönlendiriliyorsunuz...

Lütfen bekleyiniz.