Construction of Large Scale Isolated Word Speech Corpus in Bangla

α
Md. Farukuzzaman Khan
Md. Farukuzzaman Khan
σ
M. Abdus Sobhan
M. Abdus Sobhan
α Islamic University Islamic University

Send Message

To: Author

Construction of Large Scale Isolated Word Speech Corpus in Bangla

Article Fingerprint

ReserarchID

82IH6

Construction of Large Scale Isolated Word Speech Corpus in Bangla Banner

AI TAKEAWAY

Connecting with the Eternal Ground
  • English
  • Afrikaans
  • Albanian
  • Amharic
  • Arabic
  • Armenian
  • Azerbaijani
  • Basque
  • Belarusian
  • Bengali
  • Bosnian
  • Bulgarian
  • Catalan
  • Cebuano
  • Chichewa
  • Chinese (Simplified)
  • Chinese (Traditional)
  • Corsican
  • Croatian
  • Czech
  • Danish
  • Dutch
  • Esperanto
  • Estonian
  • Filipino
  • Finnish
  • French
  • Frisian
  • Galician
  • Georgian
  • German
  • Greek
  • Gujarati
  • Haitian Creole
  • Hausa
  • Hawaiian
  • Hebrew
  • Hindi
  • Hmong
  • Hungarian
  • Icelandic
  • Igbo
  • Indonesian
  • Irish
  • Italian
  • Japanese
  • Javanese
  • Kannada
  • Kazakh
  • Khmer
  • Korean
  • Kurdish (Kurmanji)
  • Kyrgyz
  • Lao
  • Latin
  • Latvian
  • Lithuanian
  • Luxembourgish
  • Macedonian
  • Malagasy
  • Malay
  • Malayalam
  • Maltese
  • Maori
  • Marathi
  • Mongolian
  • Myanmar (Burmese)
  • Nepali
  • Norwegian
  • Pashto
  • Persian
  • Polish
  • Portuguese
  • Punjabi
  • Romanian
  • Russian
  • Samoan
  • Scots Gaelic
  • Serbian
  • Sesotho
  • Shona
  • Sindhi
  • Sinhala
  • Slovak
  • Slovenian
  • Somali
  • Spanish
  • Sundanese
  • Swahili
  • Swedish
  • Tajik
  • Tamil
  • Telugu
  • Thai
  • Turkish
  • Ukrainian
  • Urdu
  • Uzbek
  • Vietnamese
  • Welsh
  • Xhosa
  • Yiddish
  • Yoruba
  • Zulu

Abstract

A new speech corpus of isolated words in Bangla language has recorded including high frequent words from a text corpus BdNC01. It has designed specifically for various research activities related to speaker-independent Bangla speech recognition. The database consists of speech of 100 speakers, each of them speaking 1081 words. Another 50 new speakers were employed to speak all the list of words to construct a test database. Every utterance was repeated five times in different days to avoid time variation of speaker property. The total 375 hours of original recording makes the corpora largest in its type, size and language domain. This paper describes the motivation for the corpora and the processes undertaken in its construction. The paper concludes with the usability of the corpus.

References

17 Cites in Article
  1. Gopala Krishna Anumanchipalli,Kishore Prahallad,Alan Black (2011). Festvox: Tools for Creation and Analyses of Large Speech Corpora.
  2. Baris Bozkurt,Ozlem Ozturk,Thierry Dutoit (2003). Text design for TTS speech corpus building using a modified greedy selection.
  3. K Yeshwant,Ronald Muthusamy,Beatrice Cole (1992). Oshika The Ogi Multi-Language Telephone Speech Corpus.
  4. John Godfrey,Edward Holliman (1997). Switchboard-1 Release 2, Linguistic Data Consortium.
  5. John Garofolo,Lori Lamel,William Fisher,Jonathan Fiscus,David Pallett,Nancy Dahlgren (1993). DARPA TIMIT:.
  6. R,Gary Leonard,George Doddington,Tidigits (1993). Linguistic Data Consortium.
  7. (2011). PHP.
  8. Yi Hu,Philipos Loizou (2007). Subjective comparison and evaluation of speech enhancement algorithms.
  9. Wikipedia (2017). Unknown Title.
  10. S Firoj Alam,Dil Habib,Afroza Sultana,Mumit Khan (2010). 6th Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU 2018).
  11. Md,Md Farukuzzaman Khan,Md Islam,Rahman (2008). Construction and Analysis of Large-Scale Bangla Corpus for Bangla Speech Recognition.
  12. Tony Robinson,Jeroen Fransen,D Pye,J Foote,S Renals (1995). WSJCAMO: a British English speech corpus for large vocabulary continuous speech recognition.
  13. Lean-Lac Gauvain,Lori Lamel (2003). Large Vocabulary Speech Recognition Based on Statistical Methods.
  14. David Pallett,Jonathan Fiscus,William Fisher,John Garofolo,Bruce Lund,Mark Przybocki (1994). 1993 benchmark tests for the ARPA spoken language program.
  15. Dafydd Gibbon,Roger Moore,Richard Winski (1998). Spoken Language System and Corpus Design.
  16. P Sherry,Casali (1988). The Effects of Recognition Accuracy and Vocabulary Size of A Speech Recognition System on Task Performance and User Acceptance.
  17. Yong-Jin Liu,Xiao-Dong Shi (2018). Research and implementation of parallel speech recognition based on HTK.

Funding

No external funding was declared for this work.

Conflict of Interest

The authors declare no conflict of interest.

Ethical Approval

No ethics committee approval was required for this article type.

Data Availability

Not applicable for this article.

How to Cite This Article

Md. Farukuzzaman Khan. 2018. \u201cConstruction of Large Scale Isolated Word Speech Corpus in Bangla\u201d. Global Journal of Computer Science and Technology - G: Interdisciplinary GJCST-G Volume 18 (GJCST Volume 18 Issue G2): .

Download Citation

Issue Cover
GJCST Volume 18 Issue G2
Pg. 21- 26
Journal Specifications

Crossref Journal DOI 10.17406/gjcst

Print ISSN 0975-4350

e-ISSN 0975-4172

Keywords
Classification
GJCST-G Classification: H.2.0
Version of record

v1.2

Issue date

May 14, 2018

Language
en
Experiance in AR

Explore published articles in an immersive Augmented Reality environment. Our platform converts research papers into interactive 3D books, allowing readers to view and interact with content using AR and VR compatible devices.

Read in 3D

Your published article is automatically converted into a realistic 3D book. Flip through pages and read research papers in a more engaging and interactive format.

Article Matrices
Total Views: 5983
Total Downloads: 1626
2026 Trends
Related Research

Published Article

A new speech corpus of isolated words in Bangla language has recorded including high frequent words from a text corpus BdNC01. It has designed specifically for various research activities related to speaker-independent Bangla speech recognition. The database consists of speech of 100 speakers, each of them speaking 1081 words. Another 50 new speakers were employed to speak all the list of words to construct a test database. Every utterance was repeated five times in different days to avoid time variation of speaker property. The total 375 hours of original recording makes the corpora largest in its type, size and language domain. This paper describes the motivation for the corpora and the processes undertaken in its construction. The paper concludes with the usability of the corpus.

Our website is actively being updated, and changes may occur frequently. Please clear your browser cache if needed. For feedback or error reporting, please email [email protected]

Request Access

Please fill out the form below to request access to this research paper. Your request will be reviewed by the editorial or author team.
X

Quote and Order Details

Contact Person

Invoice Address

Notes or Comments

This is the heading

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Ut elit tellus, luctus nec ullamcorper mattis, pulvinar dapibus leo.

High-quality academic research articles on global topics and journals.

Construction of Large Scale Isolated Word Speech Corpus in Bangla

Md. Farukuzzaman Khan
Md. Farukuzzaman Khan Islamic University
M. Abdus Sobhan
M. Abdus Sobhan

Research Journals