Construction of Large Scale Isolated Word Speech Corpus in Bangla

Md. Farukuzzaman Khan

M. Abdus Sobhan

Islamic University

ResearchID

82IH6

Abstract

A new speech corpus of isolated Bangla words has been recorded, covering high-frequency words drawn from the text corpus BdNC01. It was designed specifically for research on speaker-independent Bangla speech recognition. The database consists of speech from 100 speakers, each speaking 1081 words. Another 50 new speakers were recruited to speak the full word list to construct a test database. Every utterance was repeated five times on different days to reduce the effect of day-to-day variation in speaker characteristics. With a total of 375 hours of original recordings, the corpus is the largest of its type and size in its language domain. This paper describes the motivation for the corpus and the processes undertaken in its construction, and concludes with a discussion of the usability of the corpus.
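The figures quoted in the abstract can be cross-checked with a short calculation. The sketch below is illustrative only and rests on two assumptions that the abstract does not state outright: that the five repetitions apply to the 50 test speakers as well as the 100 training speakers, and that the 375 hours cover both sets. The variable and function names are hypothetical.

```python
# Figures taken from the abstract.
WORDS_PER_SPEAKER = 1081
TRAIN_SPEAKERS = 100
TEST_SPEAKERS = 50
REPETITIONS = 5
TOTAL_HOURS = 375


def utterance_count(speakers, words=WORDS_PER_SPEAKER, repetitions=REPETITIONS):
    """Number of recorded utterances for a group of speakers."""
    return speakers * words * repetitions


train_utterances = utterance_count(TRAIN_SPEAKERS)
# Assumption: test speakers also repeated every word five times.
test_utterances = utterance_count(TEST_SPEAKERS)
total_utterances = train_utterances + test_utterances

# Assumption: the 375 hours span the training and test recordings together.
avg_seconds = TOTAL_HOURS * 3600 / total_utterances

print(f"training utterances: {train_utterances}")
print(f"test utterances:     {test_utterances}")
print(f"average recording time per utterance: {avg_seconds:.2f} s")
```

Under these assumptions the corpus holds 540,500 training and 270,250 test utterances, and the 375 hours work out to roughly 1.7 seconds of recording per utterance, which is consistent with isolated-word recordings that include some leading and trailing silence.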

Funding

No external funding was declared for this work.

Conflict of Interest

The authors declare no conflict of interest.

Ethical Approval

No ethics committee approval was required for this article type.

Data Availability

Not applicable for this article.

How to Cite This Article

Md. Farukuzzaman Khan. 2018. "Construction of Large Scale Isolated Word Speech Corpus in Bangla". Global Journal of Computer Science and Technology - G: Interdisciplinary GJCST-G Volume 18 (GJCST Volume 18 Issue G2): 21-26.

GJCST Volume 18 Issue G2
Pg. 21-26

Journal Specifications

Crossref Journal DOI 10.17406/gjcst

Print ISSN 0975-4350

e-ISSN 0975-4172

Classification

GJCST-G Classification: H.2.0

Submission Received December 16, 2017
Peer Review Double Blind
Accepted January 5, 2018
Published January 15, 2018

Version of record

v1.2

Issue date

May 14, 2018

Country: Bangladesh
