Building Acoustic and Language Model for Continuous Speech Recognition in Bahasa Indonesia

Vincent Elbert Budiman; Andreas Widjaja

doi:10.28932/jutisi.v6i2.2684

PDF (English)

Diterbitkan: Aug 10, 2020

DOI: https://doi.org/10.28932/jutisi.v6i2.2684

Vincent Elbert Budiman

Maranatha Christian University

Andreas Widjaja

Maranatha Christian University

Abstrak

Here a development of an Acoustic and Language Model is presented. Low Word Error Rate is an early good sign of a good Language and Acoustic Model. Although there are still parameters other than Words Error Rate, our work focused on building Bahasa Indonesia with approximately 2000 common words and achieved the minimum threshold of 25% Word Error Rate. There were several experiments consist of different cases, training data, and testing data with Word Error Rate and Testing Ratio as the main comparison. The language and acoustic model were built using Sphinx4 from Carnegie Mellon University using Hidden Markov Model for the acoustic model and ARPA Model for the language model. The models configurations, which are Beam Width and Force Alignment, directly correlates with Word Error Rate. The configurations were set to 1e-80 for Beam Width and 1e-60 for Force Alignment to prevent underfitting or overfitting of the acoustic model. The goals of this research are to build continuous speech recognition in Bahasa Indonesia which has low Word Error Rate and to determine the optimum numbers of training and testing data which minimize the Word Error Rate.

Unduhan

Data unduhan belum tersedia.

Cara Mengutip

[1]

V. E. Budiman dan A. Widjaja, “Building Acoustic and Language Model for Continuous Speech Recognition in Bahasa Indonesia”, JuTISI, vol. 6, no. 2, Agu 2020.

Terbitan

Vol 6 No 2 (2020): JuTISI

Bagian

Articles

This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (https://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial used, distribution and reproduction in any medium.

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Artikel paling banyak dibaca berdasarkan penulis yang sama

Kristiawan Kristiawan, Andreas Widjaja, Perbandingan Algoritma Machine Learning dalam Menilai Sebuah Lokasi Toko Ritel , Jurnal Teknik Informatika dan Sistem Informasi: Vol 7 No 1 (2021): JuTISI
Ariel Elbert Budiman, Andreas Widjaja, Analisis Pengaruh Teks Preprocessing Terhadap Deteksi Plagiarisme Pada Dokumen Tugas Akhir , Jurnal Teknik Informatika dan Sistem Informasi: Vol 6 No 3 (2020): JuTISI
Erik Dwi Anggara, Andreas Widjaja, Bernard Renaldy Suteja, Prediksi Kinerja Pegawai sebagai Rekomendasi Kenaikan Golongan dengan Metode Decision Tree dan Regresi Logistik , Jurnal Teknik Informatika dan Sistem Informasi: Vol 8 No 1 (2022): JuTISI
Feliks Victor Parningotan Samosir, Loudry Palmarums Mustamu, Erik Dwi Anggara, Albertus Indarko Wiyogo, Andreas Widjaja, Exploratory Data Analysis terhadap Kepadatan Penumpang Kereta Rel Listrik , Jurnal Teknik Informatika dan Sistem Informasi: Vol 7 No 2 (2021): JuTISI
Joseph Sanjaya, Erick Renata, Vincent Elbert Budiman, Francis Anderson, Mewati Ayub, Prediksi Kelalaian Pinjaman Bank Menggunakan Random Forest dan Adaptive Boosting , Jurnal Teknik Informatika dan Sistem Informasi: Vol 6 No 1 (2020): JuTISI
Kristiawan Kristiawan, Deon Diamanta Somali, Try Atmaja Linggan jaya, Andreas Widjaja, Deteksi Buah Menggunakan Supervised Learning dan Ekstraksi Fitur untuk Pemeriksa Harga , Jurnal Teknik Informatika dan Sistem Informasi: Vol 6 No 3 (2020): JuTISI
Sendy Ferdian, Tjatur Kandaga, Andreas Widjaja, Hapnes Toba, Ronaldo Joshua, Julio Narabel, Continuous Integration and Continuous Delivery Platform Development of Software Engineering and Software Project Management in Higher Education , Jurnal Teknik Informatika dan Sistem Informasi: Vol 7 No 1 (2021): JuTISI
Avinash Avinash, Andreas Widjaja, Oscar Karnalim, Analisis Perbandingan Algoritma Machine Learning untuk Forecasting Persediaan Produk Barang Pokok , Jurnal Teknik Informatika dan Sistem Informasi: Vol 10 No 2 (2024): JuTISI
Laras Ervintyana, Andreas Widjaja, Swat Lie Liliawati, Analisis Deret Waktu dari Produk yang Terjual Menggunakan Beberapa Teknik Populer , Jurnal Teknik Informatika dan Sistem Informasi: Vol 9 No 1 (2023): JuTISI (in progress)
Oktavianus Yopi Wardana, Mewati Ayub, Andreas Widjaja , Perbandingan Akurasi Model Pembelajaran Mesin untuk Prediksi Seleksi Masuk Perguruan Tinggi Negeri , Jurnal Teknik Informatika dan Sistem Informasi: Vol 9 No 1 (2023): JuTISI (in progress)

1 2 > >>

Bilah Samping Artikel

Isi Artikel Utama

Abstrak

Unduhan

Rincian Artikel

Artikel paling banyak dibaca berdasarkan penulis yang sama