A Study on Design and Implementation of Speech Recognition System Using ART2 Algorithm


ㆍ Authors: Kim. Joeng Hoon, Kim. Dong Han, Jang. Won Il, Lee. Sang Bae
ㆍ Journal: International journal of fuzzy logic and intelligent systems
ㆍ Volume/Issue: 2004 | Vol. 4, No. 2 | pp. 149-154 (6 pages)
ㆍ Publisher: Korean Institute of Intelligent Systems (한국지능시스템학회)
ㆍ File info: Periodical | ENG | PDF text
ㆍ Subject area: Other

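This paper is a full text provided free of charge through a paper-linking arrangement with the Korea Institute of Science and Technology Information (KISTI).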


Abstract

In this research, we chose speech recognition as the means of controlling an electric wheelchair by voice alone, and we used DTW (Dynamic Time Warping), a speaker-dependent method with a relatively high recognition rate among speech recognition techniques. Because the system must run in real time, however, it needs a small memory footprint and fast processing. We therefore introduced VQ (Vector Quantization), which is widely used as a compression algorithm in speaker-independent recognition, to secure fast recognition with little memory. We found, however, that the recognition rate decreased after VQ was introduced. To recover it, we applied the ART2 (Adaptive Resonance Theory 2) algorithm as a post-processing step and obtained an improvement of about 5% in the recognition rate. Using ART2 requires an error range: it is applied when, among the distances obtained from DTW, the second distance minus the first distance is 20 or more. With ART2 applied in this way, we obtained both fast processing and a high recognition rate. Moreover, because the system is mounted on a moving object, it has to be implemented as an embedded system. We therefore selected the TMS320C32 chip, which can carry out a large number of calculations relatively quickly. Because the data to be stored is speech, we used 128 KB of RAM and 64 KB of ROM to hold the large amount of data. For speech input, we used a 16-bit stereo audio codec, whose high resolution yields relatively accurate data.
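The DTW matching step and the distance-margin check mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`dtw_distance`, `rank_templates`, `margin_reaches_threshold`) are hypothetical, the Euclidean local cost is an assumption, and reading the "first" and "second" distances as the two smallest DTW distances is also an assumption. The VQ and ART2 stages are not shown.

```python
import math


def dtw_distance(a, b):
    """Classic DTW distance between two sequences of feature vectors.

    Assumption: the local cost is the Euclidean distance between frames;
    the abstract does not state which local cost was used.
    """
    n, m = len(a), len(b)
    # cost[i][j] = minimal accumulated cost aligning a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # repeat a frame of b
                cost[i][j - 1],      # repeat a frame of a
                cost[i - 1][j - 1],  # advance both sequences
            )
    return cost[n][m]


def rank_templates(utterance, templates):
    """Return (label, distance) pairs sorted by increasing DTW distance."""
    scored = [(label, dtw_distance(utterance, ref)) for label, ref in templates.items()]
    return sorted(scored, key=lambda pair: pair[1])


def margin_reaches_threshold(ranked, threshold=20.0):
    """True when (second distance - first distance) >= threshold.

    The value 20 comes from the abstract; treating it as a gap between
    the two best-ranked templates is an assumption of this sketch.
    """
    if len(ranked) < 2:
        return False
    return ranked[1][1] - ranked[0][1] >= threshold


# Toy one-dimensional "features", purely illustrative data
templates = {
    "go": [[0.0], [1.0], [2.0]],
    "stop": [[50.0], [50.0], [50.0]],
}
utterance = [[0.0], [1.0], [1.0], [2.0]]
ranked = rank_templates(utterance, templates)
apply_error_range = margin_reaches_threshold(ranked)
```

In this sketch `ranked[0]` holds the best-matching template, and `apply_error_range` records whether the 20-unit condition from the abstract is met for the toy data.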

Keywords

ART2, DSP (TMS320C32), DTW, Speech Recognition
