Comparison of ICA Methods for the Recognition of Corrupted Korean Speech

Comparison of ICA Methods for the Recognition of Corrupted Korean Speech
Comparison of ICA Methods for the Recognition of Corrupted Korean Speech

ㆍ 저자명: 김선일,Kim. Seon-Il
ㆍ 간행물명: 電子工學會論文誌. Journal of the institute of electronics engineers of Korea. IE. 산업전자
ㆍ 권/호정보: 2008년|45권 3호|pp.20-26 (7 pages)
ㆍ 발행정보: 대한전자공학회
ㆍ 파일정보: 정기간행물|ENG|
PDF텍스트
ㆍ 주제분야: 기타

이 논문은 한국과학기술정보연구원과 논문 연계를 통해 무료로 제공되는 원문입니다.

서지반출

영문초록

두 가지 Independent Component Analysis(ICA) 알고리즘을 적용하여 자동차 엔진 소음과 섞인 음성 신호의 인식을 시도하였다. 이를 이용하여 추정한 신호를 HMM을 이용하여 인식하였고 이 신호의 인식률을 소음이 섞이기 전의 음성 신호의 인식률과 비교하였다. 음성 신호를 추정하는데 두 가지 서로 다른 ICA를 사용하였으며 그 중의 하나는 negentropy를 최대화하는 FastICA 알고리즘이며 다른 하나는 출력 신호 사이의 독립성을 최대화하여서 입력과 출력 사이의 mutual information을 최대화하는 information-maximization approach 이다. 남성 앵커가 진행한 한국어 뉴스 문장에 대한 단어 인식률은 87.85%이며 다양한 신호 대 잡음비를 갖도록 소음을 섞어서 추정을 한 후 인식을 시도한 결과 FastICA를 이용해 추정한 음성 신호에 대한 인식률은 1.65%, information-maximization을 이용해 추정한 음성 신호에 대한 인식률은 2.02% 인식률 저하가 나타났다. 따라서 어느 방법을 적용하든지 의미 있는 차이가 없음을 확인하였다.

기타언어초록

Two independent component analysis(ICA) algorithms were applied for the recognition of speech signals corrupted by a car engine noise. Speech recognition was performed by hidden markov model(HMM) for the estimated signals and recognition rates were compared with those of orginal speech signals which are not corrupted. Two different ICA methods were applied for the estimation of speech signals, one of which is FastICA algorithm that maximizes negentropy, the other is information-maximization approach that maximizes the mutual information between inputs and outputs to give maximum independence among outputs. Word recognition rate for the Korean news sentences spoken by a male anchor is 87.85%, while there is 1.65% drop of performance on the average for the estimated speech signals by FastICA and 2.02% by information-maximization for the various signal to noise ratio(SNR). There is little difference between the methods.

키워드

ICA HMM Negentropy Mutual Information Maximum Independence Car Engine Sound Speech Recognition

다운URL