语音识别综述+数字语音识别

整理文档很辛苦,赏杯茶钱您下走!

免费阅读已结束,点击下载阅读编辑剩下 ...

阅读已结束,您可以下载文档离线阅读编辑

资源描述

SpeechRecognition语音识别——ByTerrySpeechRecognitionSpeechrecognitionisahightechnologyofprocessingvoicesignalintocorrespondingtextsandcommandsbymachinerecognitionandunderstanding.Speechrecognitiontechnologyhasinvolvedsignalprocessing,patternrecognition,probabilitytheoryandinformationtheory,vocalmechanism,hearingmechanismandartificialintelligence.Speechrecognitiontechnologyismainlyconsistofthreemodule,includingfeatureextraction,patternmatchingtechnologyandmodeltraining.SpeechRecognitionTheHistoryofSpeechRecognitionDevelopment1959TenphonemerecognitionsystemAudrySystem,BellLabs20th,50slate60stoearly70sLPC,DTWVQ,HMMSphinxSystem,CarnegieMellonUniversity,ANN,HMM80s90sIBM,Apple,AT&TandNTTAHotAreainAI,MoreprocessingMethod,NowadaysSpeechRecognitionCategoryofmethod:IsolatedwordrecognitionConnectedwordrecognitionContinuousspeechRecognitionSpecificpersonrecognitionNon-specificpersonrecognitionSmallvocabularyMedianvocabularyLargevocabularyInfinitevocabularySpeechRecognitionMainlyMethods:TemplateMatchingDTW(DynamicTimeWarping)VQ(VectorQuantization)HMMDHMM(DiscreteHiddenMarkovModel)CHMM(ContinuousHiddenMarkovModel)SCHMM(Semi-ContinuousHiddenMarkovModel)ANN(ArtificialNeuralNet)SpeechRecognitionSignalPre-processingFramming-5msto50msEndpointdetection-detectthestartingpointandterminalpointSpeechEnhancement-inhibitnoiseandimprovespeechqualityICA-IndependentComponentAnalysisSpeechRecognitionFeatureExtractionLPC-LinearPredictioncoefficientLPCC-LinearPredictionCepstrumCoefficientMFCC-MelFrequencyCepstrumCoefficientCepstrum:njnjwenxeX)()(njnjwenxeX)()(njnjwenxeX)()(deeXmcjmjw|)(|ln21)(SpeechRecognitionSpeechRecognitionTemplateMatchingDTW(DynamicTimeWarping)VQ(VectorQuantization)HMMDHMM(DiscreteHiddenMarkovModel)CHMM(ContinuousHiddenMarkovModel)SCHMM(Semi-ContinuousHiddenMarkovModel)ANN(ArtificialNeuralNet)CRSIntroductionMatlabGUICRSIntroductionPeocedurePre-ProcessingFeatureExtractionDTW+VQCRSIntroductionPre-ProcessingPre-emphasisWindowing-Non-stationarysignalRectangleWindowHanningWindowHaimingWindow1()1HZuZ20.540.46cos()()10nwnNCRSIntroductionFeatureExtractionEndPointDetectionShort-timeenergyZerocrossingrate(DoubleGates)MFCC-BasedonAuditoryModelDFTDFT逆DFT信号频谱对数倒谱CRSIntroductionTemplateMatchingTemplatesetsselectingSingleOptimalSelectionMethodSFS(SequenceForwardSelecting)SBS(SequenceBackwardSelecting)GRNN(GeneralRegressionNeuralNetwork)Templatesubsets(ourown)ClassifyingaccordingtothesizeofframeA.20B.30C.ElseCRSIntroductionTemplateMatchingDTWAlgorithmNnnNnnjiCWWnynxdD11])),(),(([min)1()(,2,1)1()(,2,1,0)()1(:)(,1)1(:nwnwnwnwnwnwMNww连续条件边界条件CRSIntroductionTemplateMatchingDTWAlgorithm.)(,),()1()(,)1()(,1),(:)]2,(),1,(),,(),(min[],1[),1(的约束条件取值满足就是其中nwmnmngnwnwnwnwmngmnDmnDmngmnDmndmnDNnnNnnjiCWWnynxdD11])),(),(([minCRSIntroductionTemplateMatchingDTWAlgorithmDP(DynamicProgramming)123fori21(1.);1?(1,1):Re;2?(1,2):Re;(,)(,)min([1,2,3]);tondoforjtomdoDdijDjDijalMaxDjDijalMaxDijdijDDDendendCRSIntroductionClassicK-NNSorttheDistance-SequnceBySmalltolargeFindthefirstKdistanceelementsThebestmatch(result)isthenumberwiththelargestproportionintheKelementsOurOwn:WeightedK-NNCRSIntroductionVQTrainingCodeBook),(1)(1^TtittixxdTCDiitCx^),(minarg^ijCiyityxtdxij)(min)(iikCDCDCRSIntroductionRecognitionExperimentPerformance:70%~90%Themostrobustnumber:5Confusednumbers:0and6,2and8numberwiththeworstperformance:3Twowaveofnumber3CRSIntroductionThat’sallThanks!

1 / 20
下载文档,编辑使用

©2015-2020 m.777doc.com 三七文档.

备案号:鲁ICP备2024069028号-1 客服联系 QQ:2149211541

×
保存成功