201034021/、、。,。,[1]:、(TimeDelayEstimation,TDE)。,,、,。:(TimeDelayOfArrival,TDOA),。TDETDOA。2070TDOA,(GeneralizedCrossCorrelation,GCC)[2]、(LeastMeanSquare,LMS)[3]、[4]、[5]、[6-7]、(AcousticTransformFunctions-ratio,ATFs-ratio)[8]。,。2,1。ixi(n)=αis(n-τi)+vi(n)(1),αi(αi1),τii,vi(n)。··:1002-8684(2010)02-0042-05*,(,100084)【】,,。,3TDE,,。【】;;;【】TN643【】AResearchonTimeDelayEstimationMethodsinSourceLocationRONGXiao-zheng,LIUJia(DepartmentofElectronicEngineering,TsinghuaUniversity,Beijing100084,China)【Abstract】TDE(TimeDelayEstimation)isoneofthekeytechniquesinthearraysignalprocessing.Sincemicrophonesinarrayareusuallyplacedatdifferentplaces,TDEisusedtoestimatethetimedifferenceofsignalpropagationfromsourcetothespatiallyseparatedmicrophones.EachTDEalgorithmisintroducedbrieflytoshowitsadvantagesanddisadvantagesaccordingly.Andsomeexperimentsaremadetoshowthestablityandrobutnessofeachalgorithm.Finally,somefutureresearchdirectionsaboutTDEtechniquesarepointedout.【Keywords】timedelayofarrival;generalizedgrosscorrelation;glottalclosureinstant;acoustictransferfunctions-ratio*[](60776800);(863)(2006AA010101,2007AA04Z223、2008AA02Z414)。1αi,τivi(t)s(t)xi(t)ElementaryElectroacoustics輨輰讂20103402,。,,,xi(n)=hi(n)s(n-τi)+vi(n)=αis(n-τi)+∞p=1Σαips(n-τip)+vi(n)(2),αipτippi,hi(n)。,τipτi。,,,αipαi。,。33:、。、。,。LMS、。3.1(GeneralizedCrossCorrelation,GCC)TDE。[2]:,2Rxixj(τ)=E[xi(n)xj(n-τ)]≈αiαjRSS(τ-τij)+Rvivj(τ)(3),E[·],RSS,Rvivj2。,TDOA2。,,,,TDOA。,GCC,、,。2ΦXiXj(ω)=αiαjΦSS(ω)e-jωτij+ΦViVj(ω)(4)RGCC(τ)=∞-∞乙ψij(ω)ΦXiXj(ω)e-jωtdω(5),ψij(ω)。Knapp[2]6,PHAT,ML/HT,。GCC2。,GCC,。,,,。,GCC,(256)TDOA,。,,GCC。3.2,。(Cross-powerSpectrumPhase,CSP)[9],ψCSP(ω)=1ΦXiXj(ω)(6),GCC-PHAT[2]。TDOA,GCC,GCC。3.32,。,,[4]。,。,,,,。,*2GCCxi(n)FFTFFTψij(ω)IFFT(·)*|·|τijxj(n)×ElementaryElectroacousticS輨輱讂20103402ψP(ω)=[1-max(Emi,Emj)]γΦXiXj(ω)(7),Emiim,Emi,Emi,EmiEmi=εmi12πbmiami乙X(ω)2dω(8),ami,bmiεmim、。PHAT,(),,。3.4,(GlottalClosureInstant,GCI)[5]。,,,,,,,,,。,(LinearPredictiveCoding,LPC)GCI。,。3.5LMSLMS[5]3。xj(n)xi(n),。,1,0。4,,2。,。,LMS。GCC,LMS,LMS,,GCC。,LMS。,,GCC,,。,,,,。,,,TDE。3.6(EigenValueDecomposition,EVD)(GeneralizedEVD,GEVD)。5,,,,。,EVD[6],GEVD。EVD4。LMS,EVD2,2,LMS。,EVDLMS。GCC,EVD,。EVD:(1),;(2)GCC3,,。EVD,GEVD[7]EVD,,TDOA。GEVD,,。3LMSxi(n)xj(n)h(n)z-pe(n)LMSτij4EVDxi(n)xj(n)h2(n)e(n)LMSτijh1(n)ElementaryElectroacoustics輨輲讂201034023.7(AcousticTransferFunctionratio,ATFs-ratio)。,,。i1ATFAi(ω)=Hi(ω)H1(ω)(9),Hi(ω)i。Hi(ω)=αie-jωτi+Pip=1Σαipe-jωτip(10)Ai(ω)=αie-jωτiα1e-jωτ1ei(ω)(11),ei(ω)=1+Pip=1Σ(αipe-jωτip-αie-jωτi)1+P1p=1Σ(α1pe-jωτ1P-α1e-jωτ1)。,,ei(ω)1,TDOAAi(ω)。,,Ai(ω),、、。TDOA3:(1)ATFs-ratio,;(2),;(3)Ai(ω),ATFs-ratio,。,ATFs-ratioTDOA,。44m×5m×4m,(RT60),Image[10]。2[1.95m,1.0m,1.3m][2.05m,1.0m,1.3m]。30s、16kHz,16kHz、。,,TDOA,。TDOA2:(1):τ赞iTDOA1,。TDOA;(2)εRMSE=1NτNτi=1Σ(τ赞i-τ0)2姨(12),τ赞iTDOAi,τ0,NτTDOA。2,TDE3:、。GCC()、()ATFs-ratio()。2:(1):,SNR20dB-10dB(5dB),12。LPC,LPC。12,GCC,TDOA。LPCGCC,,LPC,LPC,,。,,GCCLPC,SNR/dB20151050-5-10GCC0.0810.1080.1640.2760.3880.5300.700LPC0000.0670.4110.5230.748ATFs-ratio0.1010.2450.3170.5060.6930.7440.5701TDOASNR/dB20151050-5-10GCC0.2910.3140.3800.4480.5260.6310.615LPC0.1010.2450.3170.5060.6930.7440.570ATFs-ratio0.2150.2230.2190.3020.4640.6380.6412TDOAElementaryElectroacousticS輨輳讂201034022。ATFs-ratio,(,),,GCCLPC。(2):,RT600600ms(100ms),34。34,GCC3,LPC,ATFs-ratio,ATFs-ratio。,ATFs-ratio。ATFs-ratioTDOAGCC,。GCCLPC,,。,TDE,。5TDOA、。GCC,。,。ATFs-ratio,2、,,,。,TDOA:TDOA。、TDE。[1]BRANDSTEINM,WARDD.MicrophoneArray[M].NewYork:Springer,2001:158-159.[2]KnappCH,CarterGC.Thegeneralizedcorrelationmethodforestimationoftimedelay[J].IEEETrans.onAcoustics,SpeechandSignalProcessing,1976,24(4):320-327.[3]YOUNDH,AHMEDN,CARTERGC.OnUsingtheLMSalgorithmfortimedelayestimation[J].IEEETrans.onAcoustics,SpeechandSignalProcessing,1982,30(5):798-801.[4]BRANDSTEINMS.Apitch-basedapproachtotime-delayestimationofreverberantspeech[C]//ProceedingsofIEEEWorkshoponApplicationsofSignalProcessingtoAudioandAcoustics.NewPaltz:IEEEPress,1997:4.[5]YEGNANARAYANAB,RAMANID.Processingofrever-berantspeechfortime-delayestimation[J].IEEETrans.onSpeechandAudioProcessing,2005,13(6):1110-1118.[6]BENESTYJ.Adaptiveeigenvaluedecompositionalgorithmforpassiveacousticsourcelocalization[J].AcousticSocietyofAmerica,2000,107(1):384-391.[7]DOCLOS.Multi-microphonenoisereductionanddereverberationtechinquesforspeechapplications[D].Leuven,Belgium:KatholiekeUniversiteitLeuven,2003.[8]DVORKINDTG,GANNOTS.Timedifferenceofarrivalestimationofspeechsourceinanoisyandreverberantenvironment[J].SignalProcessing,2005,85:177-204.[9]OMOLOGOM,SVAIZERP.AcousticsourcelocationinnoisyandreverberantenvironmentusingCSPanalysis[C]//ProceedingsofIEEEInternationalConferenceonAcoustics,Speech,andSignalProcessing.Atlanta:IEEEPress,1996:921-924.[10]LEHMANNE,JOHANSSONA,NORDHOLMS.Reverberation-timepredictionmethodforroomimpulseresponsessimulatedwiththeimage-sourcemodel[C]//ProceedingsoftheIEEEWorkshoponApplicationsofSignalProcessingtoAudioandAcoustics.NewPaltz:IEEEPress,2007:159-162.,,;,,,、、。[][]2009-11-21RT60/ms0100200300400500600GCC0.0040.0220.1180.2960.4190.5030.540LPC0.0080.0150