DOI: 10.3724/SP.J.1146.2010.01273

Journal of Electronics & Information Technology (电子与信息学报) 2011/33:9 PP.2056-2061

Real-time Unambiguous Passive Direction Finding for Multiple Sound Sources with Widely Spaced Microphone Array

A multi-source time-delay estimation algorithm based on an iterative cross-spectrum weighted histogram is studied for real-time passive direction finding of multiple sound sources with a widely spaced microphone array. By exploiting the short-time spectral sparseness and orthogonality assumed of audio signals, together with the frequency dependence of the delay ambiguity period, the algorithm simultaneously obtains instantaneous maximum signal-to-noise ratio peaks at the true delays of concurrent sounds, without pronounced sidelobes caused by phase-difference wraparound ambiguity. This removes the restriction, common to most existing sparseness-based methods, that the microphone spacing be no greater than half the minimum signal wavelength, so array systems can combine a large aperture with low complexity. Simulation results verify the effectiveness of the technique.
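The role of the frequency-varying ambiguity period can be illustrated with a minimal sketch. It is not the authors' iterative algorithm, only a single-pass cross-spectrum weighted histogram for one microphone pair. The function name `delay_histogram`, its parameters, and the synthetic white-noise test signal are all assumptions made for illustration. At each frequency f, the measured phase difference admits candidate delays spaced 1/f apart. The true delay is a candidate at every frequency, while the aliased candidates fall at different positions from one frequency to the next, so they spread out across the histogram.

```python
import numpy as np

def delay_histogram(x1, x2, fs, max_delay, n_bins=201, n_fft=1024):
    """Hypothetical helper: accumulate cross-spectrum weighted delay candidates.

    Assumes x2 is a delayed copy of x1 (delay in seconds, |delay| <= max_delay).
    """
    edges = np.linspace(-max_delay, max_delay, n_bins + 1)
    hist = np.zeros(n_bins)
    window = np.hanning(n_fft)
    freqs = np.fft.rfftfreq(n_fft, 1.0 / fs)[1:]  # drop DC (no phase information)
    for start in range(0, len(x1) - n_fft + 1, n_fft // 2):
        X1 = np.fft.rfft(window * x1[start:start + n_fft])[1:]
        X2 = np.fft.rfft(window * x2[start:start + n_fft])[1:]
        cross = X1 * np.conj(X2)   # phase equals 2*pi*f*delay, wrapped to (-pi, pi]
        phase = np.angle(cross)
        weight = np.abs(cross)     # cross-spectrum magnitude used as the weight
        for f, p, w in zip(freqs, phase, weight):
            # All delays consistent with the wrapped phase: spaced 1/f apart.
            k_max = int(max_delay * f) + 1
            k = np.arange(-k_max, k_max + 1)
            cand = (p + 2.0 * np.pi * k) / (2.0 * np.pi * f)
            cand = cand[np.abs(cand) <= max_delay]
            idx = np.searchsorted(edges, cand, side="right") - 1
            idx = np.clip(idx, 0, n_bins - 1)
            np.add.at(hist, idx, w)
    centers = 0.5 * (edges[:-1] + edges[1:])
    return centers, hist

# Synthetic example (assumed values): one white-noise source, 20-sample delay.
fs = 16000
rng = np.random.default_rng(0)
x1 = rng.standard_normal(fs)
true_delay = 20 / fs                 # 1.25 ms, far beyond half a wavelength at 8 kHz
x2 = np.roll(x1, 20)                 # x2[n] = x1[n - 20] (circular shift)
centers, hist = delay_histogram(x1, x2, fs, max_delay=2e-3)
est = centers[np.argmax(hist)]
```

In this sketch the histogram maximum `est` is expected to fall in the bin containing `true_delay`. Handling several concurrent sources would additionally rely on the sparseness assumption stated in the abstract, namely that each time-frequency point is dominated by a single source.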

Key words: Microphone array, Unambiguous delay estimation, Passive multi-source direction finding, Phase-difference wraparound ambiguity

ReleaseDate:2014-07-21 15:56:52
