



Volume 4892 LNCS, 2008, Pages 283-294

To separate speech: A system for recognizing simultaneous speech

Author keywords

[No Author keywords available]

Indexed keywords

BEAMFORMING; MICROPHONES; TRANSDUCERS;

EID: 40249114843     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-540-78155-4_25     Document Type: Conference Paper
Times cited: 9

References (31)
  • 1
    • 40249109687
    • Tracking and beamforming for multiple simultaneous speakers with probabilistic data association filters
    • Gehrig, T., Klee, U., McDonough, J., Ikbal, S., Wölfel, M., Fügen, C.: Tracking and beamforming for multiple simultaneous speakers with probabilistic data association filters. In: Proc. Interspeech, pp. 2594-2597 (2006)
    • (2006) Proc. Interspeech , pp. 2594-2597
    • Gehrig, T.1    Klee, U.2    McDonough, J.3    Ikbal, S.4    Wölfel, M.5    Fügen, C.6
  • 4
    • 0042826822
    • Independent component analysis: Algorithms and applications
    • Hyvärinen, A., Oja, E.: Independent component analysis: Algorithms and applications. Neural Networks 13, 411-430 (2000)
    • (2000) Neural Networks , vol.13 , pp. 411-430
    • Hyvärinen, A.1    Oja, E.2
  • 5
    • 40249114589
    • McDonough, J., Kumatani, K.: Minimum mutual information beamforming. Technical Report 107, Interactive Systems Lab, Universität Karlsruhe (August 2006)
  • 9
    • 0023313586
    • Description and generation of spherically invariant speech-model signals
    • Brehm, H., Stammler, W.: Description and generation of spherically invariant speech-model signals. Signal Processing 12, 119-141 (1987)
    • (1987) Signal Processing , vol.12 , pp. 119-141
    • Brehm, H.1    Stammler, W.2
  • 10
    • 84892168937
    • Full expansion of context-dependent networks in large vocabulary speech recognition
    • Seattle
    • Mohri, M., Riley, M., Hindle, D., Ljolje, A., Pereira, F.: Full expansion of context-dependent networks in large vocabulary speech recognition. In: Proc. ICASSP, Seattle, vol. II, pp. 665-668 (1998)
    • (1998) Proc. ICASSP , vol.2 , pp. 665-668
    • Mohri, M.1    Riley, M.2    Hindle, D.3    Ljolje, A.4    Pereira, F.5
  • 11
    • 0036460907
    • Weighted finite-state transducers in speech recognition
    • Mohri, M., Pereira, F., Riley, M.: Weighted finite-state transducers in speech recognition. Computer Speech and Language 16, 69-88 (2002)
    • (2002) Computer Speech and Language , vol.16 , pp. 69-88
    • Mohri, M.1    Pereira, F.2    Riley, M.3
  • 12
    • 0012263430
    • Network optimizations for large vocabulary speech recognition
    • Mohri, M., Riley, M.: Network optimizations for large vocabulary speech recognition. Speech Communication 25(3) (1998)
    • (1998) Speech Communication , vol.25 , Issue.3
    • Mohri, M.1    Riley, M.2
  • 13
    • 33947638544
    • Modeling polyphone context with weighted finite-state transducers
    • Stoimenov, E., McDonough, J.: Modeling polyphone context with weighted finite-state transducers. In: Proc. ICASSP (2006)
    • (2006) Proc. ICASSP
    • Stoimenov, E.1    McDonough, J.2
  • 14
    • 85164641250
    • Memory efficient modeling of polyphone context with weighted finite-state transducers
    • Stoimenov, E., McDonough, J.: Memory efficient modeling of polyphone context with weighted finite-state transducers. In: Proc. Interspeech (2007)
    • (2007) Proc. Interspeech
    • Stoimenov, E.1    McDonough, J.2
  • 15
    • 0348198473
    • Finite-state transducers in language and speech processing
    • Mohri, M.: Finite-state transducers in language and speech processing. Computational Linguistics 23(2) (1997)
    • (1997) Computational Linguistics , vol.23 , Issue.2
    • Mohri, M.1
  • 16
    • 85009070232
    • A weight pushing algorithm for large vocabulary speech recognition
    • Aalborg, Denmark, September
    • Mohri, M., Riley, M.: A weight pushing algorithm for large vocabulary speech recognition. In: Proc. ASRU, Aalborg, Denmark, September 2001, pp. 1603-1606 (2001)
    • (2001) Proc. ASRU , pp. 1603-1606
    • Mohri, M.1    Riley, M.2
  • 17
    • 0000116303
    • Minimization algorithms for sequential transducers
    • Mohri, M.: Minimization algorithms for sequential transducers. Theoretical Computer Science 234(1-2), 177-201 (2000)
    • (2000) Theoretical Computer Science , vol.234 , Issue.1-2 , pp. 177-201
    • Mohri, M.1
  • 18
    • 33846217002
    • The multi-channel Wall Street Journal audio visual corpus (MC-WSJ-AV): Specification and initial experiments
    • November
    • Lincoln, M., McCowan, I., Vepa, J., Maganti, H.: The multi-channel Wall Street Journal audio visual corpus (MC-WSJ-AV): Specification and initial experiments. In: Proc. ASRU, pp. 357-362 (November 2005)
    • (2005) Proc. ASRU , pp. 357-362
    • Lincoln, M.1    McCowan, I.2    Vepa, J.3    Maganti, H.4
  • 19
    • 85032772258
    • Minimum variance distortionless response spectral estimation, review and refinements
    • Wölfel, M., McDonough, J.: Minimum variance distortionless response spectral estimation, review and refinements. IEEE Signal Processing Magazine 22(5), 117-126 (2005)
    • (2005) IEEE Signal Processing Magazine , vol.22 , Issue.5 , pp. 117-126
    • Wölfel, M.1    McDonough, J.2
  • 21
    • 4243460174
    • Semi-tied covariance matrices
    • Gales, M.J.F.: Semi-tied covariance matrices. In: Proc. ICASSP (1998)
    • (1998) Proc. ICASSP
    • Gales, M.J.F.1
  • 22
    • 40249085519
    • Fransen, J., Pye, D., Robinson, T., Woodland, P., Young, S.: WSJCAM0 corpus and recording description. Technical Report CUED/F-INFENG/TR.192, Cambridge University Engineering Department (CUED) Speech Group (September 1994)
  • 25
    • 0034855183
    • Improvements in linear transform based speaker adaptation
    • Uebel, L., Woodland, P.: Improvements in linear transform based speaker adaptation. In: Proc. ICASSP (2001)
    • (2001) Proc. ICASSP
    • Uebel, L.1    Woodland, P.2
  • 26
    • 40249105704
    • Mel-Frequenzanpassung der Minimum Varianz Distortionless Response Einhüllenden
    • Wölfel, M.: Mel-Frequenzanpassung der Minimum Varianz Distortionless Response Einhüllenden. In: Proc. of ESSV (2003)
    • (2003) Proc. of ESSV
    • Wölfel, M.1
  • 27
    • 0032050110
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • Gales, M.J.F.: Maximum likelihood linear transformations for HMM-based speech recognition. Computer Speech and Language 12 (1998)
    • (1998) Computer Speech and Language , vol.12
    • Gales, M.J.F.1
  • 28
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • Leggetter, C.J., Woodland, P.C.: Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models. Computer Speech and Language 9, 171-185 (1995)
    • (1995) Computer Speech and Language , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 29
    • 44849112578
    • An algorithm for fast composition of weighted finite-state transducers
    • submitted
    • McDonough, J., Stoimenov, E., Klakow, D.: An algorithm for fast composition of weighted finite-state transducers. In: Proc. ASRU (submitted, 2007)
    • (2007) Proc. ASRU
    • McDonough, J.1    Stoimenov, E.2    Klakow, D.3
  • 30
    • 0009653561
    • Post-filtering techniques
    • Brandstein, M., Ward, D. (eds.), Springer, Heidelberg
    • Simmer, K.U., Bitzer, J., Marro, C.: Post-filtering techniques. In: Brandstein, M., Ward, D. (eds.) Microphone Arrays, pp. 39-60. Springer, Heidelberg (2001)
    • (2001) Microphone Arrays , pp. 39-60
    • Simmer, K.U.1    Bitzer, J.2    Marro, C.3
  • 31
    • 33750570839
    • McCowan, I., Hari-Krishna, M., Gatica-Perez, D., Moore, D., Ba, S.: Speech acquisition in meetings with an audio-visual sensor array. In: Proceedings of the IEEE International Conference on Multimedia and Expo (ICME) (July 2005)


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.