SCOPUS Information Retrieval Platform

IEEE Transactions on Speech and Audio Processing

Volume 13, Issue 5, 2005, Pages 712-730

SpeechFind: Advances in Spoken Document Retrieval for a National Gallery of the Spoken Word

(8) Hansen, John H L a,b Huang, Rongqing a,b Zhou, Bowen b,c Seadle, Michael d Deller, J R e Gurijala, Aparna R e Kurimo, Mikko f Angkititrakul, Pongtep a,g

a University of Colorado (United States)

b University of Texas at Dallas (United States)

c IBM T J Watson Research Center (United States)

d Michigan State University (United States)

e Michigan State University (United States)

f Aalto University (Finland)

g Eliza Corporation (United States)

Author keywords

[No Author keywords available]

EID: 85008020310 PISSN: 10636676 EISSN: None Source Type: Journal
DOI: 10.1109/TSA.2005.852088 Document Type: Article

Times cited : (88)

References (104)

1
- 85008030749
- [Online]. Available: http://www.ngsw.org

2
- 85008036539
- [Online]. Available: (Original website) http://speechfind.colorado.edu/; http://speechfind.utdallas.edu/

3
- 85009275098
- SPEECHFIND: An experimental on-line spoken document retrieval system for historical audio archives
- Denver, CO, Sep.
- B. Zhou and J. H. L. Hansen, “SPEECHFIND: An experimental on-line spoken document retrieval system for historical audio archives,” in Proc. Int. Conf. Spoken Language Process., vol. 3, Denver, CO, Sep. 2002, pp. 1969–1972.
- (2002) Proc. Int. Conf. Spoken Language Process. , vol.3 , pp. 1969-1972
- Zhou, B.¹ Hansen, J.H.L.²

4
- 85009083936
- Audio stream phrase recognition for a National Gallery of the Spoken Word: 'One small step'
- Beijing, China, Oct.
- J. H. L. Hansen, B. Zhou, M. Akbacak, R. Sarikaya, and B. Pellom, “Audio stream phrase recognition for a National Gallery of the Spoken Word: 'One small step',” in Proc. Int. Conf. Spoken Lang. Process., vol. 3, Beijing, China, Oct. 2000, pp. 1089–1092.
- (2000) Proc. Int. Conf. Spoken Lang. Process. , vol.3 , pp. 1089-1092
- Hansen, J.H.L.¹ Zhou, B.² Akbacak, M.³ Sarikaya, R.⁴ Pellom, B.⁵

5
- 84901265818
- Engineering challenges in the creation of a National Gallery of the Spoken Word: Transcript-free search of audio archives
- Roanoke, VA, Jun.
- J. H. L. Hansen, J. Deller, and M. Seadle, “Engineering challenges in the creation of a National Gallery of the Spoken Word: Transcript-free search of audio archives,” in Proc. IEEE ACM Joint Conf. Digital Libraries, Roanoke, VA, Jun. 2001, pp. 235–236.
- (2001) Proc. IEEE ACM Joint Conf. Digital Libraries , pp. 235-236
- Hansen, J.H.L.¹ Deller, J.² Seadle, M.³

6
- 85009243655
- Speech watermarking through parametric modeling
- Denver, CO, Sep.
- A. Gurijala, J. R. Deller Jr., M. S. Seadle, and J. H. L. Hansen, “Speech watermarking through parametric modeling,” in Proc. Int. Conf. Spoken Lang. Process., Denver, CO, Sep. 2002, pp. 621–624.
- (2002) Proc. Int. Conf. Spoken Lang. Process. , pp. 621-624
- Gurijala, A.¹ Deller, J.R.² Seadle, M.S.³ Hansen, J.H.L.⁴

7
- 0036992790
- Why watermark? The copyright need for an engineering solution
- Portland, OR, Jun.
- M. S. Seadle, J. R. Deller Jr., and A. Gurijala, “Why watermark? The copyright need for an engineering solution,” in Proc. Second ACM/IEEE Joint Conf. Digital Libraries, Portland, OR, Jun. 2002.
- (2002) Proc. Second ACM/IEEE Joint Conf. Digital Libraries
- Seadle, M.S.¹ Deller, J.R.² Gurijala, A.³

8
- 0036288688
- A new speaker change detection method for two-speaker segmentation
- A. Adami, S. Kajarekar, and H. Hermansky, “A new speaker change detection method for two-speaker segmentation,” in Proc. ICASSP, 2002.
- (2002) Proc. ICASSP
- Adami, A.¹ Kajarekar, S.² Hermansky, H.³

9
- 0037700756
- Speaker change detection and tracking in real-time news broadcasting analysis
- Paris, France, Dec.
- L. Lu and H. Zhang, “Speaker change detection and tracking in real-time news broadcasting analysis,” in Proc. ACM Multimedia, Paris, France, Dec. 2002.
- (2002) Proc. ACM Multimedia
- Lu, L.¹ Zhang, H.²

10
- 85009164449
- A new perspective on feature extraction for robust in-vehicle speech recognition
- Geneva, Switzerland, Sep.
- U. Yapanel and J. H. L. Hansen, “A new perspective on feature extraction for robust in-vehicle speech recognition,” in Proc. Eurospeech, Geneva, Switzerland, Sep. 2003, pp. 1281–1284.
- (2003) Proc. Eurospeech , pp. 1281-1284
- Yapanel, U.¹ Hansen, J.H.L.²

11
- 0002782496
- Automatic segmentation, classification and clustering of broadcast news audio
- Chantilly, VA
- M. Siegler, U. Jain, B. Raj, and R. M. Stern, “Automatic segmentation, classification and clustering of broadcast news audio,” in Proc. DARPA Speech Recog. Workshop, Chantilly, VA, 1997, pp. 97–99.
- (1997) Proc. DARPA Speech Recog. Workshop , pp. 97-99
- Siegler, M.¹ Jain, U.² Raj, B.³ Stern, R.M.⁴

12
- 3543118757
- Speaker, environment and channel change detection and clustering via the Bayesian information criterion
- S. Chen and P. Gopalakrishnan, “Speaker, environment and channel change detection and clustering via the Bayesian information criterion,” in Proc. Broadcast News Trans. Under. Workshop, 1998.
- (1998) Proc. Broadcast News Trans. Under. Workshop
- Chen, S.¹ Gopalakrishnan, P.²

13
- 0034842452
- MVDR-based feature extraction for robust speech recognition
- Salt Lake City, UT
- S. Dharanipragada and B. Rao, “MVDR-based feature extraction for robust speech recognition,” in ICASSP, Salt Lake City, UT, 2001.
- (2001) ICASSP
- Dharanipragada, S.¹ Rao, B.²

14
- 0002751623
- Segment generation and clustering in the HTK: Broadcast news transcription system
- Herndon, VA
- T. Hain, S. Johnson, A. Tuerk, P. Woodland, and S. Young, “Segment generation and clustering in the HTK: Broadcast news transcription system,” in DARPA Broadcast News Workshop, Herndon, VA, 1998.
- (1998) DARPA Broadcast News Workshop
- Hain, T.¹ Johnson, S.² Tuerk, A.³ Woodland, P.⁴ Young, S.⁵

15
- 4544369704
- Unsupervised audio segmentation and classification for robust spoken document retrieval
- Montreal, QC, Canada, May
- R. Huang and J. H. L. Hansen, “Unsupervised audio segmentation and classification for robust spoken document retrieval,” in Proc. IEEE ICASSP, vol. 1, Montreal, QC, Canada, May 2004, pp. 741–744.
- (2004) Proc. IEEE ICASSP , vol.1 , pp. 741-744
- Huang, R.¹ Hansen, J.H.L.²

16
- 22544475615
- Efficient audio stream segmentation via T2 statistic based Bayesian information criterion (T2-BIC)
- Jul.
- B. Zhou and J. H. L. Hansen, “Efficient audio stream segmentation via T2 statistic based Bayesian information criterion (T2-BIC),” IEEE Trans. Speech Audio Process., vol. 13, no. 4, Jul. 2005.
- (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.4
- Zhou, B.¹ Hansen, J.H.L.²

17
- 0003990972
- Managing Gigabytes: Compressing and Indexing Documents and Images
- San Francisco, CA: Morgan Kaufmann
- I. H. Witten, A. Moffat, and T. C. Bell, Managing Gigabytes: Compressing and Indexing Documents and Images. San Francisco, CA: Morgan Kaufmann, 1999.
- (1999)
- Witten, I.H.¹ Moffat, A.² Bell, T.C.³

18
- 80053431219
- Introduction to latent semantic analysis
- T. Landauer, P. Foltz, and D. Laham, “Introduction to latent semantic analysis,” Discourse Processes, vol. 25, pp. 259–284, 1998.
- (1998) Discourse Processes , vol.25 , pp. 259-284
- Landauer, T.¹ Foltz, P.² Laham, D.³

19
- 85008059184
- N. District Court Calif
- “N. District Court Calif.,” A&M Records, Inc. v. Napster, Inc., 99–5183, 2000.
- (2000) A&M Records, Inc. v. Napster, Inc. , No. 99-5183

20
- 85008019005
- 11th Circuit Court of Appeals
- “11th Circuit Court of Appeals,” Estate of Martin Luther King v. CBS, 98–9079, 1999.
- (1999) Estate of Martin Luther King v. CBS , No. 98-9079

21
- 84886521049
- Copyright in the networked world: New rules for images
- M. Seadle, “Copyright in the networked world: New rules for images,” Library Hi Tech., vol. 20, no. 2, 2002.
- (2002) Library Hi Tech. , vol.20 , Issue.2
- Seadle, M.¹

22
- 3042593473
- Whose rules? Intellectual property, culture, and indigenous communities
- Mar.
- M. Seadle, “Whose rules? Intellectual property, culture, and indigenous communities,” D-Lib Mag., vol. 8, no. 3, Mar. 2002.
- (2002) D-Lib Mag. , vol.8 , Issue.3
- Seadle, M.¹

23
- 64349124357
- Copyright in the networked world: Multimedia fair use
- M. Seadle, “Copyright in the networked world: Multimedia fair use,” Library Hi Tech., vol. 19, no. 4, 2001.
- (2001) Library Hi Tech. , vol.19 , Issue.4
- Seadle, M.¹

24
- 3042602303
- Spoken words, unspoken meanings: A DLI2 project ethnography
- Nov.
- M. Seadle, “Spoken words, unspoken meanings: A DLI2 project ethnography,” D-Lib Mag., Nov. 2000.
- (2000) D-Lib Mag.
- Seadle, M.¹

25
- 0030283741
- Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition
- Nov.
- J. H. L. Hansen, “Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition,” Speech Commun., Special Issue on Speech Under Stress, vol. 20, no. 2, pp. 151–170, Nov. 1996.
- (1996) Speech Commun., Special Issue on Speech Under Stress , vol.20 , Issue.2 , pp. 151-170
- Hansen, J.H.L.¹

26
- 0033688848
- High resolution speech feature parameterization for monophone based stressed speech recognition
- Jul.
- R. Sarikaya and J. H. L. Hansen, “High resolution speech feature parameterization for monophone based stressed speech recognition,” IEEE Signal Process. Lett., vol. 7, no. 7, pp. 182–185, Jul. 2000.
- (2000) IEEE Signal Process. Lett. , vol.7 , Issue.7 , pp. 182-185
- Sarikaya, R.¹ Hansen, J.H.L.²

27
- 0034229795
- A comparative study of traditional and newly proposed features for recognition of speech under stress
- Jul.
- S. E. Bou-Ghazale and J. H. L. Hansen, “A comparative study of traditional and newly proposed features for recognition of speech under stress,” IEEE Trans. Speech Audio Process., vol. 8, no. 4, pp. 429–442, Jul. 2000.
- (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.4 , pp. 429-442
- Bou-Ghazale, S.E.¹ Hansen, J.H.L.²

28
- 0030757418
- A study of temporal features and frequency characteristics in American English foreign accent
- Jul.
- L. M. Arslan and J. H. L. Hansen, “A study of temporal features and frequency characteristics in American English foreign accent,” J. Acoust. Soc. Amer., vol. 102, no. 1, pp. 28–40, Jul. 1997.
- (1997) J. Acoust. Soc. Amer. , vol.102 , Issue.1 , pp. 28-40
- Arslan, L.M.¹ Hansen, J.H.L.²

29
- 85008062974
- Advances in phone-based modeling for automatic accent classification
- to be published.
- P. Angkititrakul and J. H. L. Hansen, “Advances in phone-based modeling for automatic accent classification,” IEEE Trans. Speech Audio Proc., to be published.
- IEEE Trans. Speech Audio Proc.
- Angkititrakul, P.¹ Hansen, J.H.L.²

30
- 85135191939
- Talker-Independent keyword spotting for information retrieval
- J. Foote et al., “Talker-Independent keyword spotting for information retrieval,” in Proc. Eurospeech, vol. 3, 1995, pp. 2145–2149.
- (1995) Proc. Eurospeech , vol.3 , pp. 2145-2149
- Foote, J.¹

31
- 84892177707
- Experiments in broadcast news transcription
- Seattle, WA
- P. C. Woodland et al., “Experiments in broadcast news transcription,” in Proc. IEEE ICASSP, Seattle, WA, 1998, pp. 909–912.
- (1998) Proc. IEEE ICASSP , pp. 909-912
- Woodland, P.C.¹

32
- 85008058495
- [Online]. Available: http://speechbot.research.compaq.com/

33
- 85008018727
- [Online]. Available: http://www.dragonsys.com/news/pr/audiomine.html

34
- 0002494419
- A system for interactively skimming recorded speech
- B. Arons, “A system for interactively skimming recorded speech,” ACM Trans. Computer-Human Interaction, vol. 4, no. 1, pp. 3–38, 1997.
- (1997) ACM Trans. Computer-Human Interaction , vol.4 , Issue.1 , pp. 3-38
- Arons, B.¹

35
- 0035278951
- Confidence measures for large vocabulary continuous speech recognition
- Mar.
- F. Wessel, R. Schluter, K. Macherey, and H. Ney, “Confidence measures for large vocabulary continuous speech recognition,” IEEE Trans. Speech Audio Process., vol. 9, no. 3, pp. 288–298, Mar. 2001.
- (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.3 , pp. 288-298
- Wessel, F.¹ Schluter, R.² Macherey, K.³ Ney, H.⁴

36
- 0036296006
- Automatic speech summarization applied to English broadcast news speech
- Orlando, FL, May
- C. Hori, S. Furui, R. Malkin, Y. Hua, and A. Waibel, “Automatic speech summarization applied to English broadcast news speech,” in Proc. Int. Conf. Acoust. Speech, Signal Process., vol. 1, Orlando, FL, May 2002, pp. 9–12.
- (2002) Proc. Int. Conf. Acoust. Speech, Signal Process. , vol.1 , pp. 9-12
- Hori, C.¹ Furui, S.² Malkin, R.³ Hua, Y.⁴ Waibel, A.⁵

37
- 0032657771
- Progress in broadcast news transcription at Dragon Systems
- Phoenix, AZ, Mar.
- S. Wegmann, Z. P. Zhan, and L. Gillick, “Progress in broadcast news transcription at Dragon Systems,” in Proc. Int. Conf. Acoust. Speech, Signal Process., vol. 1, Phoenix, AZ, Mar. 1999, pp. 33–36.
- (1999) Proc. Int. Conf. Acoust. Speech, Signal Process. , vol.1 , pp. 33-36
- Wegmann, S.¹ Zhan, Z.P.² Gillick, L.³

38
- 0032678065
- Recent improvements to IBM's speech recognition system for automatic transcription of broadcast news
- Phoenix, AZ
- S. S. Chen, E. M. Eide, M. J. F. Gales, R. A. Gopinath, D. Kanevsky, and P. Olsen, “Recent improvements to IBM's speech recognition system for automatic transcription of broadcast news,” in Proc. Int. Conf. Acoust. Speech, Signal Process., Phoenix, AZ, 1999, pp. 37–40.
- (1999) Proc. Int. Conf. Acoust. Speech, Signal Process. , pp. 37-40
- Chen, S.S.¹ Eide, E.M.² Gales, M.J.F.³ Gopinath, R.A.⁴ Kanevsky, D.⁵ Olsen, P.⁶

39
- 0032687479
- The Cambridge University spoken document retrieval system
- Phoenix, AZ, Mar.
- S. E. Johnson, P. Jourlin, G. L. Moore, K. S. Jones, and P. C. Woodland, “The Cambridge University spoken document retrieval system,” in Proc. Int. Conf. Acoust. Speech, Signal Process., vol. 1, Phoenix, AZ, Mar. 1999, pp. 49–52.
- (1999) Proc. Int. Conf. Acoust. Speech, Signal Process. , vol.1 , pp. 49-52
- Johnson, S.E.¹ Jourlin, P.² Moore, G.L.³ Jones, K.S.⁴ Woodland, P.C.⁵

40
- 85009286577
- German broadcast news transcription
- Denver, CO, Sep.
- R. Hecht, J. Riedler, and G. Backfried, “German broadcast news transcription,” in Proc. Int. Conf. Spoken Lang. Process., Denver, CO, Sep. 2002, pp. 1753–1756.
- (2002) Proc. Int. Conf. Spoken Lang. Process. , pp. 1753-1756
- Hecht, R.¹ Riedler, J.² Backfried, G.³

41
- 0036293939
- Toward automatic corpus preparation for a German broadcast news transcription system
- Denver, CO, May
- W. Macherey and H. Ney, “Toward automatic corpus preparation for a German broadcast news transcription system,” in Proc. Int. Conf. Spoken Lang. Process., vol. 1, Denver, CO, May 2002, pp. 733–736.
- (2002) Proc. Int. Conf. Spoken Lang. Process. , vol.1 , pp. 733-736
- Macherey, W.¹ Ney, H.²

42
- 0033693013
- A baseline for the transcription of Italian broadcast news
- Istanbul, Turkey, Jun.
- F. Brugnara, M. Cettolo, M. Federico, and D. Giuliani, “A baseline for the transcription of Italian broadcast news,” in Proc. Int. Conf. Acoust. Speech, Signal Process., vol. 3, Istanbul, Turkey, Jun. 2000, pp. 1667–1670.
- (2000) Proc. Int. Conf. Acoust. Speech, Signal Process. , vol.3 , pp. 1667-1670
- Brugnara, F.¹ Cettolo, M.² Federico, M.³ Giuliani, D.⁴

43
- 85009198487
- Morpheme-based lexical modeling for Korean broadcast news transcription
- Geneva, Switzerland, Sep.
- Y.-H. Park, D.-H. Ahn, and M. Chung, “Morpheme-based lexical modeling for Korean broadcast news transcription,” in Proc. Eurospeech, Geneva, Switzerland, Sep. 2003, pp. 1129–1132.
- (2003) Proc. Eurospeech , pp. 1129-1132
- Park, Y.-H.¹ Ahn, D.-H.² Chung, M.³

44
- 85009227418
- Named entity extraction from Japanese broadcast news
- Geneva, Switzerland, Sep.
- A. Kobayashi, F. J. Och, and H. Ney, “Named entity extraction from Japanese broadcast news,” in Proc. Eurospeech, Geneva, Switzerland, Sep. 2003, pp. 1125–1128.
- (2003) Proc. Eurospeech , pp. 1125-1128
- Kobayashi, A.¹ Och, F.J.² Ney, H.³

45
- 0742324997
- Sequential estimation with optimal forgetting for robust speech recognition
- Jan.
- M. Afify and O. Siohan, “Sequential estimation with optimal forgetting for robust speech recognition,” IEEE Trans. Speech and Audio Processing, vol. 12, no. 1, pp. 19–26, Jan. 2004.
- (2004) IEEE Trans. Speech and Audio Processing , vol.12 , Issue.1 , pp. 19-26
- Afify, M.¹ Siohan, O.²

46
- 85009273501
- Japanese broadcast news transcription
- Denver, CO, Sep.
- L. Nguyen, X. Guo, R. Schwartz, and J. Makhoul, “Japanese broadcast news transcription,” in Proc. Int. Conf. Spoken Lang. Process., Denver, CO, Sep. 2002, pp. 1749–1752.
- (2002) Proc. Int. Conf. Spoken Lang. Process. , pp. 1749-1752
- Nguyen, L.¹ Guo, X.² Schwartz, R.³ Makhoul, J.⁴

47
- 85009268616
- Comparing isolately spoken keywords with spontaneously spoken queries for Japanese spoken document retrieval
- Denver, CO, Sep.
- H. Nishizaki and S. Nakagawa, “Comparing isolately spoken keywords with spontaneously spoken queries for Japanese spoken document retrieval,” in Proc. Int. Conf. Spoken Lang. Process., Denver, CO, Sep. 2002, pp. 1505–1508.
- (2002) Proc. Int. Conf. Spoken Lang. Process. , pp. 1505-1508
- Nishizaki, H.¹ Nakagawa, S.²

48
- 0036649836
- Discriminating capabilities of syllable-based features and approaches of utilizing them for voice retrieval of speech information in Mandarin Chinese
- Jul.
- B. Chen, H.-M. Wang, and L.-S. Lee, “Discriminating capabilities of syllable-based features and approaches of utilizing them for voice retrieval of speech information in Mandarin Chinese,” IEEE Trans. Speech Audio Proc., vol. 10, no. 5, pp. 303–314, Jul. 2002.
- (2002) IEEE Trans. Speech Audio Proc. , vol.10 , Issue.5 , pp. 303-314
- Chen, B.¹ Wang, H.-M.² Lee, L.-S.³

49
- 0347968278
- Bayesian learning of speech duration models
- Nov.
- J.-T. Chien and C.-H. Huang, “Bayesian learning of speech duration models,” IEEE Trans. Speech Audio Process., vol. 11, no. 6, pp. 558–567, Nov. 2003.
- (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.6 , pp. 558-567
- Chien, J.-T.¹ Huang, C.-H.²

50
- 85009197924
- Improved Chinese broadcast news transcription by language modeling with temporally consistent training corpora and iterative phrase extraction
- Geneva, Switzerland, Sep.
- P.-C. Chang, S.-P. Liao, and L.-S. Lee, “Improved Chinese broadcast news transcription by language modeling with temporally consistent training corpora and iterative phrase extraction,” in Proc. Eurospeech, Geneva, Switzerland, Sep. 2003, pp. 421–424.
- (2003) Proc. Eurospeech , pp. 421-424
- Chang, P.-C.¹ Liao, S.-P.² Lee, L.-S.³

51
- 3042825296
- State-dependent phonetic tied mixtures with pronunciation modeling for spontaneous speech recognition
- Jul.
- Y. Liu and P. Fung, “State-dependent phonetic tied mixtures with pronunciation modeling for spontaneous speech recognition,” IEEE Trans. Speech Audio Process., vol. 12, no. 4, pp. 351–364, Jul. 2004.
- (2004) IEEE Trans. Speech Audio Process. , vol.12 , Issue.4 , pp. 351-364
- Liu, Y.¹ Fung, P.²

52
- 0033693282
- Retrieval of broadcast news speech in Mandarin Chinese collected in Taiwan using syllable-level statistical characteristics
- Istanbul, Turkey, Jun.
- B. Chen, H.-M. Wang, and L.-S. Lee, “Retrieval of broadcast news speech in Mandarin Chinese collected in Taiwan using syllable-level statistical characteristics,” in Proc. Int. Conf. Acoust. Speech, Signal Process., vol. 3, Istanbul, Turkey, Jun. 2000, pp. 1771–1774.
- (2000) Proc. Int. Conf. Acoust. Speech, Signal Process. , vol.3 , pp. 1771-1774
- Chen, B.¹ Wang, H.-M.² Lee, L.-S.³

53
- 0141702122
- Audio segmentation, classification and clustering in a broadcast news task
- Hong Kong, Apr.
- H. Meinedo and J. Neto, “Audio segmentation, classification and clustering in a broadcast news task,” in Proc. IEEE Inter. Conf. Acoust. Speech, Signal Process., vol. 2, Hong Kong, Apr. 2003, pp. 5–8.
- (2003) Proc. IEEE Inter. Conf. Acoust. Speech, Signal Process. , vol.2 , pp. 5-8
- Meinedo, H.¹ Neto, J.²

54
- 0036299263
- Audio indexing of Arabic broadcast news
- Orlando, FL, May
- J. Billa, M. Noamany, A. Srivastava, D. Liu, R. Stone, J. Xu, J. Makhoul, and F. Kubala, “Audio indexing of Arabic broadcast news,” in Proc. Int. Conf. Acoust. Speech, Signal Process., vol. 1, Orlando, FL, May 2002, pp. 5–8.
- (2002) Proc. Int. Conf. Acoust. Speech, Signal Process. , vol.1 , pp. 5-8
- Billa, J.¹ Noamany, M.² Srivastava, A.³ Liu, D.⁴ Stone, R.⁵ Xu, J.⁶ Makhoul, J.⁷ Kubala, F.⁸

55
- 0034847329
- Automatic transcription of compressed broadcast audio
- Salt Lake City, UT, May
- C. Barras, L. Lamel, and J.-L. Gauvain, “Automatic transcription of compressed broadcast audio,” in Proc. Int. Conf. Acoust. Speech, Signal Process., vol. 1, Salt Lake City, UT, May 2001, pp. 265–268.
- (2001) Proc. Int. Conf. Acoust. Speech, Signal Process. , vol.1 , pp. 265-268
- Barras, C.¹ Lamel, L.² Gauvain, J.-L.³

56
- 85009150731
- Building a test collection for speech-driven web retrieval
- Geneva, Switzerland, Sep.
- A. Fujii and K. Itou, “Building a test collection for speech-driven web retrieval,” in Proc. Eurospeech, Geneva, Switzerland, Sep. 2003, pp. 1153–1156.
- (2003) Proc. Eurospeech , pp. 1153-1156
- Fujii, A.¹ Itou, K.²

57
- 85009275390
- Multi-scale and multi-model integration for improved performance in Chinese spoken document retrieval
- Denver, CO, Sep.
- W.-K. Lo, H. M. Meng, and P. C. Ching, “Multi-scale and multi-model integration for improved performance in Chinese spoken document retrieval,” in Proc. Int. Conf. Spoken Lang. Process., Denver, CO, Sep. 2002, pp. 1513–1516.
- (2002) Proc. Int. Conf. Spoken Lang. Process. , pp. 1513-1516
- Lo, W.-K.¹ Meng, H.M.² Ching, P.C.³

58
- 85009271609
- Toward automatic closed captioning: Low latency real time broadcast news transcription
- Denver, CO, Sep.
- M. Saraclar, M. Riley, E. Bocchieri, and V. Go, “Toward automatic closed captioning: Low latency real time broadcast news transcription,” in Proc. Int. Conf. Spoken Lang. Process., Denver, CO, Sep. 2002, pp. 1741–1744.
- (2002) Proc. Int. Conf. Spoken Lang. Process. , pp. 1741-1744
- Saraclar, M.¹ Riley, M.² Bocchieri, E.³ Go, V.⁴

59
- 79951784751
- Automatic summarization of broadcast news using structural features
- Geneva, Switzerland, Sep.
- S. R. Maskey and J. Hirschberg, “Automatic summarization of broadcast news using structural features,” in Proc. Eurospeech, Geneva, Switzerland, Sep. 2003, pp. 1173–1176.
- (2003) Proc. Eurospeech , pp. 1173-1176
- Maskey, S.R.¹ Hirschberg, J.²

60
- 0033705979
- Automatic speech summarization based on word significance and linguistic likelihood
- Istanbul, Turkey, Jun.
- C. Hori and S. Furui, “Automatic speech summarization based on word significance and linguistic likelihood,” in Proc. Int. Conf. Acoust. Speech, Signal Process., vol. 3, Istanbul, Turkey, Jun. 2000, pp. 1579–1582.
- (2000) Proc. Int. Conf. Acoust. Speech, Signal Process. , vol.3 , pp. 1579-1582
- Hori, C.¹ Furui, S.²

61
- 0032665630
- Experiments in topic indexing of broadcast news using neural networks
- Phoenix, AZ, Mar.
- C. Neukirchen, D. Willett, and G. Rigoll, “Experiments in topic indexing of broadcast news using neural networks,” in Proc. Int. Conf. Acoust. Speech, Signal Process., vol. 2, Phoenix, AZ, Mar. 1999, pp. 1093–1096.
- (1999) Proc. Int. Conf. Acoust. Speech, Signal Process. , vol.2 , pp. 1093-1096
- Neukirchen, C.¹ Willett, D.² Rigoll, G.³

62
- 0141702097
- Toward domain independent speaker clustering
- Hong Kong, Apr.
- Y. Moh, P. Nguyen, and J.-C. Junqua, “Toward domain independent speaker clustering,” in Proc. Int. Conf. Acoust. Speech, Signal Process., vol. 2, Hong Kong, Apr. 2003, pp. 85–88.
- (2003) Proc. Int. Conf. Acoust. Speech, Signal Process. , vol.2 , pp. 85-88
- Moh, Y.¹ Nguyen, P.² Junqua, J.-C.³

63
- 0034857759
- Speaker change detection and speaker clustering using VQ distortion for broadcast news speech recognition
- Salt Lake City, UT, May
- K. Mori and S. Nakagawa, “Speaker change detection and speaker clustering using VQ distortion for broadcast news speech recognition,” in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process., vol. 1, Salt Lake City, UT, May 2001, pp. 413–416.
- (2001) Proc. IEEE Int. Conf. Acoust. Speech, Signal Process. , vol.1 , pp. 413-416
- Mori, K.¹ Nakagawa, S.²

64
- 0032678104
- Probabilistic models for topic detection and tracking
- Phoenix, AZ, Mar.
- F. Walls, H. Jin, S. Sista, and R. Schwartz, “Probabilistic models for topic detection and tracking,” in Proc. Int. Conf. Acoust. Speech, Signal Process., vol. 1, Phoenix, AZ, Mar. 1999, pp. 521–524.
- (1999) Proc. Int. Conf. Acoust. Speech, Signal Process. , vol.1 , pp. 521-524
- Walls, F.¹ Jin, H.² Sista, S.³ Schwartz, R.⁴

65
- 0141496213
- Unsupervised language model adaptation for broadcast news
- Hong Kong, Apr.
- C. Langzhou, J.-L. Gauvain, L. Lamel, and G. Adda, “Unsupervised language model adaptation for broadcast news,” in Proc. Int. Conf. Acoust. Speech, Signal Process., vol. 1, Hong Kong, Apr. 2003, pp. 220–223.
- (2003) Proc. Int. Conf. Acoust. Speech, Signal Process. , vol.1 , pp. 220-223
- Langzhou, C.¹ Gauvain, J.-L.² Lamel, L.³ Adda, G.⁴

66
- 84979938858
- Language modeling structures in audio transcription for retrieval of historical speeches
- Vienna, Austria, Sep.
- M. Kurimo, B. Zhou, R. Huang, and J. H. L. Hansen, “Language modeling structures in audio transcription for retrieval of historical speeches,” in Proc. 12th Eur. Signal Process. Conf, Vienna, Austria, Sep. 6–10, 2004, pp. 557–560.
- (2004) Proc. 12th Eur. Signal Process. Conf , pp. 557-560
- Kurimo, M.¹ Zhou, B.² Huang, R.³ Hansen, J.H.L.⁴

67
- 0034852839
- Multiscale-audio indexing for translingual spoken document retrieval
- Salt Lake City, UT, May
- H.-M. Wang, H. Meng, P. Schone, B. Chen, and W.-K. Lo, “Multiscale-audio indexing for translingual spoken document retrieval,” in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process., vol. 1, Salt Lake City, UT, May 2001, pp. 605–608.
- (2001) Proc. IEEE Int. Conf. Acoust. Speech, Signal Process. , vol.1 , pp. 605-608
- Wang, H.-M.¹ Meng, H.² Schone, P.³ Chen, B.⁴ Lo, W.-K.⁵

68
- 0035441593
- Spoken language recognition-a step toward multilinguality in speech processing
- Sep.
- J. Navratil, “Spoken language recognition-a step toward multilinguality in speech processing,” IEEE Trans. Speech Audio Process., vol. 9, no. 5, pp. 678–685, Sep. 2001.
- (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.5 , pp. 678-685
- Navratil, J.¹

69
- 3042820894
- Automatic recognition of spontaneous speech for access to multilingual oral history archives
- Jul.
- W. Byrne, D. Doermann, M. Franz, S. Gustman, J. Hajic, D. Oard, M. Picheny, J. Psutka, B. Ramabhadran, D. Soergel, T. Ward, and W.-J. Zhu, “Automatic recognition of spontaneous speech for access to multilingual oral history archives,” IEEE Trans. Speech Audio Process., vol. 12, no. 4, pp. 420–435, Jul. 2004.
- (2004) IEEE Trans. Speech Audio Process. , vol.12 , Issue.4 , pp. 420-435
- Byrne, W.¹ Doermann, D.² Franz, M.³ Gustman, S.⁴ Hajic, J.⁵ Oard, D.⁶ Picheny, M.⁷ Psutka, J.⁸ Ramabhadran, B.⁹ Soergel, D.¹⁰ Ward, T.¹¹ Zhu, W.-J.¹²

70
- 0004026002
- Digital Watermarking
- San Diego, CA: Academic
- I. J. Cox, M. L. Miller, and J. A. Bloom, Digital Watermarking. San Diego, CA: Academic, 2002.
- (2002)
- Cox, I.J.¹ Miller, M.L.² Bloom, J.A.³

71
- 85009183675
- Speech watermarking by parametric embedding with an ℓ∞ fidelity criterion
- Geneva, Switzerland, Sep.
- A. Gurijala and J. R. Deller Jr., “Speech watermarking by parametric embedding with an ℓ∞ fidelity criterion,” in Proc. Interspeech/Eurospeech, Geneva, Switzerland, Sep. 2003, pp. 2933–2936.
- (2003) Proc. Interspeech/Eurospeech , pp. 2933-2936
- Gurijala, A.¹ Deller, J.R.²

72
- 84979921765
- Discrete-Time Processing of Speech Signals
- Second ed. Piscataway, NJ: IEEE, ch. 5.
- J. R. Deller Jr., J. H. L. Hansen, and J. G. Proakis, Discrete-Time Processing of Speech Signals, Second ed. Piscataway, NJ: IEEE, 2000, ch. 5.
- (2000)
- Deller, J.R.¹ Hansen, J.H.L.² Proakis, J.G.³

73
- 4143057226
- Speech watermarking with objective fidelity and robustness criterion
- Pacific Grove, CA, Nov.
- A. Gurijala and J. R. Deller Jr., “Speech watermarking with objective fidelity and robustness criterion,” in Proc. Asilomar Conf. Signals, Syst., Comput., Pacific Grove, CA, Nov. 2003.
- (2003) Proc. Asilomar Conf. Signals, Syst., Comput.
- Gurijala, A.¹ Deller, J.R.²

74
- 85008036550
- Speech watermarking through parametric modeling
- submitted for publication.
- A. Gurijala and J. R. Deller Jr., “Speech watermarking through parametric modeling,” submitted for publication.
- Gurijala, A.¹ Deller, J.R.²

75
- 0012482005
- SMART: A toolbox for set-membership filtering
- Budapest, Hungary
- S. Gollamudi, S. Nagaraj, S. Kapoor, and Y. F. Huang, “SMART: A toolbox for set-membership filtering,” in Proc. Eur. Conf. Circuit Theory Design, Budapest, Hungary, 1997.
- (1997) Proc. Eur. Conf. Circuit Theory Design
- Gollamudi, S.¹ Nagaraj, S.² Kapoor, S.³ Huang, Y.F.⁴

76
- 0033221637
- BEACON: An adaptive set-membership filtering technique with sparse updates
- Nov.
- S. Nagaraj, S. Gollamudi, S. Kapoor, and Y. F. Huang, “BEACON: An adaptive set-membership filtering technique with sparse updates,” IEEE Trans. Signal Process., vol. 47, no. 11, pp. 2928–2941, Nov. 1999.
- (1999) IEEE Trans. Signal Process. , vol.47 , Issue.11 , pp. 2928-2941
- Nagaraj, S.¹ Gollamudi, S.² Kapoor, S.³ Huang, Y.F.⁴

77
- 34547250779
- Set-membership identification and filtering in signal processing
- Feb.
- J. R. Deller Jr. and Y. F. Huang, “Set-membership identification and filtering in signal processing,” Circuits, Syst., Signal Process., Special Issue on Signal Process. Applications, Feb. 2002.
- (2002) Circuits, Syst., Signal Process., Special Issue on Signal Process. Applications
- Deller, J.R.¹ Huang, Y.F.²

78
- 85009090165
- High-level feature weighted GMM network for audio stream classification
- Jeju Island, Korea, Oct.
- R. Huang and J. H. L. Hansen, “High-level feature weighted GMM network for audio stream classification,” in Proc. Int. Conf. Spoken Language Process., Jeju Island, Korea, Oct. 2004.
- (2004) Proc. Int. Conf. Spoken Language Process.
- Huang, R.¹ Hansen, J.H.L.²

79
- 0003648234
- An Introduction to Multivariate Statistical Analysis
- New York: Wiley
- T. Anderson, An Introduction to Multivariate Statistical Analysis. New York: Wiley, 1958.
- (1958)
- Anderson, T.¹

80
- 0031177213
- Combined Bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models
- S. M. Ahadi and P. C. Woodland, “Combined Bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models,” Comput. Speech Language, vol. 11, pp. 187–206, 1997.
- (1997) Comput. Speech Language , vol.11 , pp. 187-206
- Ahadi, S.M.¹ Woodland, P.C.²

81
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- Apr.
- J. L. Gauvain and C. H. Lee, “Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains,” IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291–298, Apr. 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-298
- Gauvain, J.L.¹ Lee, C.H.²

82
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C. Leggetter and P. Woodland, “Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models,” Comput. Speech Language, vol. 9, pp. 171–185, 1995.
- (1995) Comput. Speech Language , vol.9 , pp. 171-185
- Leggetter, C.¹ Woodland, P.²

83
- 85135155427
- A comparative study of speaker adaptation techniques
- Madrid, Spain
- L. R. Neumeyer, A. Sankar, and V. V. Digalakis, “A comparative study of speaker adaptation techniques,” in Proc. Eurospeech, Madrid, Spain, 1995, pp. 1127–1130.
- (1995) Proc. Eurospeech , pp. 1127-1130
- Neumeyer, L.R.¹ Sankar, A.² Digalakis, V.V.³

84
- 0030640789
- Structural MAP speaker adaptation using hierarchical priors
- Santa Barbara, CA
- K. Shinoda and C. H. Lee, “Structural MAP speaker adaptation using hierarchical priors,” in Proc. IEEE Workshop Automatic Speech Recognition Understanding, Santa Barbara, CA, 1997, pp. 381–388.
- (1997) Proc. IEEE Workshop Automatic Speech Recognition Understanding , pp. 381-388
- Shinoda, K.¹ Lee, C.H.²

85
- 0036461005
- Structural maximum a posteriori linear regression for fast HMM adaptation
- Jan.
- O. Siohan, T. A. Myrvoll, and C. H. Lee, “Structural maximum a posteriori linear regression for fast HMM adaptation,” Comput. Speech Language, vol. 16, no. 1, pp. 5–24, Jan. 2002.
- (2002) Comput. Speech Language , vol.16 , Issue.1 , pp. 5-24
- Siohan, O.¹ Myrvoll, T.A.² Lee, C.H.³

86
- 85135272864
- Maximum a posterior linear regression for hidden Markov model adaptation
- Budapest, Hungary
- C. Chesta, O. Siohan, and C. H. Lee, “Maximum a posterior linear regression for hidden Markov model adaptation,” in Proc. Eurospeech, Budapest, Hungary, 1999, pp. 203–206.
- (1999) Proc. Eurospeech , pp. 203-206
- Chesta, C.¹ Siohan, O.² Lee, C.H.³

87
- 84874875877
- Maximum a posterior linear regression with elliptically symmetric matrix priors
- Budapest, Hungary
- W. Chou, “Maximum a posterior linear regression with elliptically symmetric matrix priors,” in Proc. Eurospeech, Budapest, Hungary, 1999, pp. 1–4.
- (1999) Proc. Eurospeech , pp. 1-4
- Chou, W.¹

88
- 0002615167
- Speaker adaptation: Techniques and challenges
- Keystone, CO
- P. C. Woodland, “Speaker adaptation: Techniques and challenges,” in Proc. IEEE Workshop Automatic Speech Recognition Understanding, Keystone, CO, 1999, pp. 85–90.
- (1999) Proc. IEEE Workshop Automatic Speech Recognition Understanding , pp. 85-90
- Woodland, P.C.¹

89
- 85008060129
- Rapid discriminative acoustic modeling based on eigenspace mapping for fast speaker adaptation
- to be published.
- B. Zhou and J. H. L. Hansen, “Rapid discriminative acoustic modeling based on eigenspace mapping for fast speaker adaptation,” IEEE Trans. Speech Audio Process., to be published.
- IEEE Trans. Speech Audio Process.
- Zhou, B.¹ Hansen, J.H.L.²

90
- 0003173603
- Okapi/Keenbow at TREC-8
- S. E. Robertson and S. Walker, “Okapi/Keenbow at TREC-8,” in Proc. TREC-8, 1999.
- (1999) Proc. TREC-8
- Robertson, S.E.¹ Walker, S.²

91
- 0003411512
- Simple, Proven Approaches to Text Retrieval
- Cambridge Univ., Cambridge, U.K.
- S. E. Robertson and K. S. Jones, “Simple, Proven Approaches to Text Retrieval,” Tech. Rep., Cambridge Univ., Cambridge, U.K., 1997.
- (1997) Tech. Rep.
- Robertson, S.E.¹ Jones, K.S.²

92
- 85009102300
- Document expansion for speech retrieval
- Berkeley, CA, Aug.
- A. Singhal and F. Pereira, “Document expansion for speech retrieval,” in Proc. 22nd ACM SIGIR Conf., Berkeley, CA, Aug. 1999.
- (1999) Proc. 22nd ACM SIGIR Conf.
- Singhal, A.¹ Pereira, F.²

93
- 0141702085
- Environmental sniffing: Noise knowledge estimation for robust speech systems
- Hong Kong, Apr.
- M. Akbacak and J. H. L. Hansen, “Environmental sniffing: Noise knowledge estimation for robust speech systems,” in Proc. Int. Conf. Acoust. Speech Signal Process., vol. 2, Hong Kong, Apr. 2003, pp. 113–116.
- (2003) Proc. Int. Conf. Acoust. Speech Signal Process. , vol.2 , pp. 113-116
- Akbacak, M.¹ Hansen, J.H.L.²

94
- 85009228811
- ENVIRONMENTAL SNIFFING: Robust digit recognition for an in-vehicle environment
- Geneva, Switzerland, Sep.
- M. Akbacak and J. H. L. Hansen, “ENVIRONMENTAL SNIFFING: Robust digit recognition for an in-vehicle environment,” in Proc. INTERSPEECH/Eurospeech, Geneva, Switzerland, Sep. 2003, pp. 2177–2180.
- (2003) Proc. INTERSPEECH/Eurospeech , pp. 2177-2180
- Akbacak, M.¹ Hansen, J.H.L.²

95
- 0036816475
- Content analysis for audio classification and segmentation
- Oct.
- L. Lu, H. Zhang, and H. Jiang, “Content analysis for audio classification and segmentation,” IEEE Trans. Speech Audio Proc., vol. 10, no. 7, pp. 504–516, Oct. 2002.
- (2002) IEEE Trans. Speech Audio Proc. , vol.10 , Issue.7 , pp. 504-516
- Lu, L.¹ Zhang, H.² Jiang, H.³

96
- 85050713839
- Audio Parsing and Rapid Speaker Adaptation in Speech Recognition for Spoken Document Retrieval
- Ph.D. dissertation, Robust Speech Processing Group, Center for Spoken Language Research, Univ. Colorado, Boulder, CO
- B. Zhou, “Audio Parsing and Rapid Speaker Adaptation in Speech Recognition for Spoken Document Retrieval,” Ph.D. dissertation, Robust Speech Processing Group, Center for Spoken Language Research, Univ. Colorado, Boulder, CO, 2003.
- (2003)
- Zhou, B.¹

97
- 85008052248
- [Online]. Available
- [Online]. Available: http://www.ukans.edu/carrie/docs/am-docs_index.html

98
- 85008062974
- Advances in phone-based modeling for automatic accent classification
- to be published.
- P. Angkititrakul and J. H. L. Hansen, “Advances in phone-based modeling for automatic accent classification,” IEEE Trans. Speech Audio Proc., to be published.
- IEEE Trans. Speech Audio Proc.
- Angkititrakul, P.¹ Hansen, J.H.L.²

99
- 0030784572
- Stochastic trajectory modeling and sentences searching for continuous speech recognition
- Jan.
- Y. Gong, “Stochastic trajectory modeling and sentences searching for continuous speech recognition,” IEEE Trans. Speech Audio Proc., vol. 5, no. 1, pp. 33–44, Jan. 1997.
- (1997) IEEE Trans. Speech Audio Proc. , vol.5 , Issue.1 , pp. 33-44
- Gong, Y.¹

100
- 85008017681
- Discriminative in-set/out-of-set speaker recognition
- submitted for publication.
- P. Angkititrakul and J. H. L. Hansen, “Discriminative in-set/out-of-set speaker recognition,” IEEE Trans. Speech Audio Processing, submitted for publication.
- IEEE Trans. Speech Audio Processing
- Angkititrakul, P.¹ Hansen, J.H.L.²

101
- 85050187568
- Lattice-based search for spoken utterance retrieval
- Boston, MA, May
- M. Saraclar and R. Sproat, “Lattice-based search for spoken utterance retrieval,” in Proc. HLT-NAACL, Boston, MA, May 2004, pp. 129–136.
- (2004) Proc. HLT-NAACL , pp. 129-136
- Saraclar, M.¹ Sproat, R.²

102
- 0027929445
- On structuring probabilistic dependencies in stochastic language modeling
- H. Ney, U. Essen, and R. Kneser, “On structuring probabilistic dependencies in stochastic language modeling,” Comput. Speech Language, vol. 8, pp. 1–38, 1994.
- (1994) Comput. Speech Language , vol.8 , pp. 1-38
- Ney, H.¹ Essen, U.² Kneser, R.³

103
- 84891308106
- SRILM - An extensible language modeling toolkit
- Denver, CO, Sep.
- A. Stolcke, “SRILM - An extensible language modeling toolkit,” in Proc. Int. Conf. Spoken Language Process., Denver, CO, Sep. 2002, pp. 901–904.
- (2002) Proc. Int. Conf. Spoken Language Process. , pp. 901-904
- Stolcke, A.¹

104
- 24144437364
- Speech transcription and spoken document retrieval in Finnish
- Lecture Notes in Computer Science
- M. Kurimo, V. Turunen, and I. Ekman, “Speech transcription and spoken document retrieval in Finnish,” in Machine Learning for Multimodal Interaction, Revised Selected Papers MLMI 2004 Workshop, vol. 3361, Lecture Notes in Computer Science, 2005, pp. 253–262.
- (2005) Revised Selected Papers MLMI 2004 Workshop , vol.3361 , pp. 253-262
- Kurimo, M.¹ Turunen, V.² Ekman, I.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.