-
1
-
-
0001979427
-
Meeting browser: Tracking and summarizing meetings
-
A. Waibel, M. Bett, and M. Finke, "Meeting browser: tracking and summarizing meetings," in Proc. DARPA Broadcast News Transcription and Understanding Workshop, 1998, pp. 281-286.
-
Proc. DARPA Broadcast News Transcription and Understanding Workshop, 1998
, pp. 281-286
-
-
Waibel, A.1
Bett, M.2
Finke, M.3
-
2
-
-
84857722303
-
The ICSI meeting project: Resources and research
-
A. Janin, J. Ang, S. Bhagat, R. Dhillon, J. Edwards, J. Macías-Guarasa, N. Morgan, B. Peskin, E. Shriberg, A. Stolcke, C. Wooters, and B. Wrede, "The ICSI meeting project: resources and research," in Proc. ICASSP'04 Meeting Recognition Workshop, 2004.
-
Proc. ICASSP'04 Meeting Recognition Workshop, 2004
-
-
Janin, A.1
Ang, J.2
Bhagat, S.3
Dhillon, R.4
Edwards, J.5
Macías-Guarasa, J.6
Morgan, N.7
Peskin, B.8
Shriberg, E.9
Stolcke, A.10
Wooters, C.11
Wrede, B.12
-
3
-
-
32044458420
-
Browsing recorded meetings with Ferret
-
P. Wellner, M. Flynn, and M. Guillemot, "Browsing recorded meetings with Ferret," in Proc. ICMI-MLMI, 2004, pp. 12-21.
-
Proc. ICMI-MLMI, 2004
, pp. 12-21
-
-
Wellner, P.1
Flynn, M.2
Guillemot, M.3
-
4
-
-
44849090969
-
Recognition and understanding of meetings: The AMI and AMIDA projects
-
S. Renals, T. Hain, and H. Bourlard, "Recognition and understanding of meetings: The AMI and AMIDA projects," in Proc. ASRU, 2007, pp. 238-247.
-
Proc. ASRU, 2007
, pp. 238-247
-
-
Renals, S.1
Hain, T.2
Bourlard, H.3
-
5
-
-
67649528017
-
The CALO meeting speech recognition and understanding system
-
G. Tur, A. Stolcke, L. Voss, J. Dowding, B. Favre, R. Fernandez, M. Frampton, M. Frandsen, C. Frederickson, M. Graciarena, D. Hakkani-Tür, D. Kintzing, K. Leveque, S. Mason, J. Niekrasz, S. Peters, M. Purver, K. Riedhammer, E. Shriberg, J. Tien, D. Vergyri, and F. Yang, "The CALO meeting speech recognition and understanding system," in Proc. SLT, 2008.
-
Proc. SLT, 2008
-
-
Tur, G.1
Stolcke, A.2
Voss, L.3
Dowding, J.4
Favre, B.5
Fernandez, R.6
Frampton, M.7
Frandsen, M.8
Frederickson, C.9
Graciarena, M.10
Hakkani-Tür, D.11
Kintzing, D.12
Leveque, K.13
Mason, S.14
Niekrasz, J.15
Peters, S.16
Purver, M.17
Riedhammer, K.18
Shriberg, E.19
Tien, J.20
Vergyri, D.21
Yang, F.22
-
6
-
-
60949097180
-
A realtime multimodal system for analyzing group meetings by combining face pose tracking and speaker diarization
-
K. Otsuka, S. Araki, K. Ishizuka, M. Fujimoto, M. Heinrich, and J. Yamato, "A realtime multimodal system for analyzing group meetings by combining face pose tracking and speaker diarization," in Proc. ICMI, 2008, pp. 257-264.
-
Proc. ICMI, 2008
, pp. 257-264
-
-
Otsuka, K.1
Araki, S.2
Ishizuka, K.3
Fujimoto, M.4
Heinrich, M.5
Yamato, J.6
-
7
-
-
74049143046
-
A multimedia retrieval system using speech input
-
A. Popescu-Belis, P. Poller, and J. Kilgour, "A multimedia retrieval system using speech input," in Proc. ICMI-MLMI, 2009, pp. 223-224.
-
Proc. ICMI-MLMI, 2009
, pp. 223-224
-
-
Popescu-Belis, A.1
Poller, P.2
Kilgour, J.3
-
8
-
-
70450174924
-
Real-time ASR from meetings
-
P. N. Garner, J. Dines, T. Hain, E. A. Hannani, M. Karafiát, D. Korchagin, M. Lincoln, V. Wan, and L. Zhang, "Real-time ASR from meetings," in Proc. Interspeech, 2009, pp. 2119-2122.
-
Proc. Interspeech, 2009
, pp. 2119-2122
-
-
Garner, P.N.1
Dines, J.2
Hain, T.3
Hannani, E.A.4
Karafiát, M.5
Korchagin, D.6
Lincoln, M.7
Wan, V.8
Zhang, L.9
-
9
-
-
70450204727
-
A study of mutual front-end processing method based on statistical model for noise robust speech recognition
-
M. Fujimoto, K. Ishizuka, and T. Nakatani, "A study of mutual front-end processing method based on statistical model for noise robust speech recognition," in Proc. Interspeech, 2009, pp. 1235-1238.
-
Proc. Interspeech, 2009
, pp. 1235-1238
-
-
Fujimoto, M.1
Ishizuka, K.2
Nakatani, T.3
-
10
-
-
33645758265
-
NTT Speech recognizer with Outlook on the Next generation: SOLON
-
T. Hori, "NTT Speech recognizer with Outlook On the Next generation: SOLON," in Proc. NTT Workshop on Communication Scene Analysis, 2004, pp. SP-6. [Online]. Available: www.kecl.ntt.co.jp/icl/signal/hori/publications/thori csa2004.pdf.
-
Proc. NTT Workshop on Communication Scene Analysis, 2004
-
-
Hori, T.1
-
11
-
-
77957745677
-
Blind separation and dereverberation of speech mixtures by joint optimization
-
accepted for publication, doi:10.1109/TASL.2010.2045183
-
T. Yoshioka, T. Nakatani, T. Miyoshi, and H. G. Okuno, "Blind separation and dereverberation of speech mixtures by joint optimization," IEEE Transactions on Audio, Speech, and Language Processing, 2010, accepted for publication, doi:10.1109/TASL.2010.2045183.
-
(2010)
IEEE Transactions on Audio, Speech, and Language Processing
-
-
Yoshioka, T.1
Nakatani, T.2
Miyoshi, T.3
Okuno, H.G.4
-
12
-
-
51449113843
-
Speaker indexing and speech enhancement in real meeting / conversations
-
S. Araki, M. Fujimoto, K. Ishizuka, H. Sawada, and S. Makino, "Speaker indexing and speech enhancement in real meeting / conversations," in Proc. ICASSP, 2008, vol. I, pp. 93-96.
-
Proc. ICASSP, 2008
, vol.1
, pp. 93-96
-
-
Araki, S.1
Fujimoto, M.2
Ishizuka, K.3
Sawada, H.4
Makino, S.5
-
13
-
-
0016990291
-
The generalized correlation method for estimation of time delay
-
C. H. Knapp and G. C. Carter, "The generalized correlation method for estimation of time delay," IEEE Trans. Acoust. Speech and Signal Processing, vol. 24, no. 4, pp. 320-327, 1976.
-
(1976)
IEEE Trans. Acoust. Speech and Signal Processing
, vol.24
, Issue.4
, pp. 320-327
-
-
Knapp, C.H.1
Carter, G.C.2
-
14
-
-
34247223586
-
Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors
-
S. Araki, H. Sawada, R. Mukai, and S. Makino, "Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors," Signal Processing, vol. 77, no. 8, pp. 1833-1847, Aug 2007.
-
(2007)
Signal Processing
, vol.77
, Issue.8
, pp. 1833-1847
-
-
Araki, S.1
Sawada, H.2
Mukai, R.3
Makino, S.4
-
15
-
-
50449097931
-
Noise robust voice activity detection based on switching Kalman filter
-
M. Fujimoto and K. Ishizuka, "Noise robust voice activity detection based on switching Kalman filter," in Proc. Interspeech, 2007, pp. 2933-2936.
-
Proc. Interspeech, 2007
, pp. 2933-2936
-
-
Fujimoto, M.1
Ishizuka, K.2
-
16
-
-
0032762471
-
A statistical model-based voice activity detection
-
J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Processing Letters, vol. 6, no. 1, pp. 1-3, January 1999.
-
(1999)
IEEE Signal Processing Letters
, vol.6
, Issue.1
, pp. 1-3
-
-
Sohn, J.1
Kim, N.S.2
Sung, W.3
-
17
-
-
33745207361
-
A Japanese national project on spontaneous speech corpus and processing technology
-
S. Furui, K. Maekawa, and H. Isahara, "A Japanese national project on spontaneous speech corpus and processing technology," in Proc. ASR, 2000, pp. 244-248.
-
Proc. ASR, 2000
, pp. 244-248
-
-
Furui, S.1
Maekawa, K.2
Isahara, H.3
-
18
-
-
78049393373
-
A comparative study on methods of weighted language model training for reranking LVCSR n-best hypotheses
-
T. Oba, T. Hori, and A. Nakamura, "A comparative study on methods of weighted language model training for reranking LVCSR n-best hypotheses," in Proc. ICASSP, 2010, pp. 5126-5129.
-
Proc. ICASSP, 2010
, pp. 5126-5129
-
-
Oba, T.1
Hori, T.2
Nakamura, A.3
-
19
-
-
45849093239
-
Efficient WFST-based one-pass decoding with on-the-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition
-
T. Hori, C. Hori, Y. Minami, and A. Nakamura, "Efficient WFST-based one-pass decoding with on-the-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 4, pp. 1352-1365, 2007.
-
(2007)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.15
, Issue.4
, pp. 1352-1365
-
-
Hori, T.1
Hori, C.2
Minami, Y.3
Nakamura, A.4
-
20
-
-
33646426591
-
Generalized fast on-the-fly composition algorithm for WFST-based speech recognition
-
T. Hori and A. Nakamura, "Generalized fast on-the-fly composition algorithm for WFST-based speech recognition," in Proc. Interspeech-Eurospeech, 2005, pp. 557-560.
-
Proc. Interspeech-Eurospeech, 2005
, pp. 557-560
-
-
Hori, T.1
Nakamura, A.2
-
21
-
-
85009271609
-
Towards automatic closed captioning: Low latency real time broadcast news transcription
-
M. Saraclar, M. Riley, E. Bocchieri, and V. Goffin, "Towards automatic closed captioning: low latency real time broadcast news transcription," in Proc. ICSLP, 2002, pp. 1741-1744.
-
Proc. ICSLP, 2002
, pp. 1741-1744
-
-
Saraclar, M.1
Riley, M.2
Bocchieri, E.3
Goffin, V.4
-
22
-
-
38049176869
-
CLEAR evaluation of acoustic event detection and classification systems
-
A. Temko, R. Malkin, C. Zieger, D. Macho, C. Nadeu, and M. Omologo, "CLEAR evaluation of acoustic event detection and classification systems," Multimodal Technologies for Perception of Humans, pp. 311-322, 2007.
-
(2007)
Multimodal Technologies for Perception of Humans
, pp. 311-322
-
-
Temko, A.1
Malkin, R.2
Zieger, C.3
Macho, D.4
Nadeu, C.5
Omologo, M.6
-
23
-
-
77956207114
-
Topic tracking model for analyzing consumer purchase behavior
-
T. Iwata, S. Watanabe, T. Yamada, and N. Ueda, "Topic tracking model for analyzing consumer purchase behavior," in Proc. IJCAI, 2009, pp. 1427-1432.
-
Proc. IJCAI, 2009
, pp. 1427-1432
-
-
Iwata, T.1
Watanabe, S.2
Yamada, T.3
Ueda, N.4
-
24
-
-
70450162101
-
Memory-based particle filter for face pose tracking robust under complex dynamics
-
D. Mikami, K. Otsuka, and J. Yamato, "Memory-based particle filter for face pose tracking robust under complex dynamics," in Proc. CVPR, 2009, pp. 999-1006.
-
Proc. CVPR, 2009
, pp. 999-1006
-
-
Mikami, D.1
Otsuka, K.2
Yamato, J.3