SCOPUS 정보 검색 플랫폼

IEEE Journal on Selected Topics in Signal Processing

Volumn 5, Issue 6, 2011, Pages 1252-1261

Lyric synchronizer: Automatic synchronization system between musical audio signals and lyrics

(4) Fujihara, Hiromasa a Goto, Masataka a Ogata, Jun a Okuno, Hiroshi G b

a NATIONAL INSTITUTE OF ADVANCED INDUSTRIAL SCIENCE AND TECHNOLOGY AIST (Japan)

b KYOTO UNIVERSITY (Japan)

Author keywords

Alignment; Lyrics; Singing voice; Viterbi algorithm; Vocal

Indexed keywords

ALIGNMENT TECHNIQUE; AUTOMATIC SYNCHRONIZATION; CONVENTIONAL METHODS; LYRICS; MUSIC-PLAYBACK INTERFACES; MUSICAL AUDIO SIGNAL; SPEECH SIGNALS; VITERBI; VOCAL; VOCAL SIGNALS;

ALIGNMENT; SIGNAL DETECTION; SPEECH; SYNCHRONIZATION; VITERBI ALGORITHM;

AUDIO ACOUSTICS;

EID: 80052978670 PISSN: 19324553 EISSN: None Source Type: Journal
DOI: 10.1109/JSTSP.2011.2159577 Document Type: Article

Times cited : (81)

References (27)

1
- 13444270474
- LyricAlly: Automatic synchronization of acoustic musical signals and textual lyrics
- ACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia
- Y.Wang, M.-Y. Kan, T. L. Nwe, A. Shenoy, and J. Yin, "Lyrically: Automatic synchronization of acoustic musical signals and textual lyrics," in Proc. 12th ACM Int. Conf. Multimedia, 2004, pp. 212-219. (Pubitemid 40211749)
- (2004) ACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia , pp. 212-219
- Wang, Y.¹ Kan, M.-Y.² Nwe, T.L.³ Shenoy, A.⁴ Yin, J.⁵

2
- 33846989449
- Automatic lyrics alignment for Cantonese popular music
- C. H. Wong, W. M. Szeto, and K. H. Wong, "Automatic lyrics alignment for Cantonese popular music," Multimedia Syst., vol. 4-5, no. 12, pp. 307-323, 2007.
- (2007) Multimedia Syst. , vol.4-5 , Issue.12 , pp. 307-323
- Wong, C.H.¹ Szeto, W.M.² Wong, K.H.³

3
- 84860824163
- Low-delay singing voice alignment to text
- A. Loscos, P. Cano, and J. Bonada, "Low-delay singing voice alignment to text," in Proc. Int. Comput. Music Conf. (ICMC99), 1999.
- (1999) Proc. Int. Comput. Music Conf. (ICMC99)
- Loscos, A.¹ Cano, P.² Bonada, J.³

4
- 85009187525
- An automatic singing transcription system with multilingual singing lyric recognizer and robust melody tracker
- C.-K.Wang, R.-Y. Lyu, and Y.-C. Chiang, "An automatic singing transcription system with multilingual singing lyric recognizer and robust melody tracker," in Proc. 8th Euro. Conf. Speech Commun. Technol. (Eurospeech'03), 2003, pp. 1197-1200.
- (2003) Proc. 8th Euro. Conf. Speech Commun. Technol. (Eurospeech'03) , pp. 1197-1200
- Wang, C.-K.¹ Lyu, R.-Y.² Chiang, Y.-C.³

5
- 84873581013
- Phoneme recognition in popular music
- M. Gruhne, K. Schmidt, and C. Dittmar, "Phoneme recognition in popular music," in Proc. 8th Int. Conf. Music Inf. Retrieval (ISMIR'07), 2007, pp. 369-370.
- (2007) Proc. 8th Int. Conf. Music Inf. Retrieval (ISMIR'07) , pp. 369-370
- Gruhne, M.¹ Schmidt, K.² Dittmar, C.³

6
- 76949102667
- A modeling of singing voice robust to accompaniment sounds and its application to singer identification and vocal-timbre-similarity based music information retrieval
- Mar
- H. Fujihara, M. Goto, T. Kitahara, and H. G. Okuno, "A modeling of singing voice robust to accompaniment sounds and its application to singer identification and vocal-timbre-similarity based music information retrieval," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 638-648, Mar. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.3 , pp. 638-648
- Fujihara, H.¹ Goto, M.² Kitahara, T.³ Okuno, H.G.⁴

7
- 4644242508
- A real-time music-scene-description system: Predominant- F0 estimation for detecting melody and bass lines in real-world audio signals
- M. Goto, "A real-time music-scene-description system: Predominant- F0 estimation for detecting melody and bass lines in real-world audio signals," Speech Commun., vol. 43, no. 4, pp. 311-329, 2004.
- (2004) Speech Commun. , vol.43 , Issue.4 , pp. 311-329
- Goto, M.¹

8
- 84873632675
- Evaluation of multiple-F0 estimation and tracking systems
- M. Bay, A. F. Ehmann, and J. S. Downie, "Evaluation of multiple-F0 estimation and tracking systems," in Proc. 10th Int. Soc. Music Inf. Retrieval Conf. (ISMIR'09), 2009, pp. 315-320.
- (2009) Proc. 10th Int. Soc. Music Inf. Retrieval Conf. (ISMIR'09) , pp. 315-320
- Bay, M.¹ Ehmann, A.F.² Downie, J.S.³

9
- 48849095345
- Melody transcription from music audio: Approaches and evaluation
- May
- G. E. Poliner, D. P. Ellis, A. F. Ehmann, E. Gómez, S. Streich, and B. Ong, "Melody transcription from music audio: Approaches and evaluation," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 4, pp. 1247-1256, May 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.4 , pp. 1247-1256
- Poliner, G.E.¹ Ellis, D.P.² Ehmann, A.F.³ Gómez, E.⁴ Streich, S.⁵ Ong, B.⁶

10
- 0035688686
- Locating singing voice segments within music signals
- A. L. Berenzweig and D. P. W. Ellis, "Locating singing voice segments within music signals," in Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust. (WASPAA), 2001, pp. 119-122. (Pubitemid 34080608)
- (2001) IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics , pp. 119-122
- Berenzweig, A.L.¹ Ellis, D.P.W.²

11
- 4544255234
- Automatic detection and tracking of target singer in multi-singer music recordings
- W.-H. Tsai and H.-M. Wang, "Automatic detection and tracking of target singer in multi-singer music recordings," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'04), 2004, pp. 221-224.
- (2004) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'04) , pp. 221-224
- Tsai, W.-H.¹ Wang, H.-M.²

12
- 33745209127
- Automatic detection of vocal segments in popular songs
- T. L. Nwe and Y. Wang, "Automatic detection of vocal segments in popular songs," in Proc. 5th Int. Conf. Music Inf. Retrieval (ISMIR'04), 2004, pp. 138-145.
- (2004) Proc. 5th Int. Conf. Music Inf. Retrieval (ISMIR'04) , pp. 138-145
- Nwe, T.L.¹ Wang, Y.²

13
- 0018306059
- A threshold selection method from gray-level histograms
- N. Otsu, "A threshold selection method from gray-level histograms," IEEE Trans. System, Man, Cybern., vol. SMC-9, no. 1, pp. 62-66, Jan. 1979. (Pubitemid 9413341)
- (1979) IEEE Trans Syst Man Cybern , vol.SMC-9 , Issue.1 , pp. 62-66
- Otsu Nobuyuki¹

14
- 0038000307
- Generalized functional approximation for source filter system modeling
- X. R. T. Galas, "Generalized functional approximation for source filter system modeling," in Proc. 2nd Eur. Conf. Speech Commun. Technol. (Eurospeech'91), 1991, pp. 1085-1088.
- (1991) Proc. 2nd Eur. Conf. Speech Commun. Technol. (Eurospeech'91) , pp. 1085-1088
- Galas, X.R.T.¹

15
- 0029406853
- Adaptive cepstral analysis of speech
- Nov
- K. Tokuda, T. Kobayashi, and S. Imai, "Adaptive cepstral analysis of speech," IEEE Trans. Speech Audio Process., vol. 3, no. 6, pp. 481-489, Nov. 1995.
- (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.6 , pp. 481-489
- Tokuda, K.¹ Kobayashi, T.² Imai, S.³

16
- 29144452868
- Pitch-dependent identification of musical instrument sounds
- DOI 10.1007/s10489-005-4612-1
- T. Kitahara, M. Goto, and H. G. Okuno, "Pitch-dependent identification of musical instrument sounds," Appl. Intell., vol. 23, no. 3, pp. 267-275, 2005. (Pubitemid 41801688)
- (2005) Applied Intelligence , vol.23 , Issue.3 , pp. 267-275
- Kitahara, T.¹ Goto, M.² Okuno, H.G.³

17
- 51449085941
- IPSJ SIG Tech. Rep. no. 90
- H. Kameoka, M. Goto, and S. Sagayama, "Selective amplifier of periodic and non-periodic components in concurrent audio signals with spectral control envelopes," 2006, vol. 2006, pp. 77-84, IPSJ SIG Tech. Rep., no. 90.
- (2006) Selective Amplifier of Periodic and Non-Periodic Components in Concurrent Audio Signals with Spectral Control Envelopes , vol.2006 , pp. 77-84
- Kameoka, H.¹ Goto, M.² Sagayama, S.³

18
- 0030676908
- Accurate keyword spotting using strictly lexical fillers
- R. E. Méliani, "Accurate keyword spotting using strictly lexical fillers," in Proc. 1997 IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'97), 1997, pp. 907-910.
- (1997) Proc. 1997 IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'97) , pp. 907-910
- Méliani, R.E.¹

19
- 0030715925
- A segment-based wordspotter using phonetic filler models
- A. S. Manos and V. W. Zue, "A segment-based wordspotter using phonetic filler models," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'97), 1997, pp. 899-902.
- (1997) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'97) , pp. 899-902
- Manos, A.S.¹ Zue, V.W.²

20
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- Apr
- J. L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291-298, Apr. 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-298
- Gauvain, J.L.¹ Lee, C.-H.²

21
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, pp. 171-185, 1995.
- (1995) Comput. Speech Lang. , vol.9 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

22
- 0019053271
- Comparison of parametric representation for monosyllabic word recognition
- S. B. Davis and P. Mermelstein, "Comparison of parametric representation for monosyllabic word recognition," IEEE Trans. Acoustic, Speech, Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980. (Pubitemid 11464930)
- (1980) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-28 , Issue.4 , pp. 357-366
- Davis Steven, B.¹ Mermelstein Paul²

23
- 0141623871
- RWCmusic database: Popular, classical, and jazz music databases
- Oct
- M. Goto, H. Hashiguchi, T. Nishimura, and R. Oka, "RWCmusic database: Popular, classical, and jazz music databases," in Proc. 3rd Int. Conf. Music Inf. Retrieval (ISMIR'02), Oct. 2002, pp. 287-288.
- (2002) Proc. 3rd Int. Conf. Music Inf. Retrieval (ISMIR'02) , pp. 287-288
- Goto, M.¹ Hashiguchi, H.² Nishimura, T.³ Oka, R.⁴

24
- 85009067482
- Recent progress of open-source LVCSR engine Julius and Japanese model repository-software of continuous speech recognition consortium
- T. Kawahara, A. Lee, K. Takeda, and K. Shikano, "Recent progress of open-source LVCSR engine Julius and Japanese model repository- Software of continuous speech recognition consortium-," in Proc. 6th Int. Conf. Spoken Lang. Process. (Interspeech'04 ICSLP), 2004.
- (2004) Proc. 6th Int. Conf. Spoken Lang. Process. (Interspeech'04 ICSLP)
- Kawahara, T.¹ Lee, A.² Takeda, K.³ Shikano, K.⁴

25
- 85111262831
- Applying conditional random fields to Japanese morphological analysis
- T. Kudo, K. Yamamoto, and Y. Matsumoto, "Applying conditional random fields to Japanese morphological analysis," in Proc. Conf. Empirical Methods in Natural Lang. Process., 2004, pp. 230-237.
- (2004) Proc. Conf. Empirical Methods in Natural Lang. Process. , pp. 230-237
- Kudo, T.¹ Yamamoto, K.² Matsumoto, Y.³

26
- 33646802604
- An auto-regressive, non-stationary excited signal parameter estimation method and an evaluation of a singing-voice recognition
- DOI 10.1109/ICASSP.2005.1415094, 1415094, 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Proceedings - Speech Processing
- A. Sasou, M. Goto, S. Hayamizu, and K. Tanaka, "An auto-regressive, non-stationary excited signal parameter estimation method and an evaluation of a singing-voice recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'05), 2005, pp. I-237-I-240. (Pubitemid 43761134)
- (2005) ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings , vol.I
- Sasou, A.¹ Goto, M.² Hayamizu, S.³ Tanaka, K.⁴

27
- 34547519365
- Active music listening interfaces based on signal processing
- DOI 10.1109/ICASSP.2007.367351, 4218382, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07
- M. Goto, "Active music listening interfaces based on signal processing," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'07), 2007, pp. IV-1441-IV-1444. (Pubitemid 47178651)
- (2007) ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings , vol.4
- Goto, M.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.