SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 16, Issue 3, 2008, Pages 639-650

Specmurt analysis of polyphonic music signals

(5) Saito, Shoichiro a,b Kameoka, Hirokazu a,c Takahashi, Keigo a,d Nishimoto, Takuya a Sagayama, Shigeki a

a UNIVERSITY OF TOKYO (Japan)

b NTT CORPORATION (Japan)

c NTT Communication Science Laboratories (Japan)

d Natl Police Agency (Japan)

Author keywords

Inverse filtering; Iteration algorithm; Multipitch analysis; Pitch visualization; Polyphonic music signals

Indexed keywords

INVERSE FILTERING; ITERATION ALGORITHM; MULTIPITCH ANALYSIS; PITCH VISUALIZATION; POLYPHONIC MUSIC SIGNALS;

ALGORITHMS; ELECTRONIC MUSICAL INSTRUMENTS; HARMONIC ANALYSIS; ITERATIVE METHODS; NATURAL FREQUENCIES; POWER SPECTRUM; SIGNAL PROCESSING; SPECTRUM ANALYSIS; VISUALIZATION;

FOURIER TRANSFORMS;

EID: 64849091277 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2007.912998 Document Type: Article

Times cited : (46)

References (29)

1
- 0002161311
- The quefrency alanysis of time series for echos: Cepstrum, pseudo-autocovariance, cross-cepstrum. and saphe-cracking
- B. P. Bogert, M. J. R. Healry, and J. W. Tukey, "The quefrency alanysis of time series for echos: Cepstrum, pseudo-autocovariance, cross-cepstrum. and saphe-cracking," in Proc. Symp. Time Series Analysis. 1963, pp. 209-243.
- (1963) Proc. Symp. Time Series Analysis , pp. 209-243
- Bogert, B.P.¹ Healry, M.J.R.² Tukey, J.W.³

2
- 84953653667
- Short-time spectrum and 'cepstrum' techniques for vocal-pitch detection
- Feb
- A. M. Noll, "Short-time spectrum and 'cepstrum' techniques for vocal-pitch detection,"J. Acoust. Soc. Amer., vol. 36, no. 2, pp. 296-302, Feb. 1964.
- (1964) J. Acoust. Soc. Amer , vol.36 , Issue.2 , pp. 296-302
- Noll, A.M.¹

3
- 34548108312
- On individuality in a dynamic measure of speech
- in Japanese, Jul
- S. Sagayama and F. Itakura, "On individuality in a dynamic measure of speech," in Proc. ASJ Conf. (in Japanese), Jul. 1979, pp. 589-590.
- (1979) Proc. ASJ Conf , pp. 589-590
- Sagayama, S.¹ Itakura, F.²

4
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- Aug
- S. E. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust, Speech, Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
- (1980) IEEE Trans. Acoust, Speech, Signal Process , vol.ASSP-28 , Issue.4 , pp. 357-366
- Davis, S.E.¹ Mermelstein, P.²

5
- 0007958098
- Speech analysis synthesis system using log magnitude approximation filter, (in Japanese)
- S. Imai and T. Kitamura, "Speech analysis synthesis system using log magnitude approximation filter," (in Japanese) Trans. IEICE Japan, vol. J61-A, no. 6, pp. 527-534, 1978.
- (1978) Trans. IEICE Japan , vol.J61-A , Issue.6 , pp. 527-534
- Imai, S.¹ Kitamura, T.²

6
- 0347337997
- Multiple fundamental frequency estimation based on harmonicitv and spectral smoothness
- Nov
- A. Klapuri, "Multiple fundamental frequency estimation based on harmonicitv and spectral smoothness," IEEE Trans. Speech Audio Process., vol. 11, no. 6, pp. 806-816, Nov. 2003.
- (2003) IEEE Trans. Speech Audio Process , vol.11 , Issue.6 , pp. 806-816
- Klapuri, A.¹

7
- 0003182324
- Organization of hierarchical perceptual sounds: Music scene analysis with autonomous processing modules and a quantitative information integration mechanism
- K. Kashino, K. Nakadai, T. Kinoshita, and H. Tanaka, "Organization of hierarchical perceptual sounds: Music scene analysis with autonomous processing modules and a quantitative information integration mechanism," in Proc. Int. Joint Conf. Artif. Intell., 1995, vol. 1, pp. 158-164.
- (1995) Proc. Int. Joint Conf. Artif. Intell , vol.1 , pp. 158-164
- Kashino, K.¹ Nakadai, K.² Kinoshita, T.³ Tanaka, H.⁴

8
- 0026744657
- Musical fundamental frequency tracking using a pattern recognition method
- J. C. Brown, "Musical fundamental frequency tracking using a pattern recognition method," J. Acoust. Soc. Amer., vol. 92-3, pp. 1394-1402. 1992.
- (1992) J. Acoust. Soc. Amer , vol.92 -3 , pp. 1394-1402
- Brown, J.C.¹

9
- 0033677009
- A robust predominant-FO estimation method for real-time detection of melody and bass lines in CD recordings
- Jun
- M. Goto, "A robust predominant-FO estimation method for real-time detection of melody and bass lines in CD recordings," in Proc. IEEE Int. Conf. Acoust, Speech, Signal Process., Jun. 2000. vol. 2, pp. 757-760.
- (2000) Proc. IEEE Int. Conf. Acoust, Speech, Signal Process , vol.2 , pp. 757-760
- Goto, M.¹

10
- 0034848863
- A predominant-F0 estimation method for CD recordings: MAP estimation using EM algorithm for adaptive tone models
- Sep
- M. Goto. "A predominant-F0 estimation method for CD recordings: MAP estimation using EM algorithm for adaptive tone models," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Sep. 2001, vol. 5, pp. 3365-3368.
- (2001) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.5 , pp. 3365-3368
- Goto, M.¹

11
- 4644242508
- A real-time music-scene-description system: Predominant-F0 estimation for detecting melody and bass lines in real-world audio signals
- M. Goto, "A real-time music-scene-description system: Predominant-F0 estimation for detecting melody and bass lines in real-world audio signals," Speech Commun., vol. 43, no. 4, pp. 311-329, 2004.
- (2004) Speech Commun , vol.43 , Issue.4 , pp. 311-329
- Goto, M.¹

12
- 33947644951
- Real-time pitch determination of one or more voices bv nonnegative matrix factorisation
- F. Sha and F. Saul, "Real-time pitch determination of one or more voices bv nonnegative matrix factorisation," in Proc. Neural Inf. Process. Syst, 2004, pp. 1233-1240.
- (2004) Proc. Neural Inf. Process. Syst , pp. 1233-1240
- Sha, F.¹ Saul, F.²

13
- 33947100466
- Shifted non-negative matrix factorization for sound source separation
- D. FitzGerald, M. Cranitch, and E. Coyle, "Shifted non-negative matrix factorization for sound source separation," in IEEE Workshop Statist. Signal Process., 2005, pp. 1132-1137.
- (2005) IEEE Workshop Statist. Signal Process , pp. 1132-1137
- FitzGerald, D.¹ Cranitch, M.² Coyle, E.³

14
- 33144463127
- Unsupervised analysis of polyphonic music by sparse coding
- Jan
- S. A. Abdallah and M. D. Plumbley, "Unsupervised analysis of polyphonic music by sparse coding," IEEE Trans. Neural Netw., vol. 17, no. 1, pp. 179-196, Jan. 2006.
- (2006) IEEE Trans. Neural Netw , vol.17 , Issue.1 , pp. 179-196
- Abdallah, S.A.¹ Plumbley, M.D.²

15
- 33744987389
- Sparse and shift-invariant representations of music
- Jan
- T. Blumensath and M. Davies, "Sparse and shift-invariant representations of music," IEEE Trans. Audio, Speech, Lang. Process., vol. 14. no. 1, pp. 50-57, Jan. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.1 , pp. 50-57
- Blumensath, T.¹ Davies, M.²

16
- 17344378590
- Bayesian harmonic models for musical pitch estimation and analysis
- S. Godsill and M. Davy, "Bayesian harmonic models for musical pitch estimation and analysis," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2002, vol. 2, pp. 1769-1772.
- (2002) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.2 , pp. 1769-1772
- Godsill, S.¹ Davy, M.²

17
- 0036297211
- Separation of harmonic sounds using linear models for the overtone series
- T. Virtanen and A. Klapuri, "Separation of harmonic sounds using linear models for the overtone series," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2002, vol. 2, pp. 1757-1760.
- (2002) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.2 , pp. 1757-1760
- Virtanen, T.¹ Klapuri, A.²

18
- 84872690534
- Robust multipitch estimation for the analysis and manipulation of polyphonic musical signals
- A. Klapuri, T. Virtanen, and J. Holm, "Robust multipitch estimation for the analysis and manipulation of polyphonic musical signals," in Proc. COST-G6 Conf. Digital Audio Effects, 2000, pp. 233-236.
- (2000) Proc. COST-G6 Conf. Digital Audio Effects , pp. 233-236
- Klapuri, A.¹ Virtanen, T.² Holm, J.³

19
- 19944419547
- Extraction of multiple fundamental frequencies from polyphonic music
- H. Kameoka, T. Nishimoto, and S. Sagayama, "Extraction of multiple fundamental frequencies from polyphonic music," Proc. Int. Congr. Acoust., pp. 59-62, 2004.
- (2004) Proc. Int. Congr. Acoust , pp. 59-62
- Kameoka, H.¹ Nishimoto, T.² Sagayama, S.³

20
- 4544303298
- Separation of harmonic structures based on tied Gaussian mixture model and information criterion for concurrent sounds
- May
- H. Kameoka, T. Nishimoto, and S. Sagayama, "Separation of harmonic structures based on tied Gaussian mixture model and information criterion for concurrent sounds," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., May 2004, vol. 4, pp. 297-300.
- (2004) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.4 , pp. 297-300
- Kameoka, H.¹ Nishimoto, T.² Sagayama, S.³

21
- 0036497684
- Segregating information about the size and shape of the vocal tract using a time-domain auditory model: The stabilised wavelet-Mellin transform
- T. Irino and R. D. Patterson, "Segregating information about the size and shape of the vocal tract using a time-domain auditory model: The stabilised wavelet-Mellin transform," Speech Commun., vol. 36, no. 3. pp. 181-203, 2002.
- (2002) Speech Commun , vol.36 , Issue.3 , pp. 181-203
- Irino, T.¹ Patterson, R.D.²

22
- 85131821539
- Mel-generalized cepstral analysis - A unified approach to speech spectral estimation
- K. Tokuda, T. Kobayashi, T. Masuko, and S. Imai, "Mel-generalized cepstral analysis - A unified approach to speech spectral estimation," in Proc. Int. Conf. Spoken Lang. Process., 1994, pp. 1043-1046.
- (1994) Proc. Int. Conf. Spoken Lang. Process , pp. 1043-1046
- Tokuda, K.¹ Kobayashi, T.² Masuko, T.³ Imai, S.⁴

23
- 0043087956
- in Japanese Elec. Commun. Lab, NTT, Tokyo, Japan, Tech. Rep. 3107
- S. Saito and F. ltakura, "The theoretical consideration of statistically optimum methods for speech spectral density," (in Japanese) Elec. Commun. Lab., NTT, Tokyo, Japan, 1966, Tech. Rep. 3107.
- (1966) The theoretical consideration of statistically optimum methods for speech spectral density
- Saito, S.¹ ltakura, F.²

24
- 0003386064
- Predictive coding of speech signals
- B. S. Atal and M. R. Schroeder, "Predictive coding of speech signals," in Proc. Int. Conf. Speech Commun. and Process., 1967, pp. 360-361.
- (1967) Proc. Int. Conf. Speech Commun. and Process , pp. 360-361
- Atal, B.S.¹ Schroeder, M.R.²

25
- 2442437071
- RWC music database: Music genre database and musical instrument sound database
- Oct
- M. Goto, H. Hashiguchi, T. Nishimura, and R. Oka, "RWC music database: Music genre database and musical instrument sound database," in Proc. Int. Conf. Music Inf. Retrieval, Oct. 2003, pp. 229-230.
- (2003) Proc. Int. Conf. Music Inf. Retrieval , pp. 229-230
- Goto, M.¹ Hashiguchi, H.² Nishimura, T.³ Oka, R.⁴

26
- 64849094083
- Iterative multipitch estimation algorithm for MAP specmurt analvsis,
- Aug. 2006, 2006-MUS-66, pp
- S. Saito, H. Kameoka, N. Ono, and S. Sagayama, "Iterative multipitch estimation algorithm for MAP specmurt analvsis," (in Japanese) IPSJ SIG Tech. Rep., Aug. 2006, vol. 2006-MUS-66, pp. 85-92.
- in Japanese) IPSJ SIG Tech. Rep , pp. 85-92
- Saito, S.¹ Kameoka, H.² Ono, N.³ Sagayama, S.⁴

27
- 0141623871
- RWC music database: Popular, classical, and jazz music database
- Oct
- M. Goto, H. Hashiguchi, T. Nishimura, and R. Oka, "RWC music database: Popular, classical, and jazz music database," in Proc. Int. Svmp. Music Inf. Retrieval, Oct. 2002, pp. 287-288.
- (2002) Proc. Int. Svmp. Music Inf. Retrieval , pp. 287-288
- Goto, M.¹ Hashiguchi, H.² Nishimura, T.³ Oka, R.⁴

28
- 50249173884
- A multipitch analyzer based on harmonic temporal structured clustering
- Mar
- H. Kameoka, T. Nishimoto, and S. Sagayama, "A multipitch analyzer based on harmonic temporal structured clustering," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 982-994, Mar. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.3 , pp. 982-994
- Kameoka, H.¹ Nishimoto, T.² Sagayama, S.³

29
- 11144277258
- Automatic rhythm transcription from multiphonic MIDI signals
- Oct
- H. Takeda, T. Nishimoto, and S. Sagayama, "Automatic rhythm transcription from multiphonic MIDI signals," in Proc. Int. Conf. Music Inf. Retrieval, Oct. 2003, pp. 263-264.
- (2003) Proc. Int. Conf. Music Inf. Retrieval , pp. 263-264
- Takeda, H.¹ Nishimoto, T.² Sagayama, S.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.