-
1
-
-
84892200847
-
-
New York, NY, USA: Springer
-
A. Klapuri and M. Davy, Eds., Signal Processing Methods for Music Transcription. New York, NY, USA: Springer, 2006.
-
(2006)
Signal Processing Methods for Music Transcription
-
-
Klapuri, A.1
Davy, M.2
-
2
-
-
64849117345
-
Unsupervised single-channel music source separation by average harmonic structure modeling
-
May
-
Z. Duan, Y. Zhang, C. Zhang, and Z. Shi, "Unsupervised single-channel music source separation by average harmonic structure modeling," IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 4, pp. 766-778, May 2008.
-
(2008)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.16
, Issue.4
, pp. 766-778
-
-
Duan, Z.1
Zhang, Y.2
Zhang, C.3
Shi, Z.4
-
3
-
-
80051662928
-
Improving melody extraction using probabilistic latent component analysis
-
J. Han and C.-W. Chen, "Improving melody extraction using probabilistic latent component analysis," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2011, pp. 33-36.
-
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2011
, pp. 33-36
-
-
Han, J.1
Chen, C.-W.2
-
4
-
-
69249202377
-
Monaural speech separation and recognition challenge
-
M. Cooke, J. R. Hershey, and S. Rennie, "Monaural speech separation and recognition challenge," Comput. Speech Lang., vol. 24, pp. 1-15, 2010.
-
(2010)
Comput. Speech Lang.
, vol.24
, pp. 1-15
-
-
Cooke, M.1
Hershey, J.R.2
Rennie, S.3
-
5
-
-
33646791479
-
Prosody analysis and modeling for emotional speech synthesis
-
D.-n. Jiang, W. Zhang, L.-q. Shen, and L.-h. Cai, "Prosody analysis and modeling for emotional speech synthesis," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2005, pp. 281-284.
-
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2005
, pp. 281-284
-
-
Jiang, D.-N.1
Zhang, W.2
Shen, L.-Q.3
Cai, L.-H.4
-
6
-
-
80052339383
-
Some experiments on the recognition of speech, with one and two ears
-
E. C. Cherry, "Some experiments on the recognition of speech, with one and two ears," J. Acoust. Soc. Amer., vol. 25, pp. 975-979, 1953.
-
(1953)
J. Acoust. Soc. Amer.
, vol.25
, pp. 975-979
-
-
Cherry, E.C.1
-
7
-
-
0034319894
-
A computationally efficient multipitch analysis model
-
Nov.
-
T. Tolonen and M. Karjalainen, "A computationally efficient multipitch analysis model," IEEE Trans. Speech Audio Process., vol. 8, no. 6, pp. 708-716, Nov. 2000.
-
(2000)
IEEE Trans. Speech Audio Process.
, vol.8
, Issue.6
, pp. 708-716
-
-
Tolonen, T.1
Karjalainen, M.2
-
8
-
-
0032663192
-
Multiple period estimation and pitch perception model
-
A. de Cheveigné and H. Kawahara, "Multiple period estimation and pitch perception model," Speech Commun., vol. 27, pp. 175-185, 1999.
-
(1999)
Speech Commun.
, vol.27
, pp. 175-185
-
-
De Cheveigné, A.1
Kawahara, H.2
-
9
-
-
33645360635
-
Bayesian analysis of polyphonic western tonal music
-
M. Davy, S. J. Godsill, and J. Idier, "Bayesian analysis of polyphonic western tonal music," J. Acoust. Soc. Amer., vol. 119, pp. 2498-2517, 2006.
-
(2006)
J. Acoust. Soc. Amer.
, vol.119
, pp. 2498-2517
-
-
Davy, M.1
Godsill, S.J.2
Idier, J.3
-
10
-
-
0028210066
-
Fundamental frequency estimation of musical signals using a two-way mismatch procedure
-
R. C. Maher and J. W. Beauchamp, "Fundamental frequency estimation of musical signals using a two-way mismatch procedure," J. Acoust. Soc. Amer., vol. 95, no. 4, pp. 2254-2263, 1994.
-
(1994)
J. Acoust. Soc. Amer.
, vol.95
, Issue.4
, pp. 2254-2263
-
-
Maher, R.C.1
Beauchamp, J.W.2
-
11
-
-
4644242508
-
A real-time music-scene-description system: Predominant-f0 estimation for detecting melody and bass lines in real-world audio signals
-
M. Goto, "A real-time music-scene-description system: Predominant-f0 estimation for detecting melody and bass lines in real-world audio signals," Speech Commun., vol. 43, no. 4, pp. 311-329, 2004.
-
(2004)
Speech Commun.
, vol.43
, Issue.4
, pp. 311-329
-
-
Goto, M.1
-
12
-
-
33646767114
-
Multiple fundamental frequency estimation of polyphonic music signals
-
C. Yeh, A. Röbel, and X. Rodet, "Multiple fundamental frequency estimation of polyphonic music signals," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2005, pp. 225-228.
-
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2005
, pp. 225-228
-
-
Yeh, C.1
Röbel, A.2
Rodet, X.3
-
13
-
-
51449099172
-
Multiple fundamental frequency estimation using Gaussian smoothness
-
A. Pertusa and J. M. Inesta, "Multiple fundamental frequency estimation using Gaussian smoothness," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2008, pp. 105-108.
-
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2008
, pp. 105-108
-
-
Pertusa, A.1
Inesta, J.M.2
-
14
-
-
84873444806
-
Multiple fundamental frequency estimation by summing harmonic amplitudes
-
A. Klapuri, "Multiple fundamental frequency estimation by summing harmonic amplitudes," in Proc. ISMIR, 2006, pp. 216-221.
-
Proc. ISMIR, 2006
, pp. 216-221
-
-
Klapuri, A.1
-
16
-
-
64849091277
-
Specmurt analysis of polyphonic music signals
-
S. Saito, H. Kameoka, K. Takahashi, T. Nishimoto, and S. Sagayama, "Specmurt analysis of polyphonic music signals," IEEE Trans. Speech Audio Process., vol. 16, no. 3, pp. 639-650, 2008.
-
(2008)
IEEE Trans. Speech Audio Process.
, vol.16
, Issue.3
, pp. 639-650
-
-
Saito, S.1
Kameoka, H.2
Takahashi, K.3
Nishimoto, T.4
Sagayama, S.5
-
17
-
-
77956540787
-
Multiple fundamental frequency estimation by modeling spectral peaks and non-peak regions
-
Nov.
-
Z. Duan, B. Pardo, and C. Zhang, "Multiple fundamental frequency estimation by modeling spectral peaks and non-peak regions," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 8, pp. 2121-2133, Nov. 2010.
-
(2010)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.18
, Issue.8
, pp. 2121-2133
-
-
Duan, Z.1
Pardo, B.2
Zhang, C.3
-
18
-
-
0037767686
-
A multipitch tracking algorithm for noisy speech
-
May
-
M. Wu, D. Wang, and G. J. Brown, "A multipitch tracking algorithm for noisy speech," IEEE Trans. Speech Audio Process., vol. 11, no. 3, pp. 229-241, May 2003.
-
(2003)
IEEE Trans. Speech Audio Process.
, vol.11
, Issue.3
, pp. 229-241
-
-
Wu, M.1
Wang, D.2
Brown, G.J.3
-
19
-
-
84899027288
-
Real-time pitch determination of one or more voices by nonnegative matrix factorization
-
F. Sha and L. Saul, "Real-time pitch determination of one or more voices by nonnegative matrix factorization," in Proc. Adv. Neural Inf. Process. Syst. (NIPS), 2005, pp. 1233-1240.
-
Proc. Adv. Neural Inf. Process. Syst. (NIPS), 2005
, pp. 1233-1240
-
-
Sha, F.1
Saul, L.2
-
20
-
-
33646773610
-
Discriminative training of hidden Markov models for multiple pitch tracking
-
F. Bach and M. Jordan, "Discriminative training of hidden Markov models for multiple pitch tracking," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2005, pp. 489-492.
-
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2005
, pp. 489-492
-
-
Bach, F.1
Jordan, M.2
-
21
-
-
85008056718
-
HMM-based multipitch tracking for noisy and reverberant speech
-
Jul.
-
Z. Jin and D. Wang, "HMM-based multipitch tracking for noisy and reverberant speech," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 5, pp. 1091-1102, Jul. 2011.
-
(2011)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.19
, Issue.5
, pp. 1091-1102
-
-
Jin, Z.1
Wang, D.2
-
23
-
-
33846199251
-
A discriminative model for polyphonic piano transcription
-
DOI:10.1155/2007/48317
-
G. E. Poliner and D. P. W. Ellis, "A discriminative model for polyphonic piano transcription," EURASIP J. Adv. Signal Process., 2007, DOI:10.1155/2007/48317.
-
(2007)
EURASIP J. Adv. Signal Process.
-
-
Poliner, G.E.1
Ellis, D.P.W.2
-
24
-
-
50249173884
-
A multipitch analyzer based on harmonic temporal structured clustering
-
Mar.
-
H. Kameoka, T. Nishimoto, and S. Sagayama, "A multipitch analyzer based on harmonic temporal structured clustering," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 982-994, Mar. 2007.
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.15
, Issue.3
, pp. 982-994
-
-
Kameoka, H.1
Nishimoto, T.2
Sagayama, S.3
-
25
-
-
77955816771
-
Multiple-f0 tracking based on a high-order HMM model
-
W.-C. Chang, A. W. Y. Su, C. Yeh, A. Robel, and X. Rodet, "Multiple-f0 tracking based on a high-order HMM model," in Proc. Int. Conf. Digital Audio Effects (DAFx), 2008.
-
Proc. Int. Conf. Digital Audio Effects (DAFx), 2008
-
-
Chang, W.-C.1
Su, A.W.Y.2
Yeh, C.3
Robel, A.4
Rodet, X.5
-
26
-
-
50249167077
-
Single and multiple f0 contour estimation through parametric spectrogram modeling of speech in noisy environments
-
Jul.
-
J. Le Roux, H. Kameoka, N. Ono, A. de Cheveigne, and S. Sagayama, "Single and multiple f0 contour estimation through parametric spectrogram modeling of speech in noisy environments," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 4, pp. 1135-1145, Jul. 2007.
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.15
, Issue.4
, pp. 1135-1145
-
-
Le Roux, J.1
Kameoka, H.2
Ono, N.3
De Cheveigne, A.4
Sagayama, S.5
-
27
-
-
78049397081
-
Song-level multi-pitch tracking by heavily constrained clustering
-
Z. Duan, J. Han, and B. Pardo, "Song-level multi-pitch tracking by heavily constrained clustering," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2010, pp. 57-60.
-
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2010
, pp. 57-60
-
-
Duan, Z.1
Han, J.2
Pardo, B.3
-
28
-
-
0032678384
-
A sound source identification system for ensemble music based on template adaptation and music stream extraction
-
K. Kashino and H. Murase, "A sound source identification system for ensemble music based on template adaptation and music stream extraction," Speech Commun., pp. 337-349, 1999.
-
(1999)
Speech Commun.
, pp. 337-349
-
-
Kashino, K.1
Murase, H.2
-
29
-
-
33744978751
-
Musical source separation using time-frequency source priors
-
DOI 10.1109/TSA.2005.860342
-
E. Vincent, "Musical source separation using time-frequency source priors," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 1, pp. 91-98, 2006.
-
(2006)
IEEE Transactions on Audio, Speech and Language Processing
, vol.14
, Issue.1
, pp. 91-98
-
-
Vincent, E.1
-
30
-
-
84873461145
-
Second fiddle is important too: Pitch tracking individual voices in polyphonic music
-
M. Bay, A. F. Ehmann, J. W. Beauchamp, P. Smaragdis, and J. S. Downie, "Second fiddle is important too: Pitch tracking individual voices in polyphonic music," in Proc. Int. Soc. Music Inf. Retrieval Conf. (ISMIR), 2012, pp. 319-324.
-
Proc. Int. Soc. Music Inf. Retrieval Conf. (ISMIR), 2012
, pp. 319-324
-
-
Bay, M.1
Ehmann, A.F.2
Beauchamp, J.W.3
Smaragdis, P.4
Downie, J.S.5
-
31
-
-
79951599228
-
A probabilistic interaction model for multipitch tracking with factorial hidden Markov models
-
May
-
M. Wohlmayr, M. Stark, and F. Pernkopf, "A probabilistic interaction model for multipitch tracking with factorial hidden Markov models," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 4, pp. 799-810, May 2011.
-
(2011)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.19
, Issue.4
, pp. 799-810
-
-
Wohlmayr, M.1
Stark, M.2
Pernkopf, F.3
-
32
-
-
84867946385
-
An unsupervised approach to cochannel speech separation
-
Jan.
-
K. Hu and D. Wang, "An unsupervised approach to cochannel speech separation," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 1, pp. 122-131, Jan. 2013.
-
(2013)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.21
, Issue.1
, pp. 122-131
-
-
Hu, K.1
Wang, D.2
-
34
-
-
84863772450
-
Speech analysis/synthesis based on a sinusoidal representation
-
Aug.
-
R. J. McAulay and T. F. Quatieri, "Speech analysis/synthesis based on a sinusoidal representation," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-34, no. 4, pp. 744-754, Aug. 1986.
-
(1986)
IEEE Trans. Acoust., Speech, Signal Process.
, vol.34
, Issue.4
, pp. 744-754
-
-
McAulay, R.J.1
Quatieri, T.F.2
-
35
-
-
0027154415
-
Tracking of partials for additive sound synthesis using hidden Markov models
-
P. Depalle, G. Garcia, and X. Rodet, "Tracking of partials for additive sound synthesis using hidden Markov models," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1993, pp. 225-228.
-
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1993
, pp. 225-228
-
-
Depalle, P.1
Garcia, G.2
Rodet, X.3
-
36
-
-
34547507985
-
Sound source tracking and formation using normalized cuts
-
M. Lagrange and G. Tzanetakis, "Sound source tracking and formation using normalized cuts," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2007, pp. 61-64.
-
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2007
, pp. 61-64
-
-
Lagrange, M.1
Tzanetakis, G.2
-
38
-
-
0042377235
-
Constrained k-means clustering with background knowledge
-
K. Wagstaff, C. Cardie, S. Rogers, and S. Schroedl, "Constrained k-means clustering with background knowledge," in Proc. Int. Conf. Mach. Learn. (ICML), 2001, pp. 577-584.
-
Proc. Int. Conf. Mach. Learn. (ICML), 2001
, pp. 577-584
-
-
Wagstaff, K.1
Cardie, C.2
Rogers, S.3
Schroedl, S.4
-
39
-
-
36849049804
-
Efficient incremental constrained clustering
-
I. Davidson, S. Ravi, and M. Ester, "Efficient incremental constrained clustering," in Proc. ACM Conf. Knowl. Discov. Data Mining (KDD), 2007, pp. 240-249.
-
Proc. ACM Conf. Knowl. Discov. Data Mining (KDD), 2007
, pp. 240-249
-
-
Davidson, I.1
Ravi, S.2
Ester, M.3
-
40
-
-
34547508917
-
Analysis of musical instrument sounds by source-filter-decay model
-
A. Klapuri, "Analysis of musical instrument sounds by source-filter-decay model," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2007, pp. 53-56.
-
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2007
, pp. 53-56
-
-
Klapuri, A.1
-
41
-
-
76949083398
-
Dynamic spectral envelope-modeling for timbre analysis of musical instrument sounds
-
Mar.
-
J. J. Burred, A. Röbel, and T. Sikora, "Dynamic spectral envelope-modeling for timbre analysis of musical instrument sounds," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 663-674, Mar. 2010.
-
(2010)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.18
, Issue.3
, pp. 663-674
-
-
Burred, J.J.1
Röbel, A.2
Sikora, T.3
-
43
-
-
80052999230
-
Soundprism: An online system for score-informed source separation of music audio
-
Dec.
-
Z. Duan and B. Pardo, "Soundprism: An online system for score-informed source separation of music audio," IEEE J. Sel. Topics Signal Process., vol. 5, no. 6, pp. 1205-1215, Dec. 2011.
-
(2011)
IEEE J. Sel. Topics Signal Process.
, vol.5
, Issue.6
, pp. 1205-1215
-
-
Duan, Z.1
Pardo, B.2
-
44
-
-
85137465653
-
An improved cepstral method for deconvolution of source-filter systems with discrete spectra: Application to musical sounds
-
T. Galas and X. Rodet, "An improved cepstral method for deconvolution of source-filter systems with discrete spectra: Application to musical sounds," in Proc. Int. Comput. Music Conf. (ICMC), 1990, pp. 82-84.
-
Proc. Int. Comput. Music Conf. (ICMC), 1990
, pp. 82-84
-
-
Galas, T.1
Rodet, X.2
-
45
-
-
84856280484
-
-
Master's thesis, Univ. Stuttgart Fakultät Informatik, Stuttgart, Germany
-
D. Schwarz, "Spectral envelopes in sound analysis and synthesis," Master's thesis, Univ. Stuttgart Fakultät Informatik, Stuttgart, Germany, 1998.
-
(1998)
Spectral Envelopes in Sound Analysis and Synthesis
-
-
Schwarz, D.1
-
46
-
-
0029529426
-
Regularized estimation of cepstrum envelope from discrete frequency points
-
O. Cappé, J. Laroche, and E. Moulines, "Regularized estimation of cepstrum envelope from discrete frequency points," in Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust. (WASPAA), 1995, pp. 213-216.
-
Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust. (WASPAA), 1995
, pp. 213-216
-
-
Cappé, O.1
Laroche, J.2
Moulines, E.3
-
47
-
-
0036214787
-
Yin, a fundamental frequency estimator for speech and music
-
A. de Cheveigné and H. Kawahara, "Yin, a fundamental frequency estimator for speech and music," J. Acoust. Soc. Amer., vol. 111, pp. 1917-1930, 2002.
-
(2002)
J. Acoust. Soc. Amer.
, vol.111
, pp. 1917-1930
-
-
De Cheveigné, A.1
Kawahara, H.2
-
48
-
-
84865703367
-
A pitch tracking corpus with evaluation on multipitch tracking scenario
-
G. Pirker, M. Wohlmayr, S. Petrik, and F. Pernkopf, "A pitch tracking corpus with evaluation on multipitch tracking scenario," in Proc. Interspeech, 2011, pp. 1509-1512.
-
Proc. Interspeech, 2011
, pp. 1509-1512
-
-
Pirker, G.1
Wohlmayr, M.2
Petrik, S.3
Pernkopf, F.4
-
49
-
-
0003548585
-
-
NTIS, order number PB01-100354, now available from LDC
-
J. Garofolo, L. Lamel, W. Fisher, J. Fiscus, D. Pallett, and N. Dahlgren, "The DARPA TIMIT acoustic-phonetic continuous speech corpus CDROM," NTIS, order number PB01-100354, 1993, now available from LDC.
-
(1993)
The DARPA TIMIT Acoustic-phonetic Continuous Speech Corpus CDROM
-
-
Garofolo, J.1
Lamel, L.2
Fisher, W.3
Fiscus, J.4
Pallett, D.5
Dahlgren, N.6
-
50
-
-
4444257069
-
Praat, a system for doing phonetics by computer
-
P. Boersma, "Praat, a system for doing phonetics by computer," Glot Int., vol. 5, no. 9/10, pp. 341-345, 2001.
-
(2001)
Glot Int.
, vol.5
, Issue.9-10
, pp. 341-345
-
-
Boersma, P.1
-
51
-
-
77955695149
-
A tandem algorithm for pitch estimation and voiced speech segregation
-
Nov.
-
G. Hu and D. Wang, "A tandem algorithm for pitch estimation and voiced speech segregation," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 8, pp. 2067-2079, Nov. 2010.
-
(2010)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.18
, Issue.8
, pp. 2067-2079
-
-
Hu, G.1
Wang, D.2