Volume 22, Issue 11, 2011, Pages 1744-1756

Learning speaker-specific characteristics with a deep neural architecture

Author keywords

Deep neural architecture; hybrid learning strategy; overcomplete representation; speaker comparison; speaker segmentation; speaker verification; speaker-specific characteristics

Indexed keywords

HYBRID LEARNING; NEURAL ARCHITECTURES; OVERCOMPLETE REPRESENTATIONS; SPEAKER COMPARISON; SPEAKER SEGMENTATIONS; SPEAKER VERIFICATION; SPEAKER-SPECIFIC CHARACTERISTICS;

EID: 80455143732     PISSN: 10459227     EISSN: None     Source Type: Journal    
DOI: 10.1109/TNN.2011.2167240     Document Type: Article
Times cited: 108

References (41)
  • 1
    • 0031233424
    • Speaker recognition: A tutorial
    • Sep
    • J. Campbell, "Speaker recognition: A tutorial," IEEE Proc., vol. 85, no. 8, pp. 1437-1462, Sep. 1997.
    • (1997) IEEE Proc. , vol.85 , Issue.8 , pp. 1437-1462
    • Campbell, J.1
  • 4
    • 36248934935
    • Ph.D. thesis, School Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA
    • Q. Jin, "Robust speaker recognition," Ph.D. thesis, School Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, 2007.
    • (2007) Robust Speaker Recognition
    • Jin, Q.1
  • 5
    • 0030247355
    • Robust speaker recognition: A feature-based approach
    • Sep
    • R. Mammone, X. Zhang, and R. Ramachandran, "Robust speaker recognition: A feature-based approach," IEEE Signal Process. Mag., vol. 13, no. 5, pp. 1-58, Sep. 1996.
    • (1996) IEEE Signal Process. Mag. , vol.13 , Issue.5 , pp. 1-58
    • Mammone, R.1    Zhang, X.2    Ramachandran, R.3
  • 6
    • 0029355999
    • Speaker Identification and verification using Gaussian mixture speaker models
    • Aug
    • D. A. Reynolds, "Speaker Identification and verification using Gaussian mixture speaker models," Speech Commun., vol. 17, nos. 1-2, pp. 91-108, Aug. 1995.
    • (1995) Speech Commun. , vol.17 , Issue.1-2 , pp. 91-108
    • Reynolds, D.A.1
  • 7
    • 0001091769
    • Speaker Identification and verification using Gaussian mixture speaker models
    • Jan
    • D. A. Reynolds, "Speaker Identification and verification using Gaussian mixture speaker models," MIT Lincoln Lab. J., vol. 8, pp. 173-191, Jan. 1995.
    • (1995) MIT Lincoln Lab. J. , vol.8 , pp. 173-191
    • Reynolds, D.A.1
  • 9
    • 79953810111
    • Iterative gaussianization: From ICA to random rotations
    • Apr
    • V. Laparra, G. Camps-Valls, and J. Malo, "Iterative gaussianization: From ICA to random rotations," IEEE Trans. Neural Netw., vol. 22, no. 4, pp. 537-549, Apr. 2011.
    • (2011) IEEE Trans. Neural Netw. , vol.22 , Issue.4 , pp. 537-549
    • Laparra, V.1    Camps-Valls, G.2    Malo, J.3
  • 10
    • 34547975052
    • Scaling learning algorithms toward AI
    • Cambridge, MA: MIT Press
    • Y. Bengio and Y. LeCun, "Scaling learning algorithms toward AI," in Large Scale Kernel Machine. Cambridge, MA: MIT Press, 2007, pp. 321-360.
    • (2007) Large Scale Kernel Machine , pp. 321-360
    • Bengio, Y.1    Le Cun, Y.2
  • 11
    • 69349090197
    • Learning deep architectures for AI
    • Y. Bengio, "Learning deep architectures for AI," Foundations Trends Mach. Learn., vol. 2, no. 1, pp. 1-127, 2009.
    • (2009) Foundations Trends Mach. Learn. , vol.2 , Issue.1 , pp. 1-127
    • Bengio, Y.1
  • 12
    • 35348818718
    • Learning multiple layers of representation
    • Oct
    • G. E. Hinton, "Learning multiple layers of representation," Trends Cogn. Sci., vol. 11, no. 10, pp. 428-434, Oct. 2007.
    • (2007) Trends Cogn. Sci. , vol.11 , Issue.10 , pp. 428-434
    • Hinton, G.E.1
  • 13
    • 33746600649
    • Reducing the dimensionality of data with neural networks
    • Jul
    • G. E. Hinton and R. R. Salakhutdinov, "Reducing the dimensionality of data with neural networks," Science, vol. 313, no. 5786, pp. 504-507, Jul. 2006.
    • (2006) Science , vol.313 , Issue.5786 , pp. 504-507
    • Hinton, G.E.1    Salakhutdinov, R.R.2
  • 14
    • 0032203257
    • Gradient based learning applied to document recognition
    • Nov
    • Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient based learning applied to document recognition," IEEE Proc., vol. 86, no. 11, pp. 2278-2324, Nov. 1998.
    • (1998) IEEE Proc. , vol.86 , Issue.11 , pp. 2278-2324
    • Lecun, Y.1    Bottou, L.2    Bengio, Y.3    Haffner, P.4
  • 15
    • 33745805403
    • A fast learning algorithm for deep belief nets
    • Jul
    • G. E. Hinton, S. Osindero, and Y. W. Teh, "A fast learning algorithm for deep belief nets," Neural Comput., vol. 18, no. 7, pp. 1527-1554, Jul. 2006.
    • (2006) Neural Comput. , vol.18 , Issue.7 , pp. 1527-1554
    • Hinton, G.E.1    Osindero, S.2    Teh, Y.W.3
  • 20
    • 80455158726
    • Learning a non-linear embedding by preserving class neighbourhood structure
    • R. Salakhutdinov and G. Hinton, "Learning a non-linear embedding by preserving class neighbourhood structure," in Proc. Art. Intell. Statis., vol. 2. 2007, pp. 412-419.
    • (2007) Proc. Art. Intell. Statis. , vol.2 , pp. 412-419
    • Salakhutdinov, R.1    Hinton, G.2
  • 21
    • 34249661090
    • Synergistic face detection and pose estimation with energy-based models
    • May
    • M. Osadchy, Y. LeCun, and M. Miller, "Synergistic face detection and pose estimation with energy-based models," J. Mach. Learn. Res., vol. 8, pp. 1197-1215, May 2007.
    • (2007) J. Mach. Learn. Res. , vol.8 , pp. 1197-1215
    • Osadchy, M.1    Le Cun, Y.2    Miller, M.3
  • 25
    • 77956502334
    • Unsupervised feature learning for audio classification using convolutional deep belief networks
    • H. Lee, Y. Largman, P. Pham, and A. Y. Ng, "Unsupervised feature learning for audio classification using convolutional deep belief networks," in Proc. Adv. Neural Informat. Process. Syst., vol. 22. 2010, pp. 1-9.
    • (2010) Proc. Adv. Neural Informat. Process. Syst. , vol.22 , pp. 1-9
    • Lee, H.1    Largman, Y.2    Pham, P.3    Ng, A.Y.4
  • 26
    • 80455140956
    • Linguistic Data Consortium (LDC), Philadelphia PA [Online], Available
    • Linguistic Data Consortium (LDC), Philadelphia, PA [Online]. Available: http://www.ldc.upenn.edu.com
  • 27
    • 80455158727
    • Russian Speech Corpus [Online], Available
    • Russian Speech Corpus [Online]. Available: http://www.repository. voxforge1.org/
  • 28
    • 80455132745
    • Shenzhen Inst. Advanced Technology, Chinese Academy Science, Shenzhen, China, Tech. Rep
    • L. Wang, "Chinese speech corpus for speaker recognition," Shenzhen Inst. Advanced Technology, Chinese Academy Science, Shenzhen, China, Tech. Rep., pp. 1-66, 2008.
    • (2008) Chinese Speech Corpus for Speaker Recognition , pp. 1-66
    • Wang, L.1
  • 29
    • 84862617580
    • Loss functions for discriminative training of energy-based models
    • Y. LeCun and F. J. Huang, "Loss functions for discriminative training of energy-based models," in Proc. Art. Intell. Statist., 2005, pp. 1-8.
    • (2005) Proc. Art. Intell. Statist. , pp. 1-8
    • Le Cun, Y.1    Huang, F.J.2
  • 31
    • 0036487238
    • Toward better making a decision in speaker verification
    • Feb
    • K. Chen, "Toward better making a decision in speaker verification," Pattern Recog., vol. 36, no. 2, pp. 329-346, Feb. 2003.
    • (2003) Pattern Recog. , vol.36 , Issue.2 , pp. 329-346
    • Chen, K.1
  • 32
    • 0036505591
    • Capture inter-speaker information with a neural network for speaker identification
    • Mar
    • L. Wang, K. Chen, and H. Chi, "Capture inter-speaker information with a neural network for speaker identification," IEEE Trans. Neural Netw., vol. 13, no. 2, pp. 436-445, Mar. 2002.
    • (2002) IEEE Trans. Neural Netw. , vol.13 , Issue.2 , pp. 436-445
    • Wang, L.1    Chen, K.2    Chi, H.3
  • 35
    • 0029209272
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • Jan
    • D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Speech Audio Process., vol. 3, no. 1, pp. 72-83, Jan. 1995.
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.1 , pp. 72-83
    • Reynolds, D.A.1    Rose, R.C.2
  • 37
    • 79951670099
    • Optimized discriminative kernel for SVM scoring and its application to speaker verification
    • Feb
    • S. Zhang and M. Mak, "Optimized discriminative kernel for SVM scoring and its application to speaker verification," IEEE Trans. Neural Netw., vol. 22, no. 2, pp. 173-185, Feb. 2011.
    • (2011) IEEE Trans. Neural Netw. , vol.22 , Issue.2 , pp. 173-185
    • Zhang, S.1    Mak, M.2
  • 38
    • 38949122754
    • Speaker segmentation and clustering
    • May
    • M. Kotti, V. Moschou, and C. Kotropoulos, "Speaker segmentation and clustering," Signal Process., vol. 88, no. 5, pp. 1091-1124, May 2008.
    • (2008) Signal Process. , vol.88 , Issue.5 , pp. 1091-1124
    • Kotti, M.1    Moschou, V.2    Kotropoulos, C.3
  • 39
    • 0034273195
    • DISTBIC: A speaker-based segmentation for audio data indexing
    • Sep
    • P. Delacourt and C. Wellekens, "DISTBIC: A speaker-based segmentation for audio data indexing," Speech Commun., vol. 32, nos. 1-2, pp. 111-126, Sep. 2000.
    • (2000) Speech Commun. , vol.32 , Issue.1-2 , pp. 111-126
    • Delacourt, P.1    Wellekens, C.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.