



Volume 20, Issue 1, 2013, Pages 77-116

More is better: Likelihood ratio-based forensic voice comparison with vocalic segmental cepstra frontends

Author keywords

Cepstrum; Forensic voice comparison; Likelihood ratio; Vowel spectra


EID: 84880076430     PISSN: 17488885     EISSN: 17488893     Source Type: Journal    
DOI: 10.1558/ijsll.v20i1.77     Document Type: Article
Times cited : (19)

References (45)
  • 1
    • 1342324886
    • Evaluation of trace evidence in the form of multivariate data
    • Aitken, C.G.G. and Lucy, D. (2004) Evaluation of trace evidence in the form of multivariate data. Applied Statistics 53(4): 109-122. http://dx.doi.org/10.1046/j.0035-9254.2003.05271.x
    • (2004) Applied Statistics , vol.53 , Issue.4 , pp. 109-122
    • Aitken, C.G.G.1    Lucy, D.2
  • 3
    • 29044433376
    • Application independent evaluation of speaker detection
    • Brümmer, N. and du Preez, J. (2006) Application independent evaluation of speaker detection. Computer Speech and Language 20(2-3): 230-275. http://dx.doi.org/10.1016/j.csl.2005.08.001
    • (2006) Computer Speech and Language , vol.20 , Issue.2-3 , pp. 230-275
    • Brümmer, N.1    du Preez, J.2
  • 5
    • 0034225372
    • Static and dynamic vowels in a 'cepstro-phonetic' subspace
    • Clermont, F. and Itahashi, S. (2000) Static and dynamic vowels in a 'cepstro-phonetic' subspace. Journal of the Acoustical Society of Japan 21(4): 221-223. http://dx.doi.org/10.1250/ast.21.221
    • (2000) Journal of the Acoustical Society of Japan , vol.21 , Issue.4 , pp. 221-223
    • Clermont, F.1    Itahashi, S.2
  • 7
    • 0010928235
    • An evaluation of selected acoustic parameters for use in speaker identification
    • Doherty, E.T. (1976) An evaluation of selected acoustic parameters for use in speaker identification. Journal of Phonetics 4: 321-326.
    • (1976) Journal of Phonetics , vol.4 , pp. 321-326
    • Doherty, E.T.1
  • 8
    • 0027495731
    • An illustration of the advantages of efficient statistical methods for RFLP analysis in forensic science
    • Evett, I.W., Scranage, J. and Pinchin, R. (1993) An illustration of the advantages of efficient statistical methods for RFLP analysis in forensic science. American Journal of Human Genetics 52: 498-505.
    • (1993) American Journal of Human Genetics , vol.52 , pp. 498-505
    • Evett, I.W.1    Scranage, J.2    Pinchin, R.3
  • 9
    • 0019555090
    • Cepstral analysis technique for automatic speaker verification
    • Furui, S. (1981) Cepstral analysis technique for automatic speaker verification. IEEE Transactions on Acoustics Speech and Signal Processing 29(2): 254-272. http://dx.doi.org/10.1109/TASSP.1981.1163530
    • (1981) IEEE Transactions on Acoustics Speech and Signal Processing , vol.29 , Issue.2 , pp. 254-272
    • Furui, S.1
  • 12
    • 0043115132
    • Phoneme-level voice individuality used in speaker recognition
    • Furui, S. and Matsui, T. (1994) Phoneme-level voice individuality used in speaker recognition. Proceedings of ICSLP: 1463-1466.
    • (1994) Proceedings of ICSLP , pp. 1463-1466
    • Furui, S.1    Matsui, T.2
  • 13
    • 0032681727
    • Channel-robust speaker identification using modified-mean cepstral mean normalisation with frequency warping
    • Garcia, A.A. and Mammone, R.J. (1999) Channel-robust speaker identification using modified-mean cepstral mean normalisation with frequency warping. Proceedings of ICASSP 1: 325-328.
    • (1999) Proceedings of ICASSP , vol.1 , pp. 325-328
    • Garcia, A.A.1    Mammone, R.J.2
  • 14
    • 84865736815
    • Speaker recognition using temporal contours in linguistic units: The case of formant and formant-bandwidth trajectories
    • Gonzalez-Rodriguez, J. (2011) Speaker recognition using temporal contours in linguistic units: the case of formant and formant-bandwidth trajectories. Proceedings of Interspeech 2011: 133-135.
    • (2011) Proceedings of Interspeech , vol.2011 , pp. 133-135
    • Gonzalez-Rodriguez, J.1
  • 15
    • 29044435164
    • Robust estimation, interpretation and assessment of likelihood ratios in forensic speaker recognition
    • Gonzalez-Rodriguez J., Drygajlo A., Ramos-Castro D., Garcia-Gomar M. and Ortega-Garcia, J. (2006) Robust estimation, interpretation and assessment of likelihood ratios in forensic speaker recognition. Computer Speech and Language 20(2-3): 331-355. http://dx.doi.org/10.1016/j.csl.2005.08.005
    • (2006) Computer Speech and Language , vol.20 , Issue.2-3 , pp. 331-355
    • Gonzalez-Rodriguez, J.1    Drygajlo, A.2    Ramos-Castro, D.3    Garcia-Gomar, M.4    Ortega-Garcia, J.5
  • 16
    • 64249127238
    • Emulating DNA: Rigorous quantification of evidential weight in transparent and testable forensic speaker recognition
    • Gonzalez-Rodriguez, J., Rose, P., Ramos, D., Torre, D. and Ortega-García, J. (2007) Emulating DNA: rigorous quantification of evidential weight in transparent and testable forensic speaker recognition. IEEE Transactions Audio Speech and Language Processing 15(7): 2104-2115. http://dx.doi.org/10.1109/TASL.2007.902747
    • (2007) IEEE Transactions Audio Speech and Language Processing , vol.15 , Issue.7 , pp. 2104-2115
    • Gonzalez-Rodriguez, J.1    Rose, P.2    Ramos, D.3    Torre, D.4    Ortega-García, J.5
  • 18
    • 84861578630
    • Score-based likelihood ratios for handwriting evidence
    • Hepler, A.B., Saunders, C.P., Davis, L.J. and Buscaglia, J. (2012) Score-based likelihood ratios for handwriting evidence. Forensic Science International 219(1-3): 129-140. http://dx.doi.org/10.1016/j.forsciint.2011.12.009
    • (2012) Forensic Science International , vol.219 , Issue.1-3 , pp. 129-140
    • Hepler, A.B.1    Saunders, C.P.2    Davis, L.J.3    Buscaglia, J.4
  • 19
    • 0017732744
    • Speaker identification by long-term spectra under normal and distorted speech conditions
    • Hollien, H. and Majewski, W. (1977) Speaker identification by long-term spectra under normal and distorted speech conditions. Journal of the Acoustical Society of America 62: 975-979. http://dx.doi.org/10.1121/1.381592
    • (1977) Journal of the Acoustical Society of America , vol.62 , pp. 975-979
    • Hollien, H.1    Majewski, W.2
  • 20
    • 85032644657
    • Using formant frequencies in speech recognition
    • Holmes, J.N., Holmes, W.J. and Garner, P.N. (1997) Using formant frequencies in speech recognition. Eurospeech 97: 2083-2086.
    • (1997) Eurospeech , vol.97 , pp. 2083-2086
    • Holmes, J.N.1    Holmes, W.J.2    Garner, P.N.3
  • 21
    • 84880058542
    • A forensic text comparison in SMS messages: A likelihood ratio approach with lexical features
    • In N. Clarke, T. Tryfonas and R. Dodge (eds)
    • Ishihara, S. (2012) A forensic text comparison in SMS messages: a likelihood ratio approach with lexical features. In N. Clarke, T. Tryfonas and R. Dodge (eds) Proceedings of the 7th International Workshop on Digital Forensics and Incident Analysis: 55-65.
    • (2012) Proceedings of the 7th International Workshop on Digital Forensics and Incident Analysis , pp. 55-65
    • Ishihara, S.1
  • 24
    • 78649528111
    • Forensic voice comparison
    • In I. Freckelton and H. Selby (eds) Ch 99. Sydney: Thomson Reuters
    • Morrison, G.S. (2010) Forensic voice comparison. In I. Freckelton and H. Selby (eds) Expert Evidence. Ch 99. Sydney: Thomson Reuters.
    • (2010) Expert Evidence
    • Morrison, G.S.1
  • 25
    • 78649503176
    • A comparison of procedures for the calculation of forensic likelihood ratios from acoustic-phonetic data: Multivariate kernel density (MVKD) versus Gaussian mixture model-universal background model (GMM-UBM)
    • Morrison, G.S. (2011a) A comparison of procedures for the calculation of forensic likelihood ratios from acoustic-phonetic data: multivariate kernel density (MVKD) versus Gaussian mixture model-universal background model (GMM-UBM). Speech Communication 53: 242-256. http://dx.doi.org/10.1016/j.specom.2010.09.005
    • (2011) Speech Communication , vol.53 , pp. 242-256
    • Morrison, G.S.1
  • 26
    • 80052260613
    • Measuring the validity and reliability of forensic likelihood-ratio systems
    • Morrison, G.S. (2011b) Measuring the validity and reliability of forensic likelihood-ratio systems. Science and Justice 51: 91-98. http://dx.doi.org/10.1016/j.scijus.2011.03.002
    • (2011) Science and Justice , vol.51 , pp. 91-98
    • Morrison, G.S.1
  • 27
    • 84880049286
    • Tutorial on logistic regression calibration and fusion: Converting a score to a likelihood ratio
    • Morrison, G.S. (2012) Tutorial on logistic regression calibration and fusion: converting a score to a likelihood ratio. Australian Journal of Forensic Sciences: 1-25.
    • (2012) Australian Journal of Forensic Sciences , pp. 1-25
    • Morrison, G.S.1
  • 29
    • 84858425984
    • Quantifying the weight of evidence from a forensic fingerprint comparison: A new paradigm
    • Neumann, C., Evett, I.W. and Skerrett, L. (2012) Quantifying the weight of evidence from a forensic fingerprint comparison: a new paradigm. Journal of the Royal Statistical Society 175(2): 371-415. http://dx.doi.org/10.1111/j.1467-985X.2011.01027.x
    • (2012) Journal of the Royal Statistical Society , vol.175 , Issue.2 , pp. 371-415
    • Neumann, C.1    Evett, I.W.2    Skerrett, L.3
  • 30
    • 56249138466
    • Text-dependent speaker verification using isolated word utterances based on dynamic programming
    • [In Japanese]
    • Osanai, T., Tanimoto, M., Kido, H. and Suzuki, T. (1995) Text-dependent speaker verification using isolated word utterances based on dynamic programming. [In Japanese]. National Research Institute for Police Science Report 48(1): 15-19.
    • (1995) National Research Institute for Police Science Report , vol.48 , Issue.1 , pp. 15-19
    • Osanai, T.1    Tanimoto, M.2    Kido, H.3    Suzuki, T.4
  • 31
    • 0033902487
    • Applying logistic regression to the fusion of the NIST'99 1-speaker submissions
    • Pigeon, S., Druyts, P. and Verlinde, P. (2000) Applying logistic regression to the fusion of the NIST'99 1-speaker submissions. Digital Signal Processing 10(1-3): 237-248. http://dx.doi.org/10.1006/dspr.1999.0358
    • (2000) Digital Signal Processing , vol.10 , Issue.1-3 , pp. 237-248
    • Pigeon, S.1    Druyts, P.2    Verlinde, P.3
  • 34
    • 3843145869
    • The technical comparison of forensic voice samples
    • In I. Freckelton and H. Selby (eds) Sydney: Thomson Reuters
    • Rose, P. (2003) The technical comparison of forensic voice samples. In I. Freckelton and H. Selby (eds) Expert Evidence 1051-6102. Sydney: Thomson Reuters.
    • (2003) Expert Evidence , pp. 1051-6102
    • Rose, P.1
  • 35
    • 29044446618
    • Technical forensic speaker recognition: Evaluation, types and testing of evidence
    • Rose, P. (2006) Technical forensic speaker recognition: evaluation, types and testing of evidence. Computer Speech and Language 20(2-3): 159-191. http://dx.doi.org/10.1016/j.csl.2005.07.003
    • (2006) Computer Speech and Language , vol.20 , Issue.2-3 , pp. 159-191
    • Rose, P.1
  • 36
    • 77958606572
    • The effect of correlation on strength of evidence estimates in forensic voice comparison: Uni- and multivariate likelihood ratio-based discrimination with Australian English vowel acoustics
    • Rose, P. (2010a) The effect of correlation on strength of evidence estimates in forensic voice comparison: uni- and multivariate likelihood ratio-based discrimination with Australian English vowel acoustics. International Journal of Biometrics 2(14): 316-329. http://dx.doi.org/10.1504/IJBM.2010.035447
    • (2010) International Journal of Biometrics , vol.2 , Issue.14 , pp. 316-329
    • Rose, P.1
  • 37
    • 84880059321
    • Bernard's 18-vowel inventory size and strength of forensic voice comparison evidence
    • In M. Tabain, J. Fletcher, B. Grayden, J. Hajek and A. Butcher (eds)
    • Rose, P. (2010b) Bernard's 18-vowel inventory size and strength of forensic voice comparison evidence. In M. Tabain, J. Fletcher, B. Grayden, J. Hajek and A. Butcher (eds) Proceedings of the 13th Australasian International Conference on Speech Science and Technology: 30-33.
    • (2010) Proceedings of the 13th Australasian International Conference on Speech Science and Technology , pp. 30-33
    • Rose, P.1
  • 38
    • 80051620852
    • Forensic voice comparison with secular shibboleths - a hybrid fused GMM-multivariate likelihood ratio-based approach using alveolo-palatal fricative cepstral spectra
    • Rose, P. (2011a) Forensic voice comparison with secular shibboleths - a hybrid fused GMM-multivariate likelihood ratio-based approach using alveolo-palatal fricative cepstral spectra. Proceedings of the International Conference on Acoustics Speech and Signal Processing: 5900-5903.
    • (2011) Proceedings of the International Conference on Acoustics Speech and Signal Processing , pp. 5900-5903
    • Rose, P.1
  • 39
    • 84878014934
    • Forensic voice comparison with Japanese vowels - a likelihood ratio-based approach using segmental cepstra
    • In W. Lee and E. Zee (eds)
    • Rose, P. (2011b) Forensic voice comparison with Japanese vowels - a likelihood ratio-based approach using segmental cepstra. In W. Lee and E. Zee (eds) Proceedings of the 17th International Congress of Phonetic Sciences: 1718-1721.
    • (2011) Proceedings of the 17th International Congress of Phonetic Sciences , pp. 1718-1721
    • Rose, P.1
  • 40
    • 0035322647
    • A comparison of two acoustic methods for forensic speaker discrimination
    • Rose, P. and Clermont, F. (2001) A comparison of two acoustic methods for forensic speaker discrimination. Acoustics Australia 29(1): 31-35.
    • (2001) Acoustics Australia , vol.29 , Issue.1 , pp. 31-35
    • Rose, P.1    Clermont, F.2
  • 41
    • 3843106634
    • Strength of forensic speaker identification evidence - multispeaker formant and cepstrum based segmental discrimination with a Bayesian likelihood ratio as threshold
    • Rose, P., Osanai, T. and Kinoshita, Y. (2003) Strength of forensic speaker identification evidence - multispeaker formant and cepstrum based segmental discrimination with a Bayesian likelihood ratio as threshold. International Journal of Speech Language and the Law 10(2): 179-202.
    • (2003) International Journal of Speech Language and the Law , vol.10 , Issue.2 , pp. 179-202
    • Rose, P.1    Osanai, T.2    Kinoshita, Y.3
  • 43
    • 0004283130
    • Cambridge: Cambridge University Press
    • Wells, J.C. (1982) Accents of English. Cambridge: Cambridge University Press.
    • (1982) Accents of English
    • Wells, J.C.1
  • 44
    • 84880049808
    • Are nasals better? Likelihood ratio-based forensic voice comparison with segmental cepstra from Cantonese and Japanese syllabic/mora nasals
    • In F. Cox, K. Demuth, S. Lin, K. Miles, S. Palethorpe, J. Shaw and I. Yuen (eds)
    • Yim, A.C.S. and Rose, P. (2012) Are nasals better? Likelihood ratio-based forensic voice comparison with segmental cepstra from Cantonese and Japanese syllabic/mora nasals. In F. Cox, K. Demuth, S. Lin, K. Miles, S. Palethorpe, J. Shaw and I. Yuen (eds) Proceedings of the 14th Australasian International Conference on Speech Science and Technology: 5-8.
    • (2012) Proceedings of the 14th Australasian International Conference on Speech Science and Technology , pp. 5-8
    • Yim, A.C.S.1    Rose, P.2
  • 45
    • 0027368837
    • Spectral-shape features versus formants as acoustic correlates for vowels
    • Zahorian, S.A. and Jagharghi, A.J. (1993) Spectral-shape features versus formants as acoustic correlates for vowels. Journal of the Acoustical Society of America 94(4): 1966-1982. http://dx.doi.org/10.1121/1.407520
    • (1993) Journal of the Acoustical Society of America , vol.94 , Issue.4 , pp. 1966-1982
    • Zahorian, S.A.1    Jagharghi, A.J.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.