SCOPUS Information Retrieval Platform

Proceedings of the 7th International Conference on Language Resources and Evaluation, LREC 2010

2010, Pages 1539-1544

Modified LTSE-VAD algorithm for applications requiring reduced silence frame misclassification

Authors (7): Luengo, Iker (a); Navas, Eva (a); Odriozola, Igor (a); Saratxaga, Ibon (a); Hernaez, Inmaculada (a); Sainz, Iñaki (a); Erro, Daniel (a)

(a) University of the Basque Country UPV/EHU (Spain)

Author keywords

[No Author keywords available]

Indexed keywords

SPEECH RECOGNITION;

BACKGROUND NOISE LEVELS; BEST-KNOWN ALGORITHMS; EMOTION CLASSIFICATION; EMOTION IDENTIFICATIONS; EMOTION RECOGNITION; MISCLASSIFICATION RATES; MODIFIED ALGORITHMS; VOICE ACTIVITY DETECTION;

SIGNAL TO NOISE RATIO;

EID: 79959855030 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (9)

References (17)

1
- 4243315328
- 3GPP2
- 3GPP2. 2004. Enhanced Variable Rate Codec, Speech Service Option 3 for Wideband Spread Spectrum Digital Systems.
- (2004) Enhanced Variable Rate Codec, Speech Service Option 3 for Wideband Spread Spectrum Digital Systems

2
- 0030093965
- Acoustic profiles in vocal emotion expression
- Rainer Banse and Klaus R. Scherer. 1996. Acoustic profiles in vocal emotion expression. Journal of Personality and Social Psychology, 70(3):614-636.
- (1996) Journal of Personality and Social Psychology , vol.70 , Issue.3 , pp. 614-636
- Banse, R.¹ Scherer, K.R.²

3
- 34547505647
- Combining efforts for improving automatic classification of emotional user states
- Ljubljana (Slovenia)
- Anton Batliner, Stefan Steidl, Björn Schuller, Dino Seppi, Kornel Laskowski, Thurid Vogt, Laurence Devillers, Laurence Vidrascu, Noam Amir, Loic Kessous, and Vered Aharonson. 2006. Combining efforts for improving automatic classification of emotional user states. In Information Society - Language Technologies Conference (IS-LTC), pages 240-245, Ljubljana (Slovenia).
- (2006) Information Society - Language Technologies Conference (IS-LTC) , pp. 240-245
- Batliner, A.¹ Steidl, S.² Schuller, B.³ Seppi, D.⁴ Laskowski, K.⁵ Vogt, T.⁶ Devillers, L.⁷ Vidrascu, L.⁸ Amir, N.⁹ Kessous, L.¹⁰ Aharonson, V.¹¹

4
- 0002689942
- Verification of acoustical correlates of emotional speech using formant-synthesis
- Belfast
- Felix Burkhardt and Walter F. Sendlmeier. 2000. Verification of acoustical correlates of emotional speech using formant-synthesis. In ISCA Tutorial and Research Workshop on Speech and Emotion, pages 151-156, Belfast.
- (2000) ISCA Tutorial and Research Workshop on Speech and Emotion , pp. 151-156
- Burkhardt, F.¹ Sendlmeier, W.F.²

5
- 34547958553
- Multistyle classification of speech under stress using feature subset selection based on genetic algorithms
- Salvatore Casale, Alessandra Russo, and Salvatore Serrano. 2007. Multistyle classification of speech under stress using feature subset selection based on genetic algorithms. Speech Communication, 49(10):801-810.
- (2007) Speech Communication , vol.49 , Issue.10 , pp. 801-810
- Casale, S.¹ Russo, A.² Serrano, S.³

6
- 85037524190
- ETSI
- ETSI. 1997. ES 301 249: Digital cellular telecommunications system (Phase 2); Voice Activity Detector (VAD) for Enhanced Full Rate (EFR) speech traffic channels (GSM 06.82 version 4.0.1).
- (1997) ES 301 249: Digital Cellular Telecommunications System (Phase 2); Voice Activity Detector (VAD) for Enhanced Full Rate (EFR) Speech Traffic Channels (GSM 06.82 Version 4.0.1)

7
- 0442317754
- ETSI
- ETSI. 2003. ES 202 050: Speech Processing, Transmission and Quality Aspects (STQ); Distributed speech recognition; Advanced front-end feature extraction algorithm; Compression algorithms.
- (2003) ES 202 050: Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Advanced Front-end Feature Extraction Algorithm; Compression Algorithms

8
- 84910032186
- SPEECON - Speech databases for consumer devices: Database specification and validation
- Las Palmas, Spain
- Dorota Iskra, Beate Grosskopf, Krzysztof Marasek, Henk van den Heuvel, Frank Diehl, and Andreas Kiessling. 2002. SPEECON - speech databases for consumer devices: database specification and validation. In Language Resources and Evaluation Conference (LREC), pages 329-333, Las Palmas, Spain.
- (2002) Language Resources and Evaluation Conference (LREC) , pp. 329-333
- Iskra, D.¹ Grosskopf, B.² Marasek, K.³ Van Den Heuvel, H.⁴ Diehl, F.⁵ Kiessling, A.⁶

9
- 85037535893
- ITU-T
- ITU-T. 2007. Recommendation G.729 Annex B: A silence compression scheme for G.729 optimized for terminals conforming to ITU-T Recommendation V.70.
- (2007) Recommendation G.729 Annex B: A Silence Compression Scheme for G.729 Optimized for Terminals Conforming to ITU-T Recommendation V.70

10
- 48149087416
- Real-time emotion detection system using speech: Multi-modal fusion of different timescale features
- Crete
- Samuel Kim, Panayiotis G. Georgiou, Sungbok Lee, and Shrikanth Narayanan. 2007. Real-time emotion detection system using speech: Multi-modal fusion of different timescale features. In IEEE Workshop on Multimedia Signal Processing, pages 48-51, Crete.
- (2007) IEEE Workshop on Multimedia Signal Processing , pp. 48-51
- Kim, S.¹ Georgiou, P.G.² Lee, S.³ Narayanan, S.⁴

11
- 0141478766
- Pitch maxima for robust speaker recognition
- Hong Kong
- S. Krishnakumar, K.R. Prasanna Kumar, and N. Balakrishnan. 2003. Pitch maxima for robust speaker recognition. In ICASSP, Volume 2, pages 201-204, Hong Kong.
- (2003) ICASSP , vol.2 , pp. 201-204
- Krishnakumar, S.¹ Prasanna Kumar, K.R.² Balakrishnan, N.³

12
- 85046873967
- The DET curve in assessment of detection task performance
- Rhodes, Greece
- Alvin F. Martin, George R. Doddington, Terri Kamm, Mark Ordowski, and Mark A. Przybocki. 1997. The DET curve in assessment of detection task performance. In Eurospeech, pages 1895-1898, Rhodes, Greece.
- (1997) Eurospeech , pp. 1895-1898
- Martin, A.F.¹ Doddington, G.R.² Kamm, T.³ Ordowski, M.⁴ Przybocki, M.A.⁵

13
- 0242721417
- Speech emotion recognition using hidden Markov models
- Tin Lay Nwe, Say Wei Foo, and Liyanage C. de Silva. 2003. Speech emotion recognition using hidden Markov models. Speech Communication, 41(4):603-623.
- (2003) Speech Communication , vol.41 , Issue.4 , pp. 603-623
- Nwe, T.L.¹ Foo, S.W.² De Silva, L.C.³

14
- 33646093001
- Feature representation and discrimination based on Gaussian mixture model probability densities - Practices and algorithms
- Pekka Paalanen, Joni-Kristian Kamarainen, Jarmo Ilonen, and Heikki Kälviäinen. 2006. Feature representation and discrimination based on Gaussian mixture model probability densities - practices and algorithms. Pattern Recognition, 39(7):1346-1358.
- (2006) Pattern Recognition , vol.39 , Issue.7 , pp. 1346-1358
- Paalanen, P.¹ Kamarainen, J.-K.² Ilonen, J.³ Kälviäinen, H.⁴

15
- 23144440245
- Global trend of fundamental frequency in emotional speech
- Nara, Japan
- A. Paeschke. 2004. Global trend of fundamental frequency in emotional speech. In Speech Prosody, pages 671-674, Nara, Japan.
- (2004) Speech Prosody , pp. 671-674
- Paeschke, A.¹

16
- 1842476689
- Efficient voice activity detection algorithms using long term speech information
- Javier Ramirez, Jose C. Segura, Carmen Benitez, Angel de la Torre, and Antonio Rubio. 2004. Efficient voice activity detection algorithms using long term speech information. Speech Communication, 42:271-287.
- (2004) Speech Communication , vol.42 , pp. 271-287
- Ramirez, J.¹ Segura, J.C.² Benitez, C.³ De La Torre, A.⁴ Rubio, A.⁵

17
- 38049048651
- Frame vs. Turn-level: Emotion recognition from speech considering static and dynamic processing
- Bogdan Vlasenko, Björn Schuller, Andreas Wendemuth, and Gerhard Rigoll. 2007. Frame vs. turn-level: Emotion recognition from speech considering static and dynamic processing. Lecture Notes in Computer Science, 4738:139-147.
- (2007) Lecture Notes in Computer Science , vol.4738 , pp. 139-147
- Vlasenko, B.¹ Schuller, B.² Wendemuth, A.³ Rigoll, G.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.