Result Details
Automatic Language Identification using Phoneme and Automatically Derived Unit Strings
        MATĚJKA, P.; SZŐKE, I.; SCHWARZ, P.; ČERNOCKÝ, J. Automatic Language Identification using Phoneme and Automatically Derived Unit Strings. Lecture Notes in Computer Science, 2004, vol. 2004, no. 3206, p. 147-154.  ISSN: 0302-9743.
    
                Type
            
        
                journal article
            
        
                Language
            
        
                English
            
        
            Authors
            
        
                Matějka Pavel, Ing., Ph.D., DCGM (FIT), UREL (FEEC)
                
Szőke Igor, Ing., Ph.D., FIT (FIT), DCGM (FIT)
Schwarz Petr, Ing., Ph.D., FIT (FIT), DCGM (FIT)
Černocký Jan, prof. Dr. Ing., DCGM (FIT)
        Szőke Igor, Ing., Ph.D., FIT (FIT), DCGM (FIT)
Schwarz Petr, Ing., Ph.D., FIT (FIT), DCGM (FIT)
Černocký Jan, prof. Dr. Ing., DCGM (FIT)
                    Abstract
            
        Phonemes and Automatically Derived Units in Automatic Language Identification
                Keywords
            
        language identificaton, phoneme recognizer, speech processing, ergodic hidden Markov model
                URL
            
        
                    Annotation
                
            Language identification (LID) based on phono-tactic modeling is presented in this paper. Approaches using phoneme strings and strings of units automatically derived by  an Ergodic HMM (EHMM)  are compared. The  phoneme recognizers were trained on 6  languages from OGI multi-language-corpus and Czech SpeechDat-E. The LID results are obtained on 4 languages. The results show superiority of Czech phoneme recognizer while used in LID and promising trends using
the EHMM-derived units.
                Published
            
            
                    2004
                    
                
            
                    Pages
                
            
                        147–154
                
            
                    Journal
                
            
                    Lecture Notes in Computer Science, vol. 2004, no. 3206, ISSN 0302-9743
                
            
                        Book
                
            
                    Lecture Notes in Computer Science
                
            
                    BibTeX
                
            @article{BUT45738,
  author="Pavel {Matějka} and Igor {Szőke} and Petr {Schwarz} and Jan {Černocký}",
  title="Automatic  Language Identification using Phoneme and Automatically Derived Unit Strings",
  journal="Lecture Notes in Computer Science",
  year="2004",
  volume="2004",
  number="3206",
  pages="147--154",
  issn="0302-9743",
  url="http://www.springerlink.com/index/CUFLYEGQA8W1LNBE"
}
                
                Projects
            
        
        
            
        
    
    
        Data driven and anthropic coding and recognition of speech, GACR, Postdoktorandské granty, GP102/02/D108, start: 2002-09-01, end: 2005-08-30, completed
                
Voice technologies for support of information society, GACR, Standardní projekty, GA102/02/0124, start: 2002-01-01, end: 2004-12-31, completed
        Voice technologies for support of information society, GACR, Standardní projekty, GA102/02/0124, start: 2002-01-01, end: 2004-12-31, completed
                Research groups
            
        
                Speech Data Mining Research Group BUT Speech@FIT (RG SPEECH)
            
        
                Departments