Result Details
BUT system for low resource Indian language ASR
        PULUGUNDLA, B.; BASKAR, M.; KESIRAJU, S.; EGOROVA, E.; KARAFIÁT, M.; BURGET, L.; ČERNOCKÝ, J. BUT system for low resource Indian language ASR. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018. no. 9, p. 3182-3186. ISSN: 1990-9772.
    
                Type
            
        
                conference paper
            
        
                Language
            
        
                English
            
        
            Authors
            
        
                Pulugundla Bhargav, M.Sc., DCGM (FIT)
Baskar Murali Karthick, Ing., Ph.D., DCGM (FIT)
Kesiraju Santosh, Ph.D., DCGM (FIT)
Egorova Ekaterina, Ing., Ph.D., DCGM (FIT)
Karafiát Martin, Ing., Ph.D., DCGM (FIT)
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Černocký Jan, prof. Dr. Ing., DCGM (FIT)
                    Abstract
            
        This paper describes the BUT Jilebi team's speech recognition systems created for the 2018 low resource speech recognition challenge for Indian languages. We investigate modifications of multilingual time-delay neural network (TDNN) architectures with transfer learning and compare them to bi-directional residual memory networks (BRMN) and bi-directional LSTM. Our best submission, based on system combination, achieved word error rates of 13.92% (Tamil), 14.71% (Telugu) and 14.06% (Gujarati). We present the details of the submitted systems and also the post-evaluation analysis done for lexicon discovery using unsupervised word segmentation.
                Keywords
            
        Indian languages, low resource ASR, multilingual, LF-MMI
                URL
            
        https://www.isca-speech.org/archive/Interspeech_2018/abstracts/1302.html
                Published
            
            
                    2018
                    
                
            
                    Pages
                
            
                        3182–3186
                
            
                    Journal
                
            
                    Proceedings of Interspeech, vol. 2018, no. 9, ISSN 1990-9772
                
            
                        Proceedings
                
            
                    Proceedings of Interspeech 2018
                
            
                    Conference
                
            
                    Interspeech Conference
                
            
                    Publisher
                
            
                    International Speech Communication Association
                
            
                    Place
                
            
                    Hyderabad
                
            
                    DOI
                
            
                    10.21437/Interspeech.2018-1302
                
            
                    UT WoS
                
            
                    000465363900663
                
            
                    BibTeX
                
            @inproceedings{BUT155101,
  author="Bhargav {Pulugundla} and Murali Karthick {Baskar} and Santosh {Kesiraju} and Ekaterina {Egorova} and Martin {Karafiát} and Lukáš {Burget} and Jan {Černocký}",
  title="BUT system for low resource Indian language ASR",
  booktitle="Proceedings of Interspeech 2018",
  year="2018",
  journal="Proceedings of Interspeech",
  volume="2018",
  number="9",
  pages="3182--3186",
  publisher="International Speech Communication Association",
  address="Hyderabad",
  doi="10.21437/Interspeech.2018-1302",
  issn="1990-9772",
  url="https://www.isca-speech.org/archive/Interspeech_2018/abstracts/1302.html"
}
                
                Projects
            
        
        
    
    
        DARPA Low Resource Languages for Emergent Incidents (LORELEI) - Exploiting Language Information for Situational Awareness (ELISA), University of Southern California, start: 2015-09-01, end: 2020-03-31, completed
IT4Innovations excellence in science, MŠMT, Národní program udržitelnosti II, LQ1602, start: 2016-01-01, end: 2020-12-31, completed
Neural networks for signal processing and speech data mining, TAČR, Program na podporu aplikovaného výzkumu ZÉTA, TJ01000208, start: 2018-01-01, end: 2019-12-31, completed
                Research groups
            
        
                Speech Data Mining Research Group BUT Speech@FIT (RG SPEECH)
            
        
                Departments