Project Details
Augmented Multi-party Interaction
Project Period: 1. 1. 2004 – 31. 12. 2006
Project Type: grant
Code: 506811-AMI
Keywords: multi-modal interaction, speech recognition, video processing, multi-modal recognition, meeting data collection, meeting data annotation
Jointly managed by Prof. Hervé Bourlard (IDIAP, http://www.idiap.ch) and Prof. Steve Renals (University of Edinburgh, http://www.iccs.informatics.ed.ac.uk), AMI targets computer-enhanced multi-modal interaction in the context of meetings. The project aims to substantially advance the state of the art in important underpinning technologies (such as human-human communication modeling, speech recognition, computer vision, and multimedia indexing and retrieval). It will also produce tools for off-line and on-line browsing of multi-modal meeting data, including meeting structure analysis and summarization functions. The project also makes recorded and annotated multimodal meeting data widely available to the European research community, thereby contributing to the research infrastructure in the field.
Burget Lukáš, doc. Ing., Ph.D. (DCGM)
Černocký Jan, prof. Dr. Ing. (DCGM)
Grézl František, Ing., Ph.D. (DCGM)
Kadlec Jaroslav, Ing., Ph.D.
Karafiát Martin, Ing., Ph.D. (DCGM)
Matějka Pavel, Ing., Ph.D. (DCGM)
Motlíček Petr, doc. Ing., Ph.D. (DCGM)
Pečiva Jan, Ing., Ph.D. (DCGM)
Potúček Igor, Ing., Ph.D.
Schwarz Petr, Ing., Ph.D. (DCGM)
Sumec Stanislav, Ing., Ph.D.
Španěl Michal, doc. Ing., Ph.D. (DCGM)
Zemčík Pavel, prof. Dr. Ing., dr. h. c. (DCGM)
2008
- GRÉZL, F.; FOUSEK, P. Optimizing bottle-neck features for LVCSR. 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing. Las Vegas, Nevada: IEEE Signal Processing Society, 2008. p. 4729-4732. ISBN: 1-4244-1484-9.
2007
- KADLEC, J. Code Characterization for Automatic User Interface Generation. Innovations and Advanced Techniques in Computer and Information Sciences and Engineering. Dordrecht: Springer London, 2007. p. 255-260. ISBN: 978-1-4020-6267-4.
2006
- FAPŠO, M.; SMRŽ, P.; SCHWARZ, P.; SZŐKE, I.; SCHWARZ, M.; ČERNOCKÝ, J.; KARAFIÁT, M.; BURGET, L. Information Retrieval from Spoken Documents. Proceedings of the Seventh International Conference on Intelligent Text Processing and Computational Linguistics (CICLING 2006). Mexico City: Springer Verlag, 2006. p. 410-416. ISBN: 3-540-32205-1.
- GATICA-PEREZ, D.; RIGOLL, G.; SCHREIBER, S.; SMITH, K.; POTÚČEK, I.; BERAN, V. 2D Multi-Person Tracking: A Comparative Study in AMI Meetings. Lecture Notes in Computer Science. Image Processing, Computer Vision, Pattern Recognition, and Graphics, Vol. 4122. ...: Springer Science+Business Media B.V., 2006. p. 1-12. ISBN: 978-3-540-69567-7.
- HRADIŠ, M.; JURÁNEK, R. Sledování učastníků ve videozáznamech z jednání. Proceedings of the 12th Conference STUDENT EEICT 2006 Volume 2. Brno: Vysoké učení technické v Brně, 2006. p. 203-205. ISBN: 80-214-3161-X.
- KARAFIÁT, M.; GRÉZL, F.; SCHWARZ, P.; BURGET, L.; ČERNOCKÝ, J. Robust heteroscedastic linear discriminant analysis and LCRC posterior features in meeting data recognition. Proc. 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006). Lecture Notes in Computer Science. Berlin: Springer Verlag, 2006. p. 275-284. ISBN: 3-540-69267-3.
- MATĚJKA, P.; BURGET, L.; SCHWARZ, P.; ČERNOCKÝ, J. Brno University of Technology System for NIST 2005 Language Recognition Evaluation. Proceedings of Odyssey 2006: The Speaker and Language Recognition Workshop. San Juan: 2006. p. 57-64. ISBN: 1-4244-0472-X.
- PEČIVA, J. Active Transaction Approach for Collaborative Virtual Environments. ACM International Conference on Virtual Reality Continuum and its Applications (VRCIA). Chinese University of Hong Kong: Association for Computing Machinery, 2006. p. 171-178. ISBN: 1-59593-324-7.
- STOLCKE, A.; GRÉZL, F.; HWANG, M.; LEI, X.; MORGAN, N.; VERGYRI, D. Cross-Domain and Cross-Language Portability of Acoustic Features Estimated by Multilayer Perceptrons. 2006 IEEE International Conference on Acoustic, Speech, and Signal Processing. Toulouse: IEEE Signal Processing Society, 2006. p. 321-324. ISBN: 978-3-540-74627-0.
- SZŐKE, I. Keyword Spotting in Meeting Data. Proceedings of the 12th Conference Student EEICT 2006 Volume 4. Brno: Faculty of Electrical Engineering and Communication BUT, 2006. p. 440-444. ISBN: 80-214-3163-6.
2005
- ASHBY, S.; BOURBAN, S.; CARLETTA, J.; FLYNN, M.; GUILLEMOT, M.; HAIN, T.; KARAISKOS, V.; KRAAIJ, W.; KRONENTHAL, M.; LATHOUD, G.; LINCOLN, M.; LISOWSKA, A.; MCCOWAN, I.; POST, W.; REIDSMA, D.; WELLNER, P.; KADLEC, J. The AMI Meeting Corpus. Measuring Behavior 2005 Proceedings Book. Wageningen: 2005. p. 1-4.
- FAPŠO, M.; SCHWARZ, P.; SZŐKE, I.; ČERNOCKÝ, J.; SMRŽ, P.; BURGET, L.; KARAFIÁT, M. Search Engine for Information Retrieval from Multi-modal Records. Edinburgh: 2005. p. 0-0.
- FAPŠO, M.; SMRŽ, P.; SCHWARZ, P.; SZŐKE, I.; BURGET, L.; KARAFIÁT, M.; ČERNOCKÝ, J. Systém pre efektívne vyhľadávanie v rečových databázach. Sborník databázové konference DATAKON 2005. Brno: Masaryk University, 2005. p. 323-333. ISBN: 80-210-3813-6.
- GRÉZL, F. Spectral plane investigation for probabilistic features for ASR. Edinburgh: 2005. p. 82-86.
- KADLEC, J.; POTÚČEK, I.; SUMEC, S.; ZEMČÍK, P. Evaluation of Tracking and Recognition Methods. Proceedings of the 11th conference EEICT. Brno: 2005. p. 617-622. ISBN: 80-214-2890-2.
- MATĚJKA, P. Phoneme Recognition Tuning for Language Identification System. Proceedings of the 11th conference STUDENT EEICT 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005. p. 658-653. ISBN: 80-214-2890-2.
- MATĚJKA, P.; SCHWARZ, P.; ČERNOCKÝ, J.; CHYTIL, P. Phonotactic Language Identification using High Quality Phoneme Recognition. Interspeech'2005 - Eurospeech - 9th European Conference on Speech Communication and Technology. European Conference EUROSPEECH. Lisbon: International Speech Communication Association, 2005. p. 2237-2240. ISSN: 1018-4074.
- MATĚJKA, P.; SCHWARZ, P.; ČERNOCKÝ, J.; CHYTIL, P. Phonotactic Language Identification. Proceedings of Radioelektronika 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005. p. 140-143. ISBN: 80-214-2904-6.
- MOTLÍČEK, P.; BURGET, L.; ČERNOCKÝ, J. VISUAL FEATURES FOR MULTIMODAL SPEECH RECOGNITION. Radioelektronika 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005. p. 187-190. ISBN: 80-214-2904-6.
- NIJHOLT, A.; ZWIERS, J.; PEČIVA, J. The Distributed Virtual Meeting Room Exercise. Proceedings ICMI 2005 Workshop on Multimodal multiparty meeting processing. Trento: 2005. p. 93-99.
- PEČIVA, J. Omnipresent Collaborative Virtual Environments for Open Inventor Applications. INTETAIN 2005. Springer Lecture Notes in Artificial Intelligence. Madonna di Campiglio: Springer Verlag, 2005. p. 272-276. ISBN: 3-540-30509-2.
- SMRŽ, P. Parallel Metagrammar for Closely Related Languages - A Case Study of Czech and Russian. Research on Language & Computation, 2005, vol. 3, no. 2, p. 101-128. ISSN: 1570-7075.
- SMRŽ, P.; FAPŠO, M. Vyhledávání v záznamech přednášek. Sborník semináře Technologie pro e-vzdělávání. Praha: České vysoké učení technické, 2005. p. 21-26. ISBN: 80-01-03274-4.
- SUMEC, S.; POTÚČEK, I.; ZEMČÍK, P. AUTOMATIC MOBILE MEETING ROOM. Proceedings of 3IA'2005 International Conference in Computer Graphics and Artificial Intelligence. Limoges: 2005. p. 171-177. ISBN: 2-914256-07-8.
- SZŐKE, I.; SCHWARZ, P.; BURGET, L.; KARAFIÁT, M.; ČERNOCKÝ, J. Phoneme based acoustics keyword spotting in informal continuous speech. Radioelektronika 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005. p. 195-198. ISBN: 80-214-2904-6.
- SZŐKE, I.; SCHWARZ, P.; BURGET, L.; KARAFIÁT, M.; MATĚJKA, P.; ČERNOCKÝ, J. Phoneme Based Acoustics Keyword Spotting in Informal Continuous Speech. Lecture Notes in Computer Science, 2005, vol. 2005, no. 3658, p. 302-309. ISSN: 0302-9743.
2004
- BERAN, V. Augmented Multi-User Communication System. Proceedings of the working conference on Advanced visual interfaces. Gallipoli: Association for Computing Machinery, 2004. p. 257-260. ISBN: 1-58113-867-9.
- BERAN, V.; POTÚČEK, I. REAL-TIME RECONSTRUCTION OF INCOMPLETE HUMAN MODEL USING COMPUTER VISION. Proceedings of the 10th Conference and Competition STUDENT EEICT 2004, Volume 2. Brno: Faculty of Electrical Engineering and Communication BUT, 2004. p. 298-302. ISBN: 80-214-2635-7.
- BURGET, L. Combination of Speech Features Using Smoothed Heteroscedastic Linear Discriminant Analysis. Proc. 8th International Conference on Spoken Language Processing. Jeju Island: Sunjin Printing Co, 2004. p. 2549-2552.
- FOUSEK, P.; SVOJANOVSKÝ, P.; GRÉZL, F.; HEŘMANSKÝ, H. New Nonsense Syllables Database - Analyses and Preliminary ASR Experiments. Proc. 8th International Conference on Spoken Language Processing. 8th International Conference on Spoken Language Processing. Jeju Island: Sunjin Printing Co, 2004. p. 348-351. ISSN: 1225-4111.
- FUČÍK, O.; ZEMČÍK, P.; TUPEC, P.; BRYAN, L.; HEROUT, A. The Networked Photo-Enforcement and Traffic Monitoring System. Proceedings of Engineering of Computer-Based Systems. Los Alamitos: IEEE Computer Society, 2004. p. 423-428. ISBN: 0-7695-2125-8.
- HEROUT, A.; ZEMČÍK, P. Animated Particle Rendering in DSP and FPGA. SCCG 2004 Proceedings. Bratislava: Slovak University of Technology in Bratislava, 2004. p. 237-242. ISBN: 80-223-1918-X.
- HEROUT, A.; ZEMČÍK, P.; BERAN, V.; KADLEC, J. Image and Video Processing Software Framework for Fast Application Development. Joint AMI/PASCAL/IM2/M4 workshop. Martigny: Institute for Perceptual Artificial Intelligence, 2004. p. 0-0.
- KARAFIÁT, M.; GRÉZL, F.; BURGET, L. Combination of MFCC and TRAP features for LVCSR of meeting data. Martigny: 2004. p. 0-0.
- KARAFIÁT, M.; GRÉZL, F.; ČERNOCKÝ, J. TRAP based features for LVCSR of meeting data. Proc. 8th International Conference on Spoken Language Processing. 8th International Conference on Spoken Language Processing. Jeju Island: Sunjin Printing Co, 2004. p. 437-440. ISSN: 1225-4111.
- MOTLÍČEK, P. Segmentace nahrávek živých jednání podle mluvčího. Sborník příspěvků a prezentací akce Odborné semináře 2004. REL03V. Brno: Ústav radioelektroniky FEKT VUT v Brně, 2004. p. 0-0.
- MOTLÍČEK, P. Visual Feature Extraction for Phoneme Recognition of Meetings. Brno: Department of Computer Graphics and Multimedia FIT BUT, 2004. p. 0-0.
- MOTLÍČEK, P.; BURGET, L.; ČERNOCKÝ, J. PHONEME RECOGNITION OF MEETINGS USING AUDIO-VISUAL DATA. AMI Workshop. Martigny: 2004. p. 0-0.
- MOTLÍČEK, P.; ČERNOCKÝ, J. Multimodal Phoneme Recognition of Meeting Data. 7th International Conference, TSD 2004 Brno, Czech Republic, September 2004 Proceedings. Lecture Notes in Computer Science. Brno: Springer Verlag, 2004. p. 379-384. ISBN: 3-540-23049-1. ISSN: 0302-9743.
- MOTLÍČEK, P.; ČERNOCKÝ, J. Multimodal Phoneme Recognition of Meeting Data. Lecture Notes in Computer Science, 2004, vol. 2004, no. 3206, p. 379-384. ISSN: 0302-9743.
- PEČIVA, J. Collaborative Virtual Environments. Poster at MLMI'04 workshop. Martigny: Institute for Perceptual Artificial Intelligence, 2004. p. 1 (1 p.).
- SCHWARZ, P.; MATĚJKA, P.; ČERNOCKÝ, J. Phoneme Recognition from a Long Temporal Context. poster at JOINT AMI/PASCAL/IM2/M4 Workshop on Multimodal Interaction and Related Machine Learning Algorithms. Martigny: Institute for Perceptual Artificial Intelligence, 2004. p. 1 (1 p.).