Project Details
Augmented Multi-party Interaction
Project Period: 1. 1. 2004 – 31. 12. 2006
Project Type: grant
Code: 506811-AMI
multi-modal interaction, speech recognition, video processing, multi-modal recognition, meeting data collection, meeting data annotation
Jointly managed by Prof. Herve Bourlard (IDIAP, http://www.idiap.ch) and Prof. Steve Renals (University of Edinburgh, http://www.iccs.informatics.ed.ac.uk), AMI targets computer enhanced multi-modal interaction in the context of meetings. The project aims at substantially advancing the state-of-the-art, within important underpinning technologies (such as human-human communication modeling, speech recognition, computer vision, multimedia indexing and retrieval). It will also produce tools for off-line and on-line browsing of multi-modal meeting data, including meeting structure analysis and summarizing functions. The project also makes recorded and annotated multimodal meeting data widely available for the European research community, thereby contributing to the research infrastructure in the field.
Burget Lukáš, doc. Ing., Ph.D. (DCGM)
Černocký Jan, prof. Dr. Ing. (DCGM)
Grézl František, Ing., Ph.D. (DCGM)
Kadlec Jaroslav, Ing., Ph.D.
Karafiát Martin, Ing., Ph.D. (DCGM)
Matějka Pavel, Ing., Ph.D. (DCGM)
Motlíček Petr, doc. Ing., Ph.D. (DCGM)
Pečiva Jan, Ing., Ph.D. (DCGM)
Potúček Igor, Ing., Ph.D.
Schwarz Petr, Ing., Ph.D. (DCGM)
Sumec Stanislav, Ing., Ph.D.
Španěl Michal, Ing., Ph.D. (DCGM)
Zemčík Pavel, prof. Dr. Ing., dr. h. c. (DCGM)
2008
- GRÉZL, F.; FOUSEK, P. Optimizing bottle-neck features for LVCSR. 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing. Las Vegas, Nevada: IEEE Signal Processing Society, 2008.
p. 4729-4732. ISBN: 1-4244-1484-9. Detail
2006
- PEČIVA, J. Active Transaction Approach for Collaborative Virtual Environments. In ACM International Conference on Virtual Reality Continuum and its Applications (VRCIA). Chinese University of Hong Kong: Association for Computing Machinery, 2006.
p. 171-178. ISBN: 1-59593-324-7. Detail
2005
- FAPŠO, M., SCHWARZ, P., SZŐKE, I., ČERNOCKÝ, J., SMRŽ, P., BURGET, L., KARAFIÁT, M. Search Engine for Information Retrieval from Multi-modal Records. Edinburgh: 2005. Detail
- FAPŠO, M., SMRŽ, P., SCHWARZ, P., SZŐKE, I., BURGET, L., KARAFIÁT, M., ČERNOCKÝ, J. Systém pre efektívne vyhľadávanie v rečových databázach. In Sborník databázové konference DATAKON 2005. Brno: Masaryk University, 2005.
s. 323-333. ISBN: 80-210-3813-6. Detail - GRÉZL, F. Spectral plane investigation for probabilistic features for ASR. Edinburgh: 2005.
p. 82 ( p.) Detail - KADLEC, J., POTÚČEK, I., SUMEC, S., ZEMČÍK, P. Evaluation of Tracking and Recognition Methods. In Proceedings of the 11th conference EEICT. Brno: 2005.
p. 617-622. ISBN: 80-214-2890-2. Detail - MATĚJKA, P. Phoneme Recognition Tuning for Language Identification System. In Proceedings of the 11th conference STUDENT EEICT 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005.
p. 658-653. ISBN: 80-214-2890-2. Detail - MATĚJKA, P., SCHWARZ, P., ČERNOCKÝ, J., CHYTIL, P. Phonotactic Language Identification. In Proceedings of Radioelektronika 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005.
p. 140-143. ISBN: 80-214-2904-6. Detail - MOTLÍČEK, P., BURGET, L., ČERNOCKÝ, J. VISUAL FEATURES FOR MULTIMODAL SPEECH RECOGNITION. In Radioelektronika 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005.
p. 187-190. ISBN: 80-214-2904-6. Detail - NIJHOLT, A., ZWIERS, J., PEČIVA, J. The Distributed Virtual Meeting Room Exercise. In Proceedings ICMI 2005 Workshop on Multimodal multiparty meeting processing. Trento: 2005.
p. 93-99. Detail - PEČIVA, J. Omnipresent Collaborative Virtual Environments for Open Inventor Applications. In INTETAIN 2005. Springer Lecture Notes in Artificial Intelligence. Madonna di Campiglio: Springer Verlag, 2005.
p. 272-276. ISBN: 3-540-30509-2. Detail - SMRŽ, P. Parallel Metagrammar for Closely Related Languages - A Case Study of Czech and Russian. Research on Language & Computation, 2005, vol. 3, no. 2,
p. 101-128. ISSN: 1570-7075. Detail - SMRŽ, P.; FAPŠO, M. Vyhledávání v záznamech přednášek. In Sborník semináře Technologie pro e-vzdělávání. Praha: České vysoké učení technické, 2005.
s. 21-26. ISBN: 80-01-03274-4. Detail - SUMEC, S., POTÚČEK, I., ZEMČÍK, P. AUTOMATIC MOBILE MEETING ROOM. In Proceedings of 3IA'2005 International Conference in Computer Graphics and Artificial Intelligence. Limoges: 2005.
p. 171-177. ISBN: 2-914256-07-8. Detail - SZŐKE, I., SCHWARZ, P., BURGET, L., KARAFIÁT, M., ČERNOCKÝ, J. Phoneme based acoustics keyword spotting in informal continuous speech. In Radioelektronika 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005.
p. 195-198. ISBN: 80-214-2904-6. Detail - SZŐKE, I., SCHWARZ, P., BURGET, L., KARAFIÁT, M., MATĚJKA, P., ČERNOCKÝ, J. Phoneme Based Acoustics Keyword Spotting in Informal Continuous Speech. Lecture Notes in Computer Science, 2005, vol. 2005, no. 3658,
p. 302 ( p.) ISSN: 0302-9743. Detail
2004
- BERAN, V. Augmented Multi-User Communication System. Proceedings of the working conference on Advanced visual interfaces. Gallipoli: Association for Computing Machinery, 2004.
p. 257-260. ISBN: 1-58113-867-9. Detail - BERAN, V.; POTÚČEK, I. REAL-TIME RECONSTRUCTION OF INCOMPLETE HUMAN MODEL USING COMPUTER VISION. Proceeding of the 10th Conference and Competition STUDENT EEICT 2004, Volume 2. Brno: Faculty of Electrical Engineering and Communication BUT, 2004.
p. 298-302. ISBN: 80-214-2635-7. Detail - BURGET, L. Combination of Speech Features Using Smoothed Heteroscedastic Linear Discriminant Analysis. In Proc. 8th International Conference on Spoken Language Processing. Jeju island: Sunjin Printing Co, 2004.
p. 2549-2552. Detail - FOUSEK, P., SVOJANOVSKÝ, P., GRÉZL, F., HEŘMANSKÝ, H. New Nonsense Syllables Database - Analyses and Preliminary ASR Experiments. In Proc. 8th International Conference on Spoken Language Processing. 8th International Conference on Spoken Language Processing. Jeju Island: Sunjin Printing Co, 2004.
p. 348-351. ISSN: 1225-4111. Detail - FUČÍK, O., ZEMČÍK, P., TUPEC, P., CRHA, L., HEROUT, A. The Networked Photo-Enforcement and Traffic Monitoring System. In Proceedings of Engineering of Computer-Based Systems. Los Alamitos: IEEE Computer Society, 2004.
p. 423-428. ISBN: 0-7695-2125-8. Detail - HEROUT, A.; ZEMČÍK, P. Animated Particle Rendering in DSP and FPGA. SCCG 2004 Proceedings. Bratislava: Slovak University of Technology in Bratislava, 2004.
p. 237-242. ISBN: 80-223-1918-X. Detail - KARAFIÁT, M., GRÉZL, F., ČERNOCKÝ, J. TRAP based features for LVCSR of meeting data. In Proc. 8th International Conference on Spoken Language Processing. 8th International Conference on Spoken Language Processing. Jeju Island: Sunjin Printing Co, 2004.
p. 437-440. ISSN: 1225-4111. Detail - MOTLÍČEK, P. Visual Feature Extreaction for Phoneme Recognition of Meetings. Brno: Department of Computer Graphics and Multimedia FIT BUT, 2004. Detail
- MOTLÍČEK, P., ČERNOCKÝ, J. Multimodal Phoneme Recognition of Meeting Data. Lecture Notes in Computer Science, 2004, vol. 2004, no. 3206,
p. 379 ( p.) ISSN: 0302-9743. Detail - MOTLÍČEK, P., ČERNOCKÝ, J. Multimodal Phoneme Recognition of Meeting Data. In 7th International Conference, TSD 2004 Brno, Czech Republic, September 2004 Proceedings. Lecture Notes in Computer Science. Brno: Springer Verlag, 2004.
p. 379-384. ISBN: 3-540-23049-1. ISSN: 0302-9743. Detail - PEČIVA, J. Collaborative Virtual Environments. In Poster at MLMI'04 workshop. Martigny: Institute for Perceptual Artificial Intelligence, 2004.
p. 1 (1 s.). Detail - SCHWARZ, P., MATĚJKA, P., ČERNOCKÝ, J. Phoneme Recognition from a Long Temporal Context. In poster at JOINT AMI/PASCAL/IM2/M4 Workshop on Multimodal Interaction and Related Machine Learning Algorithms. Martigny: Institute for Perceptual Artificial Intelligence, 2004.
p. 1 (1 s.). Detail