Detail projektu
Augmented Multi-party Interaction
Období řešení: 1. 1. 2004 – 31. 12. 2006
Typ projektu: grant
Kód: 506811-AMI
multimodální interakce, rozpoznávání řeči, zpracování videa, multimodální rozpoznávání, sběr dat z jednání, anotace dat z jednání
Evropský projekt AMI je společně řízen Prof. Herve Bourlardem (IDIAP, http://www.idiap.ch) a Prof. Stevem Renalsem (University of Edinburgh, http://www.iccs.informatics.ed.ac.uk). Je zaměřen na multimodální interakci během živých jednání (meetingů) s počítačovou podporou. Projekt si klade za cíl podstatný posun state-of-the-art tohoto oboru a jeho technologií (modelování komunikace člověka s člověkem, rozpoznávání řeči, počítačové vidění, multimediální indexace a vyhledávání). Jeho výstupem bude mj. off-line a on-line software pro prohlížení (browsing) multimodálních dat, včetně analýzy struktury jednání a jeho sumarizace. V rámci projektu jsou také pořizována a distribuována nahraná a anotovaná multimodální data z jednání. Projekt tímto přispívá výzkumné infrastruktuře v tomto oboru a evropské výzkumné komunitě.
Burget Lukáš, doc. Ing., Ph.D. (UPGM)
Černocký Jan, prof. Dr. Ing. (UPGM)
Grézl František, Ing., Ph.D. (UPGM)
Kadlec Jaroslav, Ing., Ph.D.
Karafiát Martin, Ing., Ph.D. (UPGM)
Matějka Pavel, Ing., Ph.D. (UPGM)
Motlíček Petr, doc. Ing., Ph.D. (UPGM)
Pečiva Jan, Ing., Ph.D. (UPGM)
Potúček Igor, Ing., Ph.D.
Schwarz Petr, Ing., Ph.D. (UPGM)
Sumec Stanislav, Ing., Ph.D.
Španěl Michal, Ing., Ph.D. (UPGM)
Zemčík Pavel, prof. Dr. Ing., dr. h. c. (UPGM)
2008
- GRÉZL, F.; FOUSEK, P. Optimizing bottle-neck features for LVCSR. 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing. Las Vegas, Nevada: IEEE Signal Processing Society, 2008.
p. 4729-4732. ISBN: 1-4244-1484-9. Detail
2006
- PEČIVA, J. Active Transaction Approach for Collaborative Virtual Environments. In ACM International Conference on Virtual Reality Continuum and its Applications (VRCIA). Chinese University of Hong Kong: Association for Computing Machinery, 2006.
p. 171-178. ISBN: 1-59593-324-7. Detail
2005
- FAPŠO, M., SCHWARZ, P., SZŐKE, I., ČERNOCKÝ, J., SMRŽ, P., BURGET, L., KARAFIÁT, M. Search Engine for Information Retrieval from Multi-modal Records. Edinburgh: 2005. Detail
- FAPŠO, M., SMRŽ, P., SCHWARZ, P., SZŐKE, I., BURGET, L., KARAFIÁT, M., ČERNOCKÝ, J. Systém pre efektívne vyhľadávanie v rečových databázach. In Sborník databázové konference DATAKON 2005. Brno: Masaryk University, 2005.
s. 323-333. ISBN: 80-210-3813-6. Detail - GRÉZL, F. Spectral plane investigation for probabilistic features for ASR. Edinburgh: 2005.
p. 82 ( p.) Detail - KADLEC, J., POTÚČEK, I., SUMEC, S., ZEMČÍK, P. Evaluation of Tracking and Recognition Methods. In Proceedings of the 11th conference EEICT. Brno: 2005.
p. 617-622. ISBN: 80-214-2890-2. Detail - MATĚJKA, P. Phoneme Recognition Tuning for Language Identification System. In Proceedings of the 11th conference STUDENT EEICT 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005.
p. 658-653. ISBN: 80-214-2890-2. Detail - MATĚJKA, P., SCHWARZ, P., ČERNOCKÝ, J., CHYTIL, P. Phonotactic Language Identification. In Proceedings of Radioelektronika 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005.
p. 140-143. ISBN: 80-214-2904-6. Detail - MOTLÍČEK, P., BURGET, L., ČERNOCKÝ, J. VISUAL FEATURES FOR MULTIMODAL SPEECH RECOGNITION. In Radioelektronika 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005.
p. 187-190. ISBN: 80-214-2904-6. Detail - NIJHOLT, A., ZWIERS, J., PEČIVA, J. The Distributed Virtual Meeting Room Exercise. In Proceedings ICMI 2005 Workshop on Multimodal multiparty meeting processing. Trento: 2005.
p. 93-99. Detail - PEČIVA, J. Omnipresent Collaborative Virtual Environments for Open Inventor Applications. In INTETAIN 2005. Springer Lecture Notes in Artificial Intelligence. Madonna di Campiglio: Springer Verlag, 2005.
p. 272-276. ISBN: 3-540-30509-2. Detail - SMRŽ, P. Parallel Metagrammar for Closely Related Languages - A Case Study of Czech and Russian. Research on Language & Computation, 2005, vol. 3, no. 2,
p. 101-128. ISSN: 1570-7075. Detail - SMRŽ, P.; FAPŠO, M. Vyhledávání v záznamech přednášek. In Sborník semináře Technologie pro e-vzdělávání. Praha: České vysoké učení technické, 2005.
s. 21-26. ISBN: 80-01-03274-4. Detail - SUMEC, S., POTÚČEK, I., ZEMČÍK, P. AUTOMATIC MOBILE MEETING ROOM. In Proceedings of 3IA'2005 International Conference in Computer Graphics and Artificial Intelligence. Limoges: 2005.
p. 171-177. ISBN: 2-914256-07-8. Detail - SZŐKE, I., SCHWARZ, P., BURGET, L., KARAFIÁT, M., ČERNOCKÝ, J. Phoneme based acoustics keyword spotting in informal continuous speech. In Radioelektronika 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005.
p. 195-198. ISBN: 80-214-2904-6. Detail - SZŐKE, I., SCHWARZ, P., BURGET, L., KARAFIÁT, M., MATĚJKA, P., ČERNOCKÝ, J. Phoneme Based Acoustics Keyword Spotting in Informal Continuous Speech. Lecture Notes in Computer Science, 2005, vol. 2005, no. 3658,
p. 302 ( p.) ISSN: 0302-9743. Detail
2004
- BERAN, V. Augmented Multi-User Communication System. Proceedings of the working conference on Advanced visual interfaces. Gallipoli: Association for Computing Machinery, 2004.
p. 257-260. ISBN: 1-58113-867-9. Detail - BERAN, V.; POTÚČEK, I. REAL-TIME RECONSTRUCTION OF INCOMPLETE HUMAN MODEL USING COMPUTER VISION. Proceeding of the 10th Conference and Competition STUDENT EEICT 2004, Volume 2. Brno: Faculty of Electrical Engineering and Communication BUT, 2004.
p. 298-302. ISBN: 80-214-2635-7. Detail - BURGET, L. Combination of Speech Features Using Smoothed Heteroscedastic Linear Discriminant Analysis. In Proc. 8th International Conference on Spoken Language Processing. Jeju island: Sunjin Printing Co, 2004.
p. 2549-2552. Detail - FOUSEK, P., SVOJANOVSKÝ, P., GRÉZL, F., HEŘMANSKÝ, H. New Nonsense Syllables Database - Analyses and Preliminary ASR Experiments. In Proc. 8th International Conference on Spoken Language Processing. 8th International Conference on Spoken Language Processing. Jeju Island: Sunjin Printing Co, 2004.
p. 348-351. ISSN: 1225-4111. Detail - FUČÍK, O., ZEMČÍK, P., TUPEC, P., CRHA, L., HEROUT, A. The Networked Photo-Enforcement and Traffic Monitoring System. In Proceedings of Engineering of Computer-Based Systems. Los Alamitos: IEEE Computer Society, 2004.
p. 423-428. ISBN: 0-7695-2125-8. Detail - HEROUT, A.; ZEMČÍK, P. Animated Particle Rendering in DSP and FPGA. SCCG 2004 Proceedings. Bratislava: Slovak University of Technology in Bratislava, 2004.
p. 237-242. ISBN: 80-223-1918-X. Detail - KARAFIÁT, M., GRÉZL, F., ČERNOCKÝ, J. TRAP based features for LVCSR of meeting data. In Proc. 8th International Conference on Spoken Language Processing. 8th International Conference on Spoken Language Processing. Jeju Island: Sunjin Printing Co, 2004.
p. 437-440. ISSN: 1225-4111. Detail - MOTLÍČEK, P. Visual Feature Extreaction for Phoneme Recognition of Meetings. Brno: Department of Computer Graphics and Multimedia FIT BUT, 2004. Detail
- MOTLÍČEK, P., ČERNOCKÝ, J. Multimodal Phoneme Recognition of Meeting Data. Lecture Notes in Computer Science, 2004, vol. 2004, no. 3206,
p. 379 ( p.) ISSN: 0302-9743. Detail - MOTLÍČEK, P., ČERNOCKÝ, J. Multimodal Phoneme Recognition of Meeting Data. In 7th International Conference, TSD 2004 Brno, Czech Republic, September 2004 Proceedings. Lecture Notes in Computer Science. Brno: Springer Verlag, 2004.
p. 379-384. ISBN: 3-540-23049-1. ISSN: 0302-9743. Detail - PEČIVA, J. Collaborative Virtual Environments. In Poster at MLMI'04 workshop. Martigny: Institute for Perceptual Artificial Intelligence, 2004.
p. 1 (1 s.). Detail - SCHWARZ, P., MATĚJKA, P., ČERNOCKÝ, J. Phoneme Recognition from a Long Temporal Context. In poster at JOINT AMI/PASCAL/IM2/M4 Workshop on Multimodal Interaction and Related Machine Learning Algorithms. Martigny: Institute for Perceptual Artificial Intelligence, 2004.
p. 1 (1 s.). Detail