Project Details
OCR, ClassificAtion & Machine Translation
Project Period: 1. 10. 2019 – 30. 9. 2021
Project Type: grant
Agency: European Union
Program: Connecting Europe Facility (CEF)
OCR, Classification, Machine Translation
OCCAM (OCR, ClassificAtion & Machine Translation) responds to the action line "Integration projects", which covers the integration (and extension) of CEF (Connecting Europe Facility) Automated Translation into multilingual digital cross-border services. The Action proposes integrating image classification, Translation Memories (TMs), Optical Character Recognition (OCR), and Machine Translation (MT) to support the automated translation of scanned documents, a document type that the CEF eTranslation service currently cannot process. OCCAM will develop two use cases: (i) the Business Registers Interconnection System (BRIS) use case and (ii) the Digital Humanities use case.
Homoliak Ivan, Ing., Ph.D. (DITS)
Michal Bohumil, Ing. (CVT)
Mrazíková Libuše, Mgr. (DFIT-PO)
Musil Petr, Ing., Ph.D. (DCGM)
Najman Pavel, Ing. (DCGM)
Otáhalová Sylva (DCGM)
Pirová Zuzana, Ing. (DFIT-FO)
2021
- KIŠŠ, M.; BENEŠ, K.; HRADIŠ, M. AT-ST: Self-Training Adaptation Strategy for OCR in Domains with Limited Transcriptions. In Lladós J., Lopresti D., Uchida S. (eds) Document Analysis and Recognition - ICDAR 2021. Lecture Notes in Computer Science. Lausanne: Springer Nature Switzerland AG, 2021. p. 463-477. ISBN: 978-3-030-86336-4.