Publication Details

Semantic Enrichment Across Language: A Case Study of Czech Bibliographic Databases

OTRUSINA, L.; SMRŽ, P. Semantic Enrichment Across Language: A Case Study of Czech Bibliographic Databases. Proceedings of the 14th International Conference on Natural Language Processing (ICON-2017). Kolkata: 2017. p. 523-532.
Type
conference paper
Language
English
Authors
URL
Abstract

This paper deals with semantic enrichment of textual resources by means of automatically generated named entity recognizers-linkers and advanced indexing and searching mechanisms that can be integrated into various information retrieval and information extraction systems. It introduces a new system transforming Wikipedia and other available sources into task-specific knowledge bases and employs contextual information to build state-of-the-art entity disambiguation components. Although some components are language-dependent (for example, that responsible for the morphology analysis or the semantic role identification), they can be easily replaced by existing tools providing specific functions. As a case study, we demonstrate an instantiation of the system for the task of semantic annotation of Czech bibliographic databases in the context of the CPK project. We particularly stress the role of problem-specific knowledge sources that can be easily integrated into our system and play a key role in the success of the tool in real applications.

Published
2017
Pages
523–532
Proceedings
Proceedings of the 14th International Conference on Natural Language Processing (ICON-2017)
Place
Kolkata
BibTeX
@inproceedings{BUT168464,
  author="Lubomír {Otrusina} and Pavel {Smrž}",
  title="Semantic Enrichment Across Language: A Case Study of Czech Bibliographic Databases",
  booktitle="Proceedings of the 14th International Conference on Natural Language Processing (ICON-2017)",
  year="2017",
  pages="523--532",
  address="Kolkata",
  url="http://www.aclweb.org/anthology/W/W17/W17-7563"
}
Back to top