Publication Details
Automatic Acquisition of Semantics-Extraction Patterns
parallel corpus, lexico-semantic patterns
This paper examines the use of parallel and comparable corpora for automatic acquisition of semantics-extraction patterns. It presents a new method of the pattern extraction which takes advantage of parallel texts to "port" text mining solutions from a source language to a target language. It is shown that the technique can help in situations when the extraction procedure is to be applied in a language (languages) with a limited set of available resources, e.g. domain-specific thesauri. The primary motivation of our work lies in a particular multilingual e-learning system. For testing purposes, other applications of the given approach were implemented. They include pattern extraction from general texts (tested on wordnet relations), acquisition of domain-specific patterns from large parallel corpus of legal EU documents, and mining of subjectivity expressions for multilingual opinion extraction system.
@inproceedings{BUT22422,
author="Pavel {Smrž}",
title="Automatic Acquisition of Semantics-Extraction Patterns",
booktitle="Proceedings of the 5th International Conference on Language Resources and Evaluation",
year="2006",
pages="1--4",
publisher="European Language Resources Association",
address="Paris",
isbn="2-9517408-2-4"
}