Publication Details

Signature Extraction Methods for Text

KORČÁK, Z.; ZENDULKA, J.; ZEZULA, P. Signature Extraction Methods for Text. Proceedings of the Conference Modelling and Simulation of systems. Sv.Hostýn - Bystřice pod Hostýnem: 1998. p. 51-58. ISBN: 80-85988-24-0.
Type
conference paper
Language
English
Authors
Keywords

compression-based method, analytic model, two-level organization of signatures

Annotation

Signature files seem to be a promising method for text retrieval and document retrieval. According to this method the documents are stored in one file ("text file") while abstractions of the documents ("signatures") are stored in another file ("signature file"). In order to resolve a query, the signature file is searched first and many non-qualifying documents are immediately rejected. In this paper we present signature extraction methods for text and study their performance as a function of the query weight. We derive approximate formulas for estimating the query time of several existing methods. We also propose a new two-level organization method. All analytic formulas are verified by experiments.

Published
1998
Pages
51–58
Proceedings
Proceedings of the Conference Modelling and Simulation of systems
Volume
2
ISBN
80-85988-24-0
Place
Sv.Hostýn - Bystřice pod Hostýnem
BibTeX
@inproceedings{BUT191391,
  author="Zdeněk {Korčák} and Jaroslav {Zendulka} and Pavel {Zezula}",
  title="Signature Extraction Methods for Text",
  booktitle="Proceedings of the Conference Modelling and Simulation of systems",
  year="1998",
  volume="2",
  pages="51--58",
  address="Sv.Hostýn - Bystřice pod Hostýnem",
  isbn="80-85988-24-0"
}
Back to top