Publication Details

Generator of Synthetic Datasets for Hierarchical Sequential Pattern Mining Evaluation

ŠEBEK, M.; ZENDULKA, J. Generator of Synthetic Datasets for Hierarchical Sequential Pattern Mining Evaluation. Proceedings of the Twelfth International Conference on Informatics 2013. Košice: The University of Technology Košice, 2013. p. 289-292. ISBN: 978-80-8143-127-2.
Czech title
Generátor syntetických datových sad pro vyhodnocení dolování hierarchických sekvenčních vzorů
Type
conference paper
Language
English
Authors
Šebek Michal, Ing., Ph.D.
Zendulka Jaroslav, doc. Ing., CSc. (UIFS)
Keywords

Sequence pattern mining, synthetic dataset generators, taxonomy

Abstract

Evaluation is an important part of algorithm design. Algorithms are typically evaluated on real-world and synthetic datasets. Real-world datasets are appropriate for evaluating how an algorithm behaves in practice, but it is difficult to change such a dataset so that it has particular statistics, e.g. the number of input items. In contrast, a generated synthetic dataset makes it simple to change any single statistical property of the dataset while keeping all other statistical properties unchanged. In this paper, we present a procedure for generating sequence databases with taxonomies for the evaluation of hierarchical sequential pattern mining algorithms.

Published
2013
Pages
289–292
Proceedings
Proceedings of the Twelfth International Conference on Informatics 2013
ISBN
978-80-8143-127-2
Publisher
The University of Technology Košice
Place
Košice
BibTeX
@inproceedings{BUT103555,
  author="Michal {Šebek} and Jaroslav {Zendulka}",
  title="Generator of Synthetic Datasets for Hierarchical Sequential Pattern Mining Evaluation",
  booktitle="Proceedings of the Twelfth International Conference on Informatics 2013",
  year="2013",
  pages="289--292",
  publisher="The University of Technology Košice",
  address="Košice",
  isbn="978-80-8143-127-2",
  url="https://www.fit.vut.cz/research/publication/10435/"
}