Publication Details

Combined Density- and Grid- Based Method for Clustering of Protein Substructures

BURGETOVÁ, I. Combined Density- and Grid- Based Method for Clustering of Protein Substructures. ZNALOSTI 2009, Proceedings of the 8th annual conference. Brno: Vydavateľstvo STU, 2009. p. 201-212. ISBN: 978-80-227-3015-0.
Czech title
Kombinovaná metoda pro shlukování proteinových substruktur
Type
conference paper
Language
English
Authors
Ivana Burgetová
Keywords

Cluster analysis, data mining, PDB, sequence-structure fragments, protein
structure prediction

Abstract

Data mining techniques may reveal interesting knowledge in various datasets.
Biological databases are enormously large, and data mining techniques can
therefore be extremely helpful for extracting knowledge from them. In our study,
we focused on data mining in the Protein Data Bank (PDB). We used cluster
analysis to identify sequences that occur in a limited number of structural
conformations (sequence-structure fragments). This knowledge about protein
fragments can be used in protein structure prediction. In this paper, we present
a combined density- and grid-based method that we developed for clustering
protein structures. We also compare this method with a simple density-based
clustering method that we used in the first part of our study to prove the
existence of protein sequences that occur in more than one structural
conformation, but only in a limited number of them.

Published
2009
Pages
201–212
Proceedings
ZNALOSTI 2009, Proceedings of the 8th annual conference
Conference
Znalosti 2009, Brno, CZ
ISBN
978-80-227-3015-0
Publisher
Vydavateľstvo STU
Place
Brno
BibTeX
@inproceedings{BUT30201,
  author="Ivana {Burgetová}",
  title="Combined Density- and Grid- Based Method for Clustering of Protein Substructures",
  booktitle="ZNALOSTI 2009, Proceedings of the 8th annual conference",
  year="2009",
  pages="201--212",
  publisher="Vydavateľstvo STU",
  address="Brno",
  isbn="978-80-227-3015-0"
}