Adaptive Spectral Clustering with Application to Tripeptide Conformation Analysis

Partitioning of two-dimensional data sets as test for our adaptive spectral clustering algorithm

Fiete Haack, Konstantin Fackeldey, Susanna Röblitz, Olga Scharkoi, Marcus Weber, and Burkhard Schmidt

A decomposition of a molecular conformational space into sets or functions (states) allows for a reduced description of the dynamical behavior in terms of transition probabilities between these states. Spectral clustering of the corresponding transition probability matrix can then reveal metastabilities. The more states are used for the decomposition, the smaller the risk to cover multiple conformations with one state, which would make these conformations indistinguishable. However, since the computational complexity of the clustering algorithm increases quadratically with the number of states, it is desirable to have as few states as possible. To balance these two contradictory goals, we present an algorithm for an adaptive decomposition of the position space starting from a very coarse decomposition. The algorithm is applied to small data classification problems where it was shown to be superior to commonly used algorithms, e.g., k-means. We also applied this algorithm to the conformation analysis of a tripeptide molecule where six-dimensional time series are successfully analyzed.

J. Chem. Phys. 139 (19), 194110 (2013)