Clustering with Gaussian Process Models
In this project, we develop an understanding of the idea for clustering with Gaussian process models, according to the work of Hyun-Chul Kim and Jaewook Lee. Based on that, we implement a Gaussian Process Clustering Python Package, and perform some clustering tests with different datasets.
More details about this project can be found in the report
Project in Artificial Intelligence and Machine Learning WS 2013/14
— Clustering Algorithm based on Gaussian Process
File Index:
1.1. Folders:
Dataset -- stores dataset files
Markers -- stores the markers with differernt colors for GMap Clustering Application
results -- stores the clustering results of ACT Schools
1.2. Notebook:
GPClustering_Impl.ipynb -- Implementation of GPClustering Package
GPClustering_Variance.ipynb -- Variance Test with Differernt Covariance Function Parameters
GPClustering_Test.ipynb -- GP Clustering Test with different datasets
GPClustering_Test_HighDimension.ipynb -- GP Clustering Test with high dimensional dataset
KMeans_Test.ipynb -- KMeans Clustering Test with different datasets
1.3 Other Files:
GPClustering.py -- The python package of our implemented GPClustering, which can be exported in notebook GPClustering_Impl.ipynb
schools.html -- The Clustering Application of ACT Schools
HOW TO:
2.1. Intallation of Ipython notebook
A installation Guide: http://ipython.org/install.html
2.2. Install Python Dependency Packages if some needed packages are missing
2.3. Go to the root path of the project, and start ipython notebook with the command:
ipython notebook
2.4. Open notebooks and run the codes (for some big dataset, the clustering test might take some time..)
The Package GPClustering.py which is used in clustering test can be re-downloaded in the notebook GPClustering_Impl.ipynb, if it is missing
2.5. Open schools.html with IE Explore to see the Application of ACT Schools (Chrome and Firefox cannot work beause of ActiveX)
Dataset Download:
Shape Sets: R15, path-based2:spiral
UCI datasets: Iris
http://cs.joensuu.fi/sipu/datasets/
ACT Schools:
https://www.data.act.gov.au/Education/ACT-School-Locations/q8rt-q8cy
OPENSOURCE
Copyright (c) 2014 [Fangzhou Yang]
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the “Software”), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
CONTACT
Fangzhou Yang: fangzhou.sjtu@gmail.com