focused-clustering
Focused Clustering and Outlier Detection in Large Attributed Graphs
Bryan Perozzi
Disclaimers
- This code is very "research", and so is probably more useful as an example than a product
- Distance Metric Learning based on original code from Eric Xing, available here
Implementation Overview
There are two programs. The first is a MATLAB script that learns a distance metric and reweights the input graph. The second is a Java program that extracts communities and outliers from the reweighted graph.
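To make the reweighting step concrete, here is a minimal Python sketch of the general idea. It is not the repository's MATLAB code. It assumes a diagonal metric (one learned weight per feature, here called `beta`) and assumes each edge weight is the similarity 1 / (1 + distance). The function name, the data layout, and the toy values are hypothetical.

```python
import math


def reweight_edges(edges, features, beta):
    """Return (u, v, weight) triples, weighting each edge by feature similarity.

    Assumption: the learned metric is diagonal, so the squared distance is
    sum_k beta[k] * (f_u[k] - f_v[k])**2, and similarity is 1 / (1 + distance).
    """
    weighted = []
    for u, v in edges:
        sq_dist = sum(
            b * (x - y) ** 2
            for b, x, y in zip(beta, features[u], features[v])
        )
        weighted.append((u, v, 1.0 / (1.0 + math.sqrt(sq_dist))))
    return weighted


# Toy graph: three nodes with two features each (illustrative values only).
edges = [(0, 1), (1, 2)]
features = {0: [1.0, 0.0], 1: [1.0, 5.0], 2: [4.0, 5.0]}
beta = [1.0, 0.0]  # hypothetical learned weights: only the first feature matters

weighted = reweight_edges(edges, features, beta)
for u, v, w in weighted:
    print(u, v, w)
```

Under these assumptions, nodes 0 and 1 agree on the only feature that matters, so their edge keeps the maximum weight, while the edge between nodes 1 and 2 is down-weighted.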
Running
An example batch file, focusco.bat, shows how to run the MATLAB program from the command line. It currently runs only the MATLAB step; running the whole pipeline is planned. It can be run like so:
>focusco.bat example.edges example.features example.similar
This produces the file focusco.out.weighted.edges.
Installation
Requirements
- A recent version of MATLAB
- Java 6+
Setup
The only required step should be to build the Java program, e.g. with Maven:
$ cd java_src
$ mvn clean install