This page contains Matlab source code and data sets related to the International Conference on Machine Learning 2008 publication "A Decoupled Approach to Exemplar-based Unsupervised Learning". The materials are made available under an open-source license in order to foster the discussion and adoption of this and related approaches.
Abstract. A recent trend in exemplar based unsupervised learning is to formulate the learning problem as a convex optimization problem. Convexity is achieved by restricting the set of possible prototypes to training exemplars. In particular, this has been done for clustering, vector quantization and mixture model density estimation. In this paper we propose a novel algorithm that is theoretically and practically superior to these convex formulations. This is possible by posing the unsupervised learning problem as a single convex master problem with non-convex subproblems. We show that for the above learning tasks the subproblems are extremely well-behaved and can be solved efficiently.
Features and Demo
The distributed source code can perform the following functions:
- Objectives: maximization of margin, log-likelihood or mean-sample-response
- Kernel functions: "uniform disc" kernel, Epanechnikov, Gaussian RBF
- Mode finding: meanshift, annealed meanshift
- Master problem re-optimization: using COIN-OR IpOpt non-linear primal-dual interior point solver
All of the code is written in Matlab, but it relies on the Matlab interface of the IpOpt solver.
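To make the kernel and mode-finding features listed above more concrete, here is a minimal sketch in Python with NumPy. It is not taken from the infex distribution, which is written in Matlab. The function names, the unnormalized kernel forms, the bandwidth parameter `h` and the annealing schedule are all assumptions chosen for illustration, and the mean-shift step shown uses Gaussian weights only.

```python
import numpy as np


def uniform_disc(d2, h):
    """Unnormalized uniform disc kernel: 1 inside radius h, 0 outside."""
    return (np.asarray(d2, dtype=float) <= h * h).astype(float)


def epanechnikov(d2, h):
    """Unnormalized Epanechnikov kernel: max(0, 1 - d^2 / h^2)."""
    return np.maximum(0.0, 1.0 - np.asarray(d2, dtype=float) / (h * h))


def gaussian_rbf(d2, h):
    """Unnormalized Gaussian RBF kernel: exp(-d^2 / (2 h^2))."""
    return np.exp(-np.asarray(d2, dtype=float) / (2.0 * h * h))


def mean_shift(X, start, h, max_iter=500, tol=1e-8):
    """Gaussian mean shift: move to the kernel-weighted mean until it stops moving.

    X is an (n, d) array of samples, start is a length-d starting point.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(start, dtype=float)
    for _ in range(max_iter):
        d2 = np.sum((X - y) ** 2, axis=1)   # squared distances to all samples
        w = gaussian_rbf(d2, h)             # one weight per sample
        total = w.sum()
        if total == 0.0:                    # all weights underflowed: stay put
            return y
        y_new = w @ X / total               # weighted mean of the samples
        if np.linalg.norm(y_new - y) < tol:
            return y_new
        y = y_new
    return y


def annealed_mean_shift(X, start, bandwidths):
    """Run mean shift over a decreasing bandwidth schedule (assumed form).

    Each stage starts from the point where the previous, coarser stage ended.
    """
    y = np.asarray(start, dtype=float)
    for h in bandwidths:
        y = mean_shift(X, y, h)
    return y


if __name__ == "__main__":
    # Two small, well-separated groups of points in the plane.
    X = np.array([[0.0, 0.0], [0.1, 0.0], [0.0, 0.1], [-0.1, 0.0], [0.0, -0.1],
                  [5.0, 5.0], [5.1, 5.0], [5.0, 5.1]])
    print(mean_shift(X, [0.5, 0.5], h=1.0))
    print(annealed_mean_shift(X, [1.0, 1.0], bandwidths=[4.0, 2.0, 1.0]))
```

Starting with a large bandwidth smooths the density estimate so that the search is less likely to stop at a minor local mode; later stages with smaller bandwidths refine the location.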
Distribution: source code, precompiled binary and demo file
- infex-1.0.tar.bz2 (1.6 Mb)
- coin-ipopt-3.4.1.tar.bz2 Linux 32 bit version (2 Mb)
- coin-ipopt-3.4.1-x86_64.tar.bz2 Linux 64 bit version (5 Mb)
License: The infex software is licensed under the GNU General Public License, version 2. A copy of the license document is included in the distribution.
Installation: for your convenience, the distribution includes statically compiled binaries of COIN-OR IpOpt for x86-32 and x86-64 Linux, linked with the linear solver MUMPS. However, reproducing the results in the paper requires a more robust linear solver such as PARDISO or HSL; the experiments in the paper used HSL MA27. Unfortunately, I am not allowed to redistribute binaries linked against these libraries, so if you want the best possible performance, you need to recompile IpOpt manually using either PARDISO or HSL. Please refer to the IpOpt documentation for instructions.
If you run into the frequently encountered error about GLIBCXX_3.4.9 or GCC_3.3 not being found when you run the mex functions, please refer to this discussion at the Mathworks forums. The easiest solution is to simply delete the libstdc++ library that ships with Matlab in the sys/os/glnx86/ folder.
Also, if Matlab crashes upon running the included ipopt function, please recompile Ipopt for your system. Sorry for the inconvenience.
Demo: The distribution includes several demos. To get started, run demo_infex1 or demo_kvq_vs_infex.
- A Decoupled Approach to Exemplar-based Unsupervised Learning, ICML 2008, Sebastian Nowozin and Gökhan Bakır.
- firstname.lastname@example.org, corresponding author
If you have comments or questions, please feel free to contact me. Thanks!