Code Monkey home page Code Monkey logo

acol-gan's Introduction

ACOL-GAN

Abstract

Clustering is one of the research hotspots of deep learning. Recently, deep generative models provide a new way to achieve clustering. However, from all the proposed models, the performance of the clustering variants based on the Variational Autoencoder(VAE) are superior than that based on the Generative Adversarial Network (GAN), which is mainly because the former allows the data to be multi-mode in latent space but the latter does not do so, making the boundaries of different classes obscure and difficult to distinguish. In this paper, we propose a new GAN-based clustering model named Auto-clustering Output Layer Generative Adversarial Network(ACOL-GAN), which replaces the normal distribution that standard GAN relied on with Gaussian mixture distribution generated by sampling networks and adopts the Auto-clustering Output Layer(ACOL) as the output layer in discriminator. Due to Graph-based Activity Regularization(GAR) terms, softmax nodes of parent-classes are specialized as the competition between each other occurs during training. The experimental results show that ACOL-GAN is superior to most unsupervised clustering algorithms on MNIST, USPS and Fashion-MNIST datasets.

Comparison

.

The ACOL-GAN algorithm was compared to the most advanced clustering methods. As shown in Table 1, the ACOL-GAN clustering algorithm is better than or equal to other algorithms in most cases, and the performance on the Fashion-MNIST dataset is the best in the recorded algorithm clustering results.

Visualization

.

The latent representation of the MNIST data set is visualized by t-SNE[5]. ACOL-GAN can quickly capture the data distribution of the MNIST dataset in the latent space and make the actual data to fit the specified Gaussian mixture for fast clustering. And only 5 epochs are needed to achieve different boundaries in the latent space.

Performance of Different Number to be Clustered

.

The number of clusters required for the ACOL-GAN model is set 7 and 14 on the MNIST and Fashion-MNIST datasets. In the results of the cluster number set to 7 categories, similar pictures such as the combination of 0 and 6 in MNIST dataset and the combination of the sandals and the sneakers in Fashion-MNIST dataset are grouped together. The result that the number of clusters is set to 14 shows that even in the same class, some differences such as the tilt angle and the shape in the same category can be divided into a plurality of classes. Thus, by observing the experimental results, it can be found that ACOL-GAN can effectively accomplish the task even if the number of clusters is not the number of categories in the real world.

acol-gan's People

Contributors

wusongyuan avatar

Stargazers

 avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.