Code Monkey home page Code Monkey logo

cvis's Introduction

Computer Vision Project

Diana Triantafyllidou & Raphael Vorias

This project is divided into four main sections:

  1. A Convolutional Auto-Encoder
  2. A Multi-label Classifier
  3. A Dual AE/Classifier
  4. An Image Segmentator

Each section consists of build files that constuct a model and save it locally. Then, pipeline modules use the built models in order to train. Images and masks are preproccesed and then handled via flow_from_dataframe.

Some sections, such as the AE section, contain visualization files.

While these models are not getting state of the art results, the

AE

Tested models:

Model Accuracy Params
U-Net - unfrozen 0.71 2 M
Squeeze U-Net - unfrozen 0.78 726 K
Baseline - frozen 0.65 116 K
Baseline - unfrozen 0.43 170 K
Baseline - blank 0.68 170 K
Dual architecture 0.66 440 K

Example reconstruction:

AE prediction

Classifier

Three variations were tested:

  1. Baseline Scratch: baseline architecture with reinitialized weights.
  2. Baseline Finetune: trained encoder part of the AE to finetune these pre-trained weights.
  3. Baseline Frozen: trained encoder part of the AE which are frozen, only the last dense layers are trained.

Next to self-made models, U-Net architectures were used.

Model Accuracy Params
U-Net - unfrozen 0.71 2 M
Squeeze U-Net - unfrozen 0.78 726 K
Baseline - frozen 0.65 116 K
Baseline - unfrozen 0.43 170 K
Baseline - blank 0.68 170 K
Dual architecture 0.66 440 K

CL plots

Dual network

This network has both an AutoEncoder and a Classifier and are trained simultaneously.

Weight plots of the first convolutional layer:

Untrained Trained
DU untrained DU trained

TSNE-plot of the final dense layer:

DU tsne

Confusion plot after 250 epochs:

DU conf

Segmentor

Trained using Dice Loss. Examples:

SE ex1SE ex2SE ex3SE ex4

cvis's People

Contributors

rvorias avatar

Stargazers

 avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.