Code Monkey home page Code Monkey logo

dl-vision's Introduction

Deep Learning Vision Algorithms

What is this?

You're seeing a computer vision test field, under the presented architecture, you can easily try some architecture such as AlexNet, VGGx16, VGGx19 and ResNetx18

Usage

To run virtual environments

Windows

python -m venv env
env\Scripts\activate

pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
pip install numpy matplotlib python-dotenv plyer

Linux

python -m venv env
source env/bin/activate

pip install torch torchvision torchaudio
pip install numpy matplotlib python-dotenv plyer

Setting environment

.env file is required to setting the environment of this project, along with the virtual environment this file sets each section of the project. Here is a example

# Architecture
NET_ARCH=AlexNet
USE_CUDA=1

# Using model
MODELS_PATH="models"
USE_MODEL=1

# Dataset
DATASET="CIFAR10"
DATA_PATH="./data"
BATCH_SIZE=8

# Image management
IMG_SIZE=224
IMG_START_INDEX=0

# Training
ITERATIONS=1
LEARNING_RATE=0.01
MOMENTUM_VALUE=0.8
CATCH_INTERVAL=5

# Loss
LOST_CRITERIA="CrossEntropyLoss"

# Management
RESULTS_PATH="results"
LOG_PATH="log"
AUTOCLEAR=0

Available architectures

Just write any of the following on the NET_ARCH env var

  • AlexNet
  • VGG16
  • VGG19
  • ResNet

USE_CUDA=1 means that the host can and will use CUDA by default it uses the processor

Available dataset

Data sets can be defined inside the .env file in the $DATASET$ variable, available dataset work for image detection or segmentation from PyTorch documentation specification follow this link to see other options

Keyword Size Dataset
"CelebA" 200K CelebA
"CIFAR10" 60K CIFAR
"CIFAR100" 60K CIFAR

Note: The difference between CIFAR10 and CIFAR100 is the amount of classes, CIFAR10 contains 10 while CIFAR100 contains 100 see "The CIFAR-100 dataset" specifications

LOST_CRITERIA means the lost function, available options are "BCELoss" and "CrossEntropyLoss"

dl-vision's People

Contributors

andresmpa avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.