Code Monkey home page Code Monkey logo

vin-classifier's Introduction

SKLearn VIN Classifier

Several different types of classifiers are used to predict the make, model, and year of a vehicle from its vehicle identification number (VIN).

This project was submitted as coursework for COS 475 (Machine Learning) at the University of Southern Maine. The complete academic report is included in the repo.

alt text

Getting Started

The project consists of four parts: a data scraper targeting Cars.com, a utility script for cleaning scraped data, the classifiers themselves, and a standalone script allowing the user to enter a VIN and receive a prediction.

Dataset

A small dataset of approximately 3300 training examples is included, and can be found in the data-cleaner directory. The user may scrape additional examples from Cars.com using the included script.

Directory Structure

  • data-cleaner: Contains the script for cleaning raw scraped data. Raw and clean data are contained within respective subfolders.
  • web-scaper: Contains configuration file for Chrome web scraper plugin. Has its own readme with further instructions.
  • sklearn-text-classifier: Contains scripts for testing and evaluating numerous classifiers against different aspects of our problem. Also contains scripts that will train various types of decision trees and output decision tree charts in PDF format. Decision tree PDFs are also contained here.
  • standalone-stdin-app: Contains scripts that allow the user to input VIN numbers and receive a prediction from trained model(s).

Authors

  • Brendan McSweeney
  • Seth Percy
  • Nathan Rich

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.