Code Monkey home page Code Monkey logo

thereswaldo's Introduction

There's Waldo!

This repository contains an implementation of the fully convolutional Tiramisu network for semantic segmentation, applied to the task of playing Where's Waldo?.

Where’s Waldo (or Wally) is a series of children’s books that challenges the reader with finding the eponymous character and his compadres in densely illustrated images.

..

Actual Predicted Segementation Indicated by Transparency Mask

Please see my recent blog post Where's Waldo: Terminator Edition for an overview of my approach.

I've also provided a complete end-to-end walk-through of my There's Waldo project in this notebook.

You can find all training / testing images and corresponding predictions here

Set Up

I've provided for you the original images and bounding box xml's necessary to recreate my process.

First, unpack them:

$ unzip imgs_boxes.zip

To create the binary labels:

$ python make_targets.py
$ python preprocessing.py

The next step is to create the Waldo sub-images.

$ python create_subimages.py

Training

I didn't put together a script for training the model; training neural networks isn't straightforward and relies alot on human judgement. If you'd like to train your own model, please take a look at the accompanying notebook.

Predicting

To make predictions on new images:

$ python predict.py image1.jpg [image2.jpg image3.jpg ...] --output_path /output/path

This will load weights from my model by default. Makes predictions and saves them as transparency masks overlaid on the original image. They'll be saved as output_{i}.png in the specified folder.

If you want to use your own model weights, just specify with the --model flag.

Acknowledgments

This project is inspired by material studied while interning with Jeremy Howard and Rachel Thomas' fast.ai, in particular Lesson 14 of their course Cutting Edge Deep Learning for Coders, taught at USF’s Data Institute.

thereswaldo's People

Contributors

bckenstler avatar giulic3 avatar

Watchers

paper2code - bot avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.