Code Monkey home page Code Monkey logo

Comments (8)

rom1504 avatar rom1504 commented on May 29, 2024

ideas :

  • download picture from google image
  • extract pictures from youtube videos
  • use image net (but that's too easy)
  • find some interesting picture dataset in kaggle
  • use a crawler to download images from website :

problem with 1M pictures is it also means hosting it somewhere. Might be good to keep the url so it can be used for visualization ?

from image_embeddings.

rom1504 avatar rom1504 commented on May 29, 2024

might be reasonable to start with a simple example with 100 pictures, use that as basic example and expand to the more complete example afterwards

from image_embeddings.

rom1504 avatar rom1504 commented on May 29, 2024

https://lionbridge.ai/datasets/top-10-image-classification-datasets-for-machine-learning/
https://www.tensorflow.org/datasets/catalog/sun397

from image_embeddings.

rom1504 avatar rom1504 commented on May 29, 2024

plan :

  • use https://www.tensorflow.org/datasets/catalog/sun397
  • download it
  • select 100 decent images of it
  • run inference
  • commit the 100 images and 100 embeddings
  • build basic python knn and js knn
  • then run fast inference on all 100k images
  • try python knn on them
  • put screenshots in readme
  • write blog post

from image_embeddings.

rom1504 avatar rom1504 commented on May 29, 2024

using tf_flowers instead as sun397 is too big for a simple example

from image_embeddings.

rom1504 avatar rom1504 commented on May 29, 2024

bootstrapped in https://github.com/rom1504/image-embeddings/blob/master/ImageEmb.ipynb
Next steps :

  • put that in clean .py files
  • create setup.py / requirements.txt
  • create clean README
  • consider putting on pypi
  • screenshot in readme
  • js knn
  • blogpost

Other things to consider :

  • provide other ways to get pictures (add a simple resizer that work for generic picture folder)

from image_embeddings.

rom1504 avatar rom1504 commented on May 29, 2024

Remaining :

  • inference in .py (in api and cli)
  • knn in .py (in api and cli)
  • get screenshots for readme and blogpost
  • js knn

from image_embeddings.

rom1504 avatar rom1504 commented on May 29, 2024

done
opening issues for the rest (js knn and blogpost)

from image_embeddings.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.