Comments (8)
ideas :
- download picture from google image
- extract pictures from youtube videos
- use image net (but that's too easy)
- find some interesting picture dataset in kaggle
- use a crawler to download images from website :
- an housing website
- animal pictures such as https://pixabay.com/images/search/animal/
- wikipedia pictures
problem with 1M pictures is it also means hosting it somewhere. Might be good to keep the url so it can be used for visualization ?
from image_embeddings.
might be reasonable to start with a simple example with 100 pictures, use that as basic example and expand to the more complete example afterwards
from image_embeddings.
https://lionbridge.ai/datasets/top-10-image-classification-datasets-for-machine-learning/
https://www.tensorflow.org/datasets/catalog/sun397
from image_embeddings.
plan :
- use https://www.tensorflow.org/datasets/catalog/sun397
- download it
- select 100 decent images of it
- run inference
- commit the 100 images and 100 embeddings
- build basic python knn and js knn
- then run fast inference on all 100k images
- try python knn on them
- put screenshots in readme
- write blog post
from image_embeddings.
using tf_flowers instead as sun397 is too big for a simple example
from image_embeddings.
bootstrapped in https://github.com/rom1504/image-embeddings/blob/master/ImageEmb.ipynb
Next steps :
- put that in clean .py files
- create setup.py / requirements.txt
- create clean README
- consider putting on pypi
- screenshot in readme
- js knn
- blogpost
Other things to consider :
- provide other ways to get pictures (add a simple resizer that work for generic picture folder)
from image_embeddings.
Remaining :
- inference in .py (in api and cli)
- knn in .py (in api and cli)
- get screenshots for readme and blogpost
- js knn
from image_embeddings.
done
opening issues for the rest (js knn and blogpost)
from image_embeddings.
Related Issues (20)
- provide end to end command with CLI
- put notebook on colab for easy demo HOT 1
- Provide notebook doing inference before knn for "your own image" effect
- Add test running pipeline end to end on tf_flowers
- fix fashion mnist and mnist
- Publish ui package and make it possible to provide embeddings and images
- try to do inference in ui with tensorflow.js and allow the user to provide their own image HOT 4
- image extension .jpeg hardcoded HOT 3
- In some folders I am getting this error OverflowError: cannot convert float infinity to integer HOT 2
- Indexing only bright pixels in an image? HOT 1
- Invalid argument: input depth must be evenly divisible by filter depth: 1 vs 3? HOT 1
- Video datasets HOT 1
- ValueError: Expect x to be a non-empty array or dataset. HOT 5
- Flexibility in choosing models for image embeddings
- module 'image_embeddings' has no attribute 'inference' HOT 1
- How to generate embeddings of an input image ?
- batch mode and multithread HOT 3
- Requirement faiss-cpu-noavx2 makes the package installation fail on Colab
- Add option in CLI to provide image to search
- Make it possible to configure how many examples are used in downloader HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from image_embeddings.