Food Recognition/ Ingredient Recognition
- 사진 안에 있는 음식이 어떤 음식인지 인식할 수 있는 인공지능 모델 개발
- 사진 안에 있는 식재료가 어떤 식재료인지 인식할 수 있는 인공지능 모델 개발
You can start on any computer that can learn deep learning.
If you want to learn fast, use GPU-workstation.
torch~=1.9.1
torchvision~=0.10.1
Pillow~=7.0.0
natsort~=7.0.1
sklearn~=0.0
scikit-learn~=0.24.2
tqdm~=4.42.0
numpy~=1.18.1
tensorflow~=2.2.0
tensorflow-gpu~=2.2.0
tensorboard~=2.7.0
matplotlib~=3.1.2
pip install -r requirements.txt
- If you only try inference,
torch~=1.9.1
torchvision~=0.10.1
Pillow~=7.0.0
natsort~=7.0.1
numpy~=1.18.1
How can I request Checkpoint?
If you respond to GoogleForms, we will share the download link within a few days.
Currently, the shared checkpoint is ResNet152.
test_model = torch.jit.load('./jit_traced_torch_model_name.pt', map_location='cpu')
sample_data = torch.randn(1, 3, 512, 512) # (1, channel, width, height)
out_data = test_model(sample_data)
python inference.py --config configure_name.json --image image_name.jpg --label labels.txt
Quick Start Training Guide
- You need to create a configuration first.
- Then execute the following command:
python train.py --configuration configuration_name.json
Baseline Results - Food Recognition
Pretrained Model |
Accuracy |
Loss |
epoch |
note |
VGG16 |
0.077 |
5.001 |
- |
early stop, the performance is terrible |
RESNET50 |
81.94 |
0.78 |
60 |
early stop, |
RESNET152 |
73.77 |
0.973 |
20 |
comming soon! |
WIDERESNET50_2 |
72.52 |
0.998 |
20 |
comming soon! |
MOBILENET V2 |
81.96 |
0.72 |
240 |
cool, stop training |
DENSENET121 |
45.94 |
4.3338e+7 |
40 |
early stop, |
Baseline Results - Ingredient Recognition
Pretrained Model |
Accuracy |
Loss |
epoch |
note |
dataset |
num of class |
VGG16 |
- |
- |
- |
poor accuracy |
INGD_V1 (private) |
58 |
RESNET50 |
- |
- |
- |
poor accuracy |
INGD_V1 (private) |
58 |
RESNET152 |
95.44 |
0.68 |
250 |
fruits and vegs only |
Food and Vegetable Image Dataset |
58 |
RESNET152 |
92.19 |
0.41 |
376 |
nice accuracy |
INGD_V1 (private) |
58 |
WIDERESNET50_2 |
- |
- |
- |
poor accuracy |
INGD_V1 (private) |
58 |
MOBILENET V2 |
82.55 |
0.70 |
282 |
not bad |
INGD_V1 (private) |
58 |
DENSENET121 |
- |
- |
- |
poor accuracy |
INGD_V1 (private) |
58 |
Stage2 Result - Ingredient Recognition
Pretrained Model |
Accuracy |
Loss |
epoch |
note |
dataset |
num of class |
RESNET152 |
83.03 |
0.71 |
40 |
now available! |
INGD_V2 (private) |
238 |
MOBILENET V2 |
|
|
|
comming soon! |
INGD_V2 (private) |
238 |
- waverDeep - model architecture, setup train/test pipeline
GPU RESOURCE |
RAM |
COUNT |
NOTE |
NVIDIA TITAN RTX |
24G |
2 |
training |
NVIDIA GeForce GTX 1080TI |
12G |
1 |
develop, test or etc |
This project is licensed under the MIT License - see the LICENSE.md file for details