cse253_pa4's Introduction

CSE253_PA4

PA4 Image Captioning

Model built for RNN and LSTM (w/o pretrained weights) --> model can be specified when called in training (if not will default to LSTM)

Training works for both models

Evaluation works for both models

train.py -- script to train the model on training images. To see usage type python train.py --help on command line

get_data.py -- script to retrieve data (currently specific to data being on dsmlp-login.ucsd.edu server. To see usage type python get_data.py --help on command line

dependencies.txt -- Run pip install dependencies.txt from command line to make sure you have all the necessary packages installed to run to train and test the model

subset_data.py -- filters the list of caption ids to include only those that will actually be used during training (basically filters based on if the file is in the training image folder or not). Saves this list of filtered ids that can be used in data loading. To see usage type python subset_data.py --help on command line

TODO

test function for validation split
Integrate model to train and val loops
Integrate resize, build_vocab into training
Add hyperparam storing to tensorboard
Add BLEU scoring to val and train
Caption generation for validation set

Recommend Projects

adamklie / cse253_pa4 Goto Github PK

cse253_pa4's Introduction

CSE253_PA4

TODO

cse253_pa4's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent