Code Monkey home page Code Monkey logo

onereview_data_collection's People

Contributors

porterehunley avatar

Watchers

 avatar  avatar

onereview_data_collection's Issues

Sequel Check

Sometimes when a movie has multiple parts, the collector will collect a movie review for the second movie when it should be in the first.

Add in Authentication to Start Collection

Right now the startservercontroller is open, and anyone can just start the data collection by hitting that API endpoint. The application has a token system. I need to first validate the user is logged in, then get his token, then call the protected endpoint.

What video data do we need to Query?

We do not have infinite queries and some are bigger than others. We should not go through the CC of the video unless we have too.

Should we look are comments, CC's, other things? What data do we need to get from the videos to train our algorithms. This is going to require training some crude ML models with clean data.

Collection Concurrency Issue

Right now, multiple authenticated users can start their own collection process. Add in a check that checks if there is a controller working at that moment

Add Easy Score Videos

This is a large issue with a couple of parts. First we need to be able to click on a movie on the frontend and have it display the youtube videos associated with that movie. Then we need to be able to enter a score for those videos if there is not one already.

Remove Internal API Calls

Remove the internal API calls inside of the application to increase speed and reduce complexity. Also makes it more configurable.

Add Data Migration

Write a script integrated with the application that transfers data from the database and puts it in Firestore.

Entry Monitors

Lets the user know (front end) whether or not a media item contains all the data entries needed.

Recollect Damaged Data

Add a button that recollects incomplete data. Complete the issue before this that allows for detection of media items with an incomplete data pipeline

Email Approve Users

Have an approval system for email where if a applicant is approved from the email, then it registers them into the database with a new token.

Clean YouTube Data

Gather some example data from a couple YouTube videos relating to different products and see what it all looks like and how we should clean it.

Mark it up using python and go ahead and commit it

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.