rips-shoah-2012's People

Contributors

christiequaranta, ericcarlschwartz, esizikova

rips-shoah-2012's Issues

Look into SQL

To evaluate the results properly, we need to know how many videos are assigned to each keyword and whether those assignments overlap. This is probably a question for SQL.
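One way the two counts could be obtained is sketched below, using Python's built-in `sqlite3` module. The table `keyword_video` and its columns `keyword_id` and `video_id` are hypothetical, as are the sample rows; the real database schema may look quite different.

```python
import sqlite3

# Hypothetical schema: one row per (keyword, video) assignment.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE keyword_video (keyword_id INTEGER, video_id INTEGER)")
conn.executemany(
    "INSERT INTO keyword_video VALUES (?, ?)",
    [(1, 100), (1, 101), (2, 101), (2, 102), (2, 103)],  # made-up sample rows
)

# Number of distinct videos assigned to each keyword.
videos_per_keyword = conn.execute(
    """
    SELECT keyword_id, COUNT(DISTINCT video_id)
    FROM keyword_video
    GROUP BY keyword_id
    ORDER BY keyword_id
    """
).fetchall()

# Number of videos shared by each pair of keywords (the overlap).
# The self-join pairs rows with the same video but different keywords;
# a.keyword_id < b.keyword_id keeps each pair only once.
overlap = conn.execute(
    """
    SELECT a.keyword_id, b.keyword_id, COUNT(DISTINCT a.video_id)
    FROM keyword_video AS a
    JOIN keyword_video AS b
      ON a.video_id = b.video_id AND a.keyword_id < b.keyword_id
    GROUP BY a.keyword_id, b.keyword_id
    ORDER BY a.keyword_id, b.keyword_id
    """
).fetchall()

print(videos_per_keyword)
print(overlap)
```

Keyword pairs that share no videos do not appear in the second result at all, so a missing pair means an overlap of zero.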

CSV to SQLite

Imanol strongly believes that the easiest approach is to convert some of the data files from CSV to SQLite so that we can use them in Python or other analysis tools.
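A minimal sketch of such a conversion, using only the standard-library `csv` and `sqlite3` modules, might look like this. The helper `csv_to_sqlite`, the table name `keywords`, and the sample data are all hypothetical, and the sketch assumes the first CSV row is a header.

```python
import csv
import io
import sqlite3


def csv_to_sqlite(csv_file, conn, table):
    """Load a CSV file object (with a header row) into a new SQLite table."""
    reader = csv.reader(csv_file)
    header = next(reader)
    # Quote identifiers so unusual column names do not break the SQL.
    columns = ", ".join('"{}"'.format(name.replace('"', '""')) for name in header)
    placeholders = ", ".join("?" for _ in header)
    conn.execute('CREATE TABLE "{}" ({})'.format(table, columns))
    conn.executemany(
        'INSERT INTO "{}" VALUES ({})'.format(table, placeholders), reader
    )
    conn.commit()


# Made-up sample data standing in for one of the project's CSV files.
sample = io.StringIO("keyword_id,label\n1,alpha\n2,beta\n")
conn = sqlite3.connect(":memory:")
csv_to_sqlite(sample, conn, "keywords")

rows = conn.execute(
    "SELECT keyword_id, label FROM keywords ORDER BY keyword_id"
).fetchall()
print(rows)
```

Because the columns are created without declared types, every value is stored as text; numeric columns would need an explicit type or a cast before being compared as numbers.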

Read Last Year's Project: Translating Locations

Translating locations will be a big problem, I think, because Google Translate does not recognize many of them as words from a particular language. We should check how last year's group dealt with this.

Research Uses of Wikipedia in our Project

As Roja suggested, we might be able to add context to our project to help with the translation. Wikipedia or some other application might have an API that we could use.
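Wikipedia does expose the MediaWiki web API, and its `langlinks` property lists the titles of the same article in other languages, which could be one source of translations for place names. The sketch below only builds the request URL and parses a response; it makes no network call. The helper names and the sample response are hypothetical, and the sample is hand-written in the shape of the API's legacy JSON format.

```python
import json
from urllib.parse import urlencode


def langlinks_url(title, target_lang, source_lang="en"):
    """Build a MediaWiki API URL asking for one article's title in another language."""
    params = {
        "action": "query",
        "prop": "langlinks",
        "titles": title,
        "lllang": target_lang,
        "format": "json",
    }
    return "https://{}.wikipedia.org/w/api.php?{}".format(
        source_lang, urlencode(params)
    )


def parse_langlinks(response_text):
    """Extract the foreign-language titles from a langlinks JSON response."""
    pages = json.loads(response_text)["query"]["pages"]
    titles = []
    for page in pages.values():
        # Pages without a counterpart in the target language have no "langlinks" key.
        for link in page.get("langlinks", []):
            titles.append(link["*"])
    return titles


url = langlinks_url("Warsaw", "de")

# Hand-written sample response, not fetched from Wikipedia.
sample_response = json.dumps(
    {"query": {"pages": {"1": {"title": "Warsaw",
                               "langlinks": [{"lang": "de", "*": "Warschau"}]}}}}
)
titles = parse_langlinks(sample_response)
print(url)
print(titles)
```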

Thesaurus Standard

Research the thesaurus standard used in the Shoah database as suggested by the sponsors.

Accent Problems in Translation

Python raises many encoding errors when writing translated text to a text file. Eric fixed the initial Spanish translation, but the problem will be worse for other languages, especially those with non-Latin alphabets. We have to find a way to avoid this.
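A common cause of such errors is letting Python choose the file encoding implicitly. One possible fix, sketched here for Python 3, is to pass an explicit UTF-8 encoding whenever the file is opened; in Python 2, `io.open` accepts the same `encoding` argument for unicode strings. The file name and the sample words are made up for illustration.

```python
import os
import tempfile

# Made-up sample of translated words in several alphabets.
translated = ["niño", "Straße", "Москва", "北京"]

path = os.path.join(tempfile.mkdtemp(), "translations.txt")

# Writing with an explicit encoding avoids depending on the platform default.
with open(path, "w", encoding="utf-8") as out:
    for word in translated:
        out.write(word + "\n")

# Reading back with the same encoding returns the original text unchanged.
with open(path, encoding="utf-8") as src:
    read_back = [line.rstrip("\n") for line in src]

print(read_back == translated)
```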

Contact for More Info

Leo to set up:

  • Skype conference
  • Database that has data for relating keyword IDs to Segment IDs
  • Query -> Lucene -> List -> DB -> Videos (we need every step for this to work)

Understand the Lucene Database

Work through the Lucene code and its initialization, and be able to make simple queries that return the related terms in an easy-to-use format.

Explain How to Codeshare

Git provides a powerful way to share code under version control, and it would be a great idea to adopt it in our project.

Automatic Translation

We agreed with @eschwartz1991 that I'll finish Russian and German by Friday morning (July 12), Eric will then do Persian and Swahili later on Friday, and Elena will run Arabic and Mandarin Chinese over the weekend, provided no other bugs arise.
