wikitrans's Introduction

WikiTrans

WikiTrans is an experiment in crowdsourcing the generation of training data for statistical machine translation. WikiTrans helps users translate Wikipedia entries from one language to another in a form that also provides parallel corpora for supervised machine learning.

WikiTrans is licensed under the Apache License, Version 2.0.

Information available here.

James Dennis <[email protected]>

wikitrans's People

Contributors

jmsdnns

wikitrans's Issues

manage.py mturk_pending_queue is incompatible with the version of boto in the requirements file

After running ./manage.py mturk_pending_queue I get the following error:

Traceback (most recent call last):
  File "./manage.py", line 28, in <module>
    execute_from_command_line()
  File "/Users/ccb/.virtualenvs/wt.dev/lib/python2.6/site-packages/django/core/management/__init__.py", line 352, in execute_from_command_line
    utility.execute()
  File "/Users/ccb/.virtualenvs/wt.dev/lib/python2.6/site-packages/django/core/management/__init__.py", line 306, in execute
    self.fetch_command(subcommand).run_from_argv(self.argv)
  File "/Users/ccb/.virtualenvs/wt.dev/lib/python2.6/site-packages/django/core/management/base.py", line 192, in run_from_argv
    self.execute(*args, **options.__dict__)
  File "/Users/ccb/.virtualenvs/wt.dev/lib/python2.6/site-packages/django/core/management/base.py", line 219, in execute
    output = self.handle(*args, **options)
  File "/Users/ccb/.virtualenvs/wt.dev/lib/python2.6/site-packages/django/core/management/base.py", line 348, in handle
    return self.handle_noargs(**options)
  File "/Users/ccb/Code/wikitrans/wt-app/apps/mturk_manager/management/commands/mturk_pending_queue.py", line 15, in handle_noargs
    handle_pending_task(task_item)
  File "/Users/ccb/Code/wikitrans/wt-app/apps/mturk_manager/workflow.py", line 248, in handle_pending_task
    handle_task(task_item, PENDING_FUNCTIONS)
  File "/Users/ccb/Code/wikitrans/wt-app/apps/mturk_manager/workflow.py", line 240, in handle_task
    retval = function(task_item, retval=retval)
  File "/Users/ccb/Code/wikitrans/wt-app/apps/wt_articles/mturk.py", line 168, in generate_question_forms
    qf = QuestionForm()
  File "/Users/ccb/.virtualenvs/wt.dev/lib/python2.6/site-packages/boto/mturk/question.py", line 145, in __init__
    raise ValueError("Must pass a list of Question instances to QuestionForm constructor")
ValueError: Must pass a list of Question instances to QuestionForm constructor

This appears to happen because the version of boto installed from the requirements file does not include James's patch.

Mechanical Turk workflow minor problems

In admin/mturk_manager/taskconfig/ the "reward per assignment" field has a max value of $0.99.
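One possible cause, offered only as an assumption since the issue does not show the model definition: if the reward is stored in a Django DecimalField declared with max_digits=2 and decimal_places=2, the largest storable value is exactly 0.99. The sketch below computes that ceiling in plain Python; the function name is hypothetical.

```python
from decimal import Decimal


def max_decimal_value(max_digits, decimal_places):
    """Largest value a fixed-point field with these limits can hold.

    Assumption: the field follows the usual rule that max_digits counts
    all digits, including those after the decimal point.
    """
    # Build the all-nines integer, then shift the decimal point left.
    return Decimal(10 ** max_digits - 1).scaleb(-decimal_places)


# Two total digits, both after the decimal point: capped below one dollar.
tight_limit = max_decimal_value(2, 2)

# Four total digits, two after the decimal point: allows rewards up to 99.99.
wider_limit = max_decimal_value(4, 2)
```

If this turns out to be the cause, raising max_digits on the field would lift the cap.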

After running ./manage.py translate_mturk or ./manage.py mturk_pending_queue, the articles should be removed from the Translation Request queue and put into a separate queue. Otherwise, already-uploaded articles will be resubmitted if we request translations of additional articles and then run the commands again.
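A minimal sketch of the requested behavior, assuming each request carries a status that records whether it has already been uploaded. The class, enum, and function names are hypothetical and are not taken from the WikiTrans code base.

```python
from dataclasses import dataclass
from enum import Enum


class Status(Enum):
    REQUESTED = "requested"  # still waiting in the Translation Request queue
    SUBMITTED = "submitted"  # already uploaded to Mechanical Turk


@dataclass
class TranslationRequest:
    article_title: str
    status: Status = Status.REQUESTED


def submit_pending(requests, upload):
    """Upload each not-yet-submitted request once and mark it as submitted."""
    submitted = []
    for request in requests:
        if request.status is not Status.REQUESTED:
            continue  # skip articles uploaded by an earlier run
        upload(request)
        request.status = Status.SUBMITTED
        submitted.append(request)
    return submitted
```

With this in place, running the command a second time after adding new articles would upload only the new ones.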

If we can get boto to return the groupID for the HITs, that would allow us to preview them on the mturk web site. For example: https://www.mturk.com/mturk/preview?groupId=D0FZ2M0F0X68RWCW32TZ
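Once a group ID is available, building the preview link is straightforward. The sketch below assumes the URL pattern shown above and uses a hypothetical helper name; it does not show how to obtain the group ID from boto.

```python
from urllib.parse import urlencode

# Base address taken from the example preview link above.
MTURK_PREVIEW_URL = "https://www.mturk.com/mturk/preview"


def preview_url(group_id):
    """Return the preview link for a HIT group (hypothetical helper)."""
    if not group_id:
        raise ValueError("group_id must be a non-empty string")
    return MTURK_PREVIEW_URL + "?" + urlencode({"groupId": group_id})


link = preview_url("D0FZ2M0F0X68RWCW32TZ")
```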

We need to add keywords to HITs through the API. This is how Turkers search for what to do.
