Code Monkey home page Code Monkey logo

calc-workflow's Introduction

LingPy: A Python Library for Automatic Tasks in Historical Linguistics

This repository contains the Python package lingpy which can be used for various tasks in computational historical linguistics.

Build Status DOI PyPI version Documentation

Authors (Version 2.6.12): Johann-Mattis List and Robert Forkel

Collaborators: Christoph Rzymski, Simon J. Greenhill, Steven Moran, Peter Bouda, Johannes Dellert, Taraka Rama, Tiago Tresoldi, Gereon Kaiping, Frank Nagel, and Patrick Elmer.

LingPy is a Python library for historical linguistics. It is being developed for Python 2.7 and Python 3.x using a single codebase.

Quick Installation

For our latest stable version, you can simply use pip or easy_install for installation:

$ pip install lingpy

or

$ pip install lingpy

Depending on which easy_install or pip version you use, either the Python2 or the Python3 version of LingPy will be installed.

If you want to install the current GitHub version of LingPy on your system, open a terminal and type in the following:

$ git clone https://github.com/lingpy/lingpy/
$ cd lingpy
$ python setup.py install

If the last command above returns you some error regarding user permissions (usually "Errno 13"), you can install LingPy in your home Python setup:

$ python setup.py install --user

In order to use the library, start an interactive Python session and import LingPy as follows:

>>> from lingpy import *

To install LingPy to hack on it, fork the repository on GitHub, open a terminal and type:

$ git clone https://github.com/<your-github-user>/lingpy/
$ cd lingpy
$ python setup.py develop

This will install LingPy in "development mode", i.e. you will be able edit the sources in the cloned repository and import the altered code just as the regular Python package.

calc-workflow's People

Contributors

lingulist avatar macyl avatar wu-urbanek avatar

Stargazers

 avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

Forkers

gaybro8777

calc-workflow's Issues

Chen orthography

A summary of Hmong-Mien languages phoneme inventories is provided from page 50 to page 68. I have typed page 50 to page 54 so far. See the progress in code/Chen_book_orthography.csv

[Question] Which dataset should we use?

I can demo either Chen HM or BeijingDaxue1964.

Chen HM :
Pro 1. I prepared the dataset, so I understand the segmentations
Pro 2. It's already on Lexibank
Con 1. I don't know HM languages

BeijingDaxue1964:
Pro 1. I understand the languages (at least Mandarin)
Pro 2. It's in cldf format
Con 1. It's not yet on Lexibank as a public repository

Which one is better?

Initial tasks for writing the paper

  • assemble orthography profile from the inventories of Chen (provided by Doug Cooper)
  • select as many languages overlapping in Chen and Ratliff for the demonstration and indicate which languages in Chen are Hmong and which are Mien
  • translate the concept list into English
  • refine the concepticon mapping (check for identical chinese terms in Huang 1992, where English terms can be found)
  • select the concepts we want to use for the study (high overlap with concepts in Ratliff and in work of Nathan Hill on Burmish)
  • restructure the paper following discussion from April 16 (suggestions by @LinguList)

[to do task] handout and speech script

handout

  • illustration - raw data to wordlist
  • illustration - segmented data to cognate sets
  • illustration - cognate sets to alignments

speech script

  • partial cognates

template alignment

When judging the crossids, the program should consider template alignment ( "STRUCTURE" column) instead of the ALIGNMENT column

Chen and Martha

Upload a list for languages which are used both in Martha Ratliff's book and Chen's book

concept refining

Those concepts that are unlinked to CONCEPTICON should be removed from the dataset.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.