Code and data for the paper "The Crosslinguistic Relationship between Ordering Flexibility and Dependency Length Minimization: A Data-Driven Approach"
-
Take data from the Universal Dependencies project
-
Extract instances and calculate a flexibility score for each language
python3 codes/flexibility.py --input INPUT_TO_UD_DATA --output data/
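
For orientation, here is a minimal sketch of how dependency lengths can be read off a UD treebank in CoNLL-U format. It is not the code in codes/flexibility.py: the function names `read_conllu` and `total_dependency_length` are hypothetical, and dependency length is assumed to be the absolute distance between a token's position and its head's position, which is one common definition.

```python
from typing import Iterable, Iterator, List, Tuple

Sentence = List[Tuple[int, int]]  # (token_id, head_id) pairs


def read_conllu(lines: Iterable[str]) -> Iterator[Sentence]:
    """Yield each sentence as a list of (token_id, head_id) pairs."""
    sentence: Sentence = []
    for raw in lines:
        line = raw.rstrip("\n")
        if not line.strip():
            # A blank line ends the current sentence.
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):
            continue  # sentence-level comment such as "# text = ..."
        cols = line.split("\t")
        if len(cols) < 7:
            continue  # malformed line
        # Skip multiword-token ranges ("1-2") and empty nodes ("1.1"),
        # as well as tokens whose HEAD is unspecified ("_").
        if not cols[0].isdigit() or not cols[6].isdigit():
            continue
        sentence.append((int(cols[0]), int(cols[6])))
    if sentence:
        yield sentence


def total_dependency_length(sentence: Sentence) -> int:
    """Sum |token_id - head_id| over all tokens except the root (head 0)."""
    return sum(abs(tok - head) for tok, head in sentence if head != 0)


# Tiny hand-written sample in CoNLL-U layout (10 tab-separated columns).
SAMPLE = (
    "# text = The dog barked\n"
    "1\tThe\tthe\tDET\t_\t_\t2\tdet\t_\t_\n"
    "2\tdog\tdog\tNOUN\t_\t_\t3\tnsubj\t_\t_\n"
    "3\tbarked\tbark\tVERB\t_\t_\t0\troot\t_\t_\n"
)

if __name__ == "__main__":
    for sent in read_conllu(SAMPLE.splitlines()):
        print(total_dependency_length(sent))  # prints 2
```

To use it on a real treebank, pass an open file object to `read_conllu` instead of the sample lines.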
-
Generate data for regression
python3 codes/data-process.py --input data/ --output data/
-
Run regression
python3 codes/lr.py --input data/ --output data/
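
The three commands above can also be chained from one small driver script. This is only a convenience sketch: the script and its function names (`build_commands`, `run_pipeline`) are hypothetical and not part of the repository, and it assumes the scripts are run from the repository root with exactly the `--input` and `--output` flags shown above.

```python
import subprocess
import sys
from typing import List


def build_commands(ud_path: str, data_dir: str = "data/") -> List[List[str]]:
    """Return the three pipeline commands, in order, as argument lists."""
    return [
        ["python3", "codes/flexibility.py", "--input", ud_path, "--output", data_dir],
        ["python3", "codes/data-process.py", "--input", data_dir, "--output", data_dir],
        ["python3", "codes/lr.py", "--input", data_dir, "--output", data_dir],
    ]


def run_pipeline(ud_path: str, data_dir: str = "data/") -> None:
    """Run each step in turn; check=True stops at the first failing step."""
    for cmd in build_commands(ud_path, data_dir):
        print("Running:", " ".join(cmd))
        subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Usage (hypothetical file name): python3 run_all.py PATH_TO_UD_DATA
    if len(sys.argv) != 2:
        sys.exit("usage: run_all.py PATH_TO_UD_DATA")
    run_pipeline(sys.argv[1])
```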
-
Collect coefficients from each language (plot/corr.csv)
-
Run the analysis to test the relationship between flexibility and dependency length minimization (DLM), and to draw the graphs (codes/analysis.R)