A Joint Chinese segmentation and POS tagger based on bidirectional GRU-CRF
Python 2.7
TensorFlow 0.11.0 (Newer versions will be supported in the furture)
Pygame (Convert Chinese characters into pictures)
Yan Shao, Christian Hardmeier, Jörg Tiedemann and Joakim Nivre. "Character-based Joint Segmentation and POS Tagging for Chinese using Bidirectional RNN-CRF" arXiv preprint arXiv: 1704.01314 (2017).
https://arxiv.org/pdf/1704.01314.pdf
To reproduce the results reported in the paper:
python tagger.py train -p ud1 -t train.txt -d dev.txt -wv -cp -rd -gru -m model_ud1 -emb Embeddings/glove.txt
python tagger.py test -p ud1 -e test.txt -m gru_full_ud1 -emb Embeddings/glove.txt
python tagger.py train -p ud1 -t train.txt -d dev.txt -wv -cp -rd -gru -m model_ud1_1 -emb Embeddings/glove.txt
python tagger.py train -p ud1 -t train.txt -d dev.txt -wv -cp -rd -gru -m model_ud1_2 -emb Embeddings/glove.txt
python tagger.py train -p ud1 -t train.txt -d dev.txt -wv -cp -rd -gru -m model_ud1_3 -emb Embeddings/glove.txt
python tagger.py train -p ud1 -t train.txt -d dev.txt -wv -cp -rd -gru -m model_ud1_4 -emb Embeddings/glove.txt
python tagger.py test -ens -p ud1 -e test.txt -m model_ud1 -emb Embeddings/glove.txt