Skip to content

mcmillanmajora/IGTautoglossing

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Dependecies

For part of speech and lemmatization https://github.com/stanfordnlp/python-stanford-corenlp

For handling ODIN corpora https://github.com/xigt/xigt

For the CRF models https://sklearn-crfsuite.readthedocs.io/en/latest/

For saving the models https://github.com/joblib/joblib

Data

Download the ODIN data from http://depts.washington.edu/uwcl/odin/

Command Line Usage

For first time usage:

python3 ./main.py ./path_to_corpus_file ./path_to_output_file

If you want to reload already trained models:

python3 ./main.py ./path_to_corpus_file ./path_to_output_file ./path_to_saved_models_file

Recommended Citation

Please cite as:

McMillan-Major, Angelina (2020) "Automating Gloss Generation in Interlinear Glossed Text," Proceedings of the Society for Computation in Linguistics: Vol. 3 , Article 33. Available at: https://scholarworks.umass.edu/scil/vol3/iss1/33

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages