w2v-pl-186891476-20000-300-5-5-plwiki-20170820
emsi released this 04 Sep 20:07
Polish Word2Vec
corpus size: 186891476 words / 1.3GB
vocabulary size: 20000
vector size: 300
window size: 5
negative sampling: 5
corpus source: plwiki-20170820
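
The release name appears to encode the parameters listed above, in the same order. The following is a minimal sketch of reading them back out of the name, assuming that field order holds for this release. `ReleaseParams` and `parse_release_name` are hypothetical helper names used for illustration and are not part of this repository.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ReleaseParams:
    """Hypothetical container for the fields encoded in a release name."""
    language: str
    corpus_words: int
    vocabulary_size: int
    vector_size: int
    window_size: int
    negative_samples: int
    corpus_source: str


def parse_release_name(name: str) -> ReleaseParams:
    """Split a name such as 'w2v-pl-<words>-<vocab>-<dim>-<window>-<neg>-<source>'.

    The field order is an assumption inferred from the parameter list above.
    """
    parts = name.split("-")
    if len(parts) < 8 or parts[0] != "w2v":
        raise ValueError(f"unexpected release name: {name!r}")

    language = parts[1]
    # The five numeric fields follow the language code.
    words, vocab, dim, window, negative = (int(p) for p in parts[2:7])
    # The corpus source may itself contain hyphens, so rejoin the remainder.
    corpus_source = "-".join(parts[7:])

    return ReleaseParams(language, words, vocab, dim, window, negative, corpus_source)


params = parse_release_name("w2v-pl-186891476-20000-300-5-5-plwiki-20170820")
print(params)
```

Rejoining the trailing parts keeps the corpus source intact even though it contains a hyphen itself.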