Skip to content

Augmenting character level Transformers with Causal Dilated Conv1D layers

Notifications You must be signed in to change notification settings

takeraparterer/Charformer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Charformer

Augmenting character level Transformers with Causal Dilated Conv1D layers

You do first need to download a parquet file, and either rename it to 2013.parquet or adjust the file names in make_vocab.py and charformer_main.py

make_vocab.py - create vocab, run this first

charformer_main.py - train model

run_charformer.py - load and demo model

charformer_model.py - utils and model file

Note: this has nothing to do with the previous architectures developed with the same name.

About

Augmenting character level Transformers with Causal Dilated Conv1D layers

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages