PyTorch implementation of the Transformer architecture presented in the paper "Attention Is All You Need"
Original Paper : https://arxiv.org/abs/1706.03762
My Implementation : see Notebook
PyTorch implementation of the Vision Transformer (ViT) presented in the paper "An Image Is Worth 16x16 Words: Transformers for Image Recognition at Scale"
Original Paper : https://arxiv.org/abs/2010.11929
My Implementation : see Notebook
Note: Both implementations use PyTorch's high-performance Scaled Dot Product Attention (SDPA). I tried to keep the code as clean as possible without sacrificing performance.
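As a minimal sketch of how SDPA is typically called inside an attention block (illustrative only, with made-up tensor shapes; not the exact code from the notebooks):

```python
import torch
import torch.nn.functional as F

# Illustrative shapes: (batch, num_heads, seq_len, head_dim)
q = torch.randn(2, 8, 16, 64)
k = torch.randn(2, 8, 16, 64)
v = torch.randn(2, 8, 16, 64)

# SDPA computes softmax(Q K^T / sqrt(d)) V in one fused call,
# dispatching to optimized backends (e.g. FlashAttention) when available.
# is_causal=True applies the causal mask used in decoder self-attention;
# ViT-style encoder attention would omit it.
out = F.scaled_dot_product_attention(q, k, v, is_causal=True)
print(out.shape)  # torch.Size([2, 8, 16, 64])
```

Requires PyTorch >= 2.0, where `scaled_dot_product_attention` was introduced.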