Experimenting with implementing the transformer architecture from the paper "Attention is All You Need" by Vaswani et al. (2017) in PyTorch. Under Development