"Gradient descent can write code better than you. I'm sorry", - Andrej Karpathy
ppetroskevicius / attention-is-all-you-need-pytorch Goto Github PK
View Code? Open in Web Editor NEWThis project forked from jadore801120/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
License: MIT License