Contact
Ervin Tasnadi’s blog
GPU programming & deep learning
Gradient of the attention op
Oct 9