Introduction to Neural Network Transformers (10.4)
In this video I provide an introduction to transformers, which include attention, residual connections, dropout, and dense layers.
Code for This Video:
https://github.com/jeffheaton/t81_558_deep_learning/blob/master/t81_558_class_10_4_intro_transformers.ipynb
Course Homepage: https://sites.wustl.edu/jeffheaton/t81-558/
Follow Me/Subscribe:
https://www.youtube.com/user/HeatonResearch
https://github.com/jeffheaton
Tweets by jeffheaton
Support Me on Patreon: https://www.patreon.com/jeffheaton