Transformer architecture

In this lecture, we introduce the transformer architecture, the most widely used model in modern NLP.

In this post, we will introduce the architecture.