14 October 2025
In this lecture, we introduce the transformer architecture, the most widely used model in modern NLP.
1Lecture 05 - Conjugate Gradient Method (CGM)
13 October 2025
In this lecture, we introduce the conjugate gradient method (CGM) for solving the system of linear equations.
212 October 2025
My post is built based on Gregory Gundersen's blog-theme.
3