This repository contains step-by-step implementations of the core components of the GPT2 architecture in PyTorch.
It is designed as a learning project for understanding how GPT2 works.
Clone the repository, start Jupyter from the repository root, and run the notebooks in order:
jupyter notebook