xc308/Understanding-Transformer-Attention-Weights

Understanding-Transformer-Attention-Weights

This repository contains code to compute and visualise the attention weights of a Transformer model, layer by layer and head by head.
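As a rough illustration of what "attention weights per layer and per head" means, here is a minimal sketch of scaled dot-product attention weights, softmax(QK^T / sqrt(d_k)), in plain Python. It is not the code from this repository: the function names, the toy sizes (`NUM_LAYERS`, `NUM_HEADS`, `SEQ_LEN`, `D_K`), and the random query/key vectors are all assumptions made for the example, and a real model would supply learned projections instead.

```python
import math
import random


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def attention_weights(queries, keys):
    """Scaled dot-product attention weights for a single head.

    queries, keys: lists of seq_len vectors, each of length d_k.
    Returns a seq_len x seq_len matrix; row i holds how strongly
    position i attends to every position, and each row sums to 1.
    """
    d_k = len(keys[0])
    scale = 1.0 / math.sqrt(d_k)
    weights = []
    for q in queries:
        scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys]
        weights.append(softmax(scores))
    return weights


def random_vectors(rng, seq_len, d_k):
    """Stand-in for projected queries or keys (hypothetical toy data)."""
    return [[rng.gauss(0.0, 1.0) for _ in range(d_k)] for _ in range(seq_len)]


# Hypothetical toy sizes, chosen only for illustration.
NUM_LAYERS, NUM_HEADS, SEQ_LEN, D_K = 2, 3, 4, 8
rng = random.Random(0)

# all_weights[layer][head] is a SEQ_LEN x SEQ_LEN attention matrix.
all_weights = [
    [
        attention_weights(
            random_vectors(rng, SEQ_LEN, D_K),
            random_vectors(rng, SEQ_LEN, D_K),
        )
        for _ in range(NUM_HEADS)
    ]
    for _ in range(NUM_LAYERS)
]

for layer, heads in enumerate(all_weights):
    for head, w in enumerate(heads):
        print(f"layer {layer} head {head} row 0:", [round(x, 3) for x in w[0]])
```

Each `all_weights[layer][head]` matrix is the kind of object one would typically render as a heatmap, with one panel per layer and head; the sketch prints a single row per head instead so that it needs only the standard library.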
