Transformer Architectures for Generative AI

This repository contains the code for the O'Reilly Live Online Training "Transformer Architectures for Generative AI".

This course provides a deep understanding of transformer architectures and their impact on both natural language processing (NLP) and vision tasks. Transformers now underpin many state-of-the-art models, so the material is aimed at professionals who want to keep up with current AI practice. By combining theory with practical exercises, participants learn how to apply transformers to complex problems in text, image, and multimodal AI.

Notebooks

Introduction to LLMs

BERT - the beginnings of LLMs
- Introduction to BERT
T5 - the beginnings of instructional alignment
- Off the shelf NLP with T5
GPT - How LLMs learned to talk
- Introduction to GPT
Multimodal LLMs
- Image Captioning with Vision Transformers
[Inspecting LLM token embeddings](notebooks/LLM Embeddings.ipynb) - Explore how different attention mechanisms lead to different token embeddings
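
As a rough illustration of why the attention mechanism changes token embeddings, the sketch below runs a toy single-head self-attention over the same three input vectors twice: once bidirectionally (BERT-style) and once with a causal mask (GPT-style). It is not taken from the notebook. The identity query/key/value projections and the 2-dimensional input vectors are simplifying assumptions made only for this example.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(vectors, causal):
    """Toy single-head self-attention with identity Q/K/V projections (an assumption)."""
    dim = len(vectors[0])
    outputs = []
    for i, query in enumerate(vectors):
        # Causal: token i sees positions 0..i. Bidirectional: token i sees every position.
        visible = vectors[: i + 1] if causal else vectors
        scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(dim) for key in visible]
        weights = softmax(scores)
        outputs.append([sum(w * v[d] for w, v in zip(weights, visible)) for d in range(dim)])
    return outputs

# Hypothetical 2-d input embeddings for a three-token sequence
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
bidirectional = self_attention(tokens, causal=False)
causal = self_attention(tokens, causal=True)

for i, (b, c) in enumerate(zip(bidirectional, causal)):
    print(f"token {i}: bidirectional={b} causal={c}")
```

Under the causal mask the first token can only attend to itself, so its output equals its input, while the bidirectional version mixes in the later tokens. The last token sees the full sequence in both cases and ends up with the same vector.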

Advanced LLMs

LLM Embedding

Rivaling OpenAI embeddings with fine-tuning - Fine-tune Embeddings with Synthetic Data

LLM Classification

bert_app_review.ipynb: Fine-tuning a BERT model for app review classification.
openai_app_review_fine_tuning.ipynb: Fine-tuning OpenAI models for app review classification.

Multimodal

Stock Image Search - Using a CLIP model to build an image search system
Visual Q/A
- constructing_a_vqa_system.ipynb: Step-by-step guide to constructing a Visual Question Answering (VQA) system using GPT-2 and Vision Transformer.
- using_our_vqa.ipynb: Using the VQA system built in the previous notebook.
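
For the Stock Image Search entry above, the core retrieval step can be sketched without any model at all: once a CLIP-style model has mapped images and a text query into the same embedding space, search reduces to ranking images by cosine similarity to the query. The file names and 3-dimensional vectors below are hypothetical stand-ins for real model outputs, and the notebook's own code may differ.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def search(query_embedding, image_index, top_k=2):
    """Return the names of the top_k images most similar to the query embedding."""
    scored = [(cosine_similarity(query_embedding, emb), name) for name, emb in image_index.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [name for _, name in scored[:top_k]]

# Hypothetical image embeddings; in practice these would come from an image encoder
image_index = {
    "beach.jpg": [0.9, 0.1, 0.0],
    "city.jpg": [0.1, 0.9, 0.1],
    "forest.jpg": [0.0, 0.2, 0.9],
}
# Hypothetical text-query embedding; in practice this would come from a text encoder
query = [0.8, 0.2, 0.1]

results = search(query, image_index)
print(results)
```

A linear scan like this is fine for a few thousand images; larger collections usually move the same similarity lookup into a vector index.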

SAWYER - Instructional Fine-tuning

SAWYER_LLAMA_SFT.ipynb: Fine-tuning the Llama-3 model to create the SAWYER bot.
SAWYER_Reward_Model.ipynb: Training a reward model from human preferences for the SAWYER bot.
SAWYER_RLF.ipynb: Applying Reinforcement Learning from Human Feedback (RLHF) to align the SAWYER bot.
SAWYER_USE_SAWYER.ipynb: Using the SAWYER bot.
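
To make the reward-model step more concrete, here is a minimal sketch of the pairwise preference loss commonly used when training a reward model from human preferences: the model should score the preferred response higher than the rejected one. This is an assumption about the general technique, not a copy of the SAWYER notebook, and the scalar reward values are made up for illustration.

```python
import math

def pairwise_preference_loss(reward_chosen, reward_rejected):
    """Bradley-Terry style loss: -log(sigmoid(reward_chosen - reward_rejected))."""
    margin = reward_chosen - reward_rejected
    # log(1 + exp(-margin)) is algebraically equal to -log(sigmoid(margin))
    return math.log1p(math.exp(-margin))

# Hypothetical scalar rewards assigned to a (chosen, rejected) response pair
good = pairwise_preference_loss(2.0, -1.0)   # chosen scored well above rejected
tie = pairwise_preference_loss(0.5, 0.5)     # model cannot tell them apart
bad = pairwise_preference_loss(-1.0, 2.0)    # model prefers the rejected response

print(f"good={good:.4f} tie={tie:.4f} bad={bad:.4f}")
```

The loss shrinks toward zero as the chosen response's reward pulls ahead and grows when the ordering is wrong, which is the signal that later guides the RLHF stage.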

Distillation

Go Emotion Distillation: Exploring knowledge distillation techniques for transformer models.
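
One widely used form of knowledge distillation trains the student to match the teacher's temperature-softened output distribution. The sketch below computes that loss for a single example in plain Python. It assumes the soft-label KL formulation; the notebook may combine it with a hard-label loss or use a different setup, and the logits here are invented for illustration.

```python
import math

def softmax(logits, temperature=1.0):
    """Softmax over logits divided by a temperature (higher = softer distribution)."""
    scaled = [value / temperature for value in logits]
    m = max(scaled)
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by temperature squared."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return kl * temperature ** 2

# Hypothetical logits over three emotion classes
teacher = [4.0, 1.0, 0.5]
close_student = [3.5, 1.2, 0.4]
far_student = [0.5, 1.0, 4.0]

print(distillation_loss(close_student, teacher))
print(distillation_loss(far_student, teacher))
```

The temperature-squared factor keeps gradient magnitudes comparable across temperatures, which matters when this term is mixed with a standard cross-entropy loss.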

Agents / RAG

RAG Retrieval: An introduction to vector databases, embeddings, and retrieval
Evaluating Tool Selection - Calculating the accuracy of tool selection between different LLMs and quantifying the positional bias present in auto-regressive LLMs
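
The two measurements named in the tool selection entry can be sketched as follows: accuracy is the share of trials where the model picked the correct tool, and positional bias can be approximated by how often the chosen tool sat at each position in the prompt. The trial format, tool names, and `tool_selection_report` helper are hypothetical and only illustrate the bookkeeping; the notebook may quantify bias differently.

```python
from collections import Counter

def tool_selection_report(trials):
    """Return (accuracy, selection rate per prompt position) for a list of trials.

    Each trial is a dict with 'options' (tools in the order shown to the model),
    'chosen' (the tool the model picked), and 'correct' (the expected tool).
    """
    correct = sum(t["chosen"] == t["correct"] for t in trials)
    accuracy = correct / len(trials)
    chosen_positions = Counter(t["options"].index(t["chosen"]) for t in trials)
    position_rate = {pos: n / len(trials) for pos, n in sorted(chosen_positions.items())}
    return accuracy, position_rate

# Hypothetical trials with the tool order shuffled between prompts
trials = [
    {"options": ["search", "calculator", "calendar"], "chosen": "search", "correct": "search"},
    {"options": ["calculator", "calendar", "search"], "chosen": "calculator", "correct": "search"},
    {"options": ["calendar", "search", "calculator"], "chosen": "calendar", "correct": "calendar"},
    {"options": ["calculator", "search", "calendar"], "chosen": "search", "correct": "search"},
]

accuracy, position_rate = tool_selection_report(trials)
print(accuracy, position_rate)
```

If the correct tool is shuffled uniformly across positions, a selection rate that piles up on one position suggests the model is favoring where a tool appears rather than what it does.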

Probing

Probing Chess Playing LLMs
There are over a dozen notebooks for the birth year/death year probing example, so I will only share a few key ones here:

Instructor

Sinan Ozdemir is the founder and CTO of LoopGenius, where he uses state-of-the-art AI to help people create and run their businesses. He has lectured in data science at Johns Hopkins University and authored multiple books, videos, and numerous online courses on data science, machine learning, and generative AI. He also founded the recently acquired Kylie.ai, an enterprise-grade conversational AI platform with RPA capabilities. Sinan most recently published Quick Guide to Large Language Models and launched a podcast audio series, AI Unveiled. Ozdemir holds a master’s degree in pure mathematics from Johns Hopkins University.

sinanuozdemir/foundations-of-gen-ai
