This tutorial presents a detailed, line-by-line PyTorch implementation of the Transformer model introduced in "Attention Is All You Need." It walks through the model's architecture, an encoder-decoder structure built from multi-head self-attention and position-wise feed-forward layers, with annotated code and explanations throughout. The result serves both as an educational resource and as a practical guide to implementing and understanding Transformer-based models.
The Transformer has been on a lot of people's minds over the last five years. This post presents an annotated version of the paper in the form of a line-by-line implementation. It reorders and deletes some sections from the original paper and adds comments throughout. This document itself is a working notebook and should be a completely usable implementation. Code is available here.
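As a preview of the architecture described above, here is a minimal PyTorch sketch of the encoder-decoder skeleton the post builds up. The module and argument names (`EncoderDecoder`, `src_embed`, `generator`, and so on) are illustrative placeholders, not a definitive API; the tutorial fills in each component (multi-head attention, feed-forward sublayers, embeddings) step by step.

```python
import torch.nn as nn


class EncoderDecoder(nn.Module):
    """Illustrative skeleton of a standard encoder-decoder Transformer.

    Each component is passed in as a module; the body of the tutorial
    defines concrete versions built from multi-head self-attention and
    position-wise feed-forward layers.
    """

    def __init__(self, encoder, decoder, src_embed, tgt_embed, generator):
        super().__init__()
        self.encoder = encoder      # stack of self-attention + feed-forward layers
        self.decoder = decoder      # same, plus cross-attention over encoder output
        self.src_embed = src_embed  # token embedding + positional encoding (source)
        self.tgt_embed = tgt_embed  # token embedding + positional encoding (target)
        self.generator = generator  # projection from hidden states to vocabulary logits

    def forward(self, src, tgt, src_mask, tgt_mask):
        # Encode the source sequence, then condition the decoder on the result.
        memory = self.encoder(self.src_embed(src), src_mask)
        return self.decoder(self.tgt_embed(tgt), memory, src_mask, tgt_mask)
```

The design choice here mirrors the paper: encoding and decoding are kept as separate modules so that the source is processed once into a `memory` tensor, which the decoder then attends to at every step of target generation.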