Skip to content

Instantly share code, notes, and snippets.

@moudgalyakvs
Forked from veekaybee/normcore-llm.md
Created October 6, 2024 04:24
Show Gist options
  • Save moudgalyakvs/8e87d75bf2b7b59e49a68f7226fd5482 to your computer and use it in GitHub Desktop.
Save moudgalyakvs/8e87d75bf2b7b59e49a68f7226fd5482 to your computer and use it in GitHub Desktop.
Normcore LLM Reads

Anti-hype LLM reading list

Goals: Add links that are reasonable and good explanations of how stuff works. No hype and no vendor content if possible. Practical first-hand accounts of models in prod eagerly sought.

Foundational Concepts

Pre-Transformer Models

Screenshot 2023-12-18 at 8 25 42 PM

Building Blocks

Screenshot 2023-12-18 at 8 33 35 PM

Foundational Deep Learning Papers

Screenshot 2023-12-18 at 8 35 18 PM

The Transformer Architecture

Screenshot 2023-12-18 at 8 37 44 PM

Attention

GPT

Screenshot 2023-12-18 at 8 37 44 PM

LLMs in 2023

Training Data

Pre-Training

RLHF and DPO

Fine-Tuning and Compression

Small and Local LLMs

Deployment and Production

Prompt Engineering

GPUs

Evaluation

UX

What's Next?

Thanks to everyone who added suggestions on Twitter, Mastodon, and Bluesky.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment