Goals: add links that are reasonable, well-written explanations of how stuff works. No hype, and no vendor content if possible. Practical first-hand accounts of running models in prod are eagerly sought.
- The Illustrated Word2vec - A Gentle Intro to Word Embeddings in Machine Learning (YouTube)
- Transformers as Support Vector Machines
- Survey of LLMs
- Deep Learning Systems
- Fundamental ML Reading List
- What are embeddings
- Concepts from Operating Systems that Found their way into LLMs
- Talking about Large Language Models
- Language Modeling is Compression
- Vector Search - Long-Term Memory in AI
- Eight things to know about large language models
- BERT
- Seq2Seq
- Attention Is All You Need
- Scaling Laws for Neural Language Models
- Language Models are Unsupervised Multitask Learners
- Training Language Models to Follow Instructions
- Language Models are Few-Shot Learners
- Transformers from Scratch
- Transformer Math
- Five Years of GPT Progress
- Lost in the Middle: How Language Models Use Long Contexts
- Self-attention and transformer networks
- Attention
- Understanding and Coding the Attention Mechanism
- Attention Mechanisms
- Keys, Queries, and Values - see the toy attention sketch after this list
- What is ChatGPT doing and why does it work
- My own notes from a few months back.
- Karpathy's The State of GPT (YouTube)
- How open are open architectures?
- Catching up on the weird world of LLMs
- Building an LLM from Scratch
- Large Language Models in 2023 (talk and slides)
- Why host your own LLM?
- How to train your own LLMs
- Hugging Face Resources on Training Your Own
- Training Compute-Optimal Large Language Models
- OPT-175B Logbook
- The Complete Guide to LLM Fine-tuning
- LLaMAntino: LLaMA 2 Models for Effective Text Generation in Italian Language - Really great overview of SOTA fine-tuning techniques
- A Gentle Introduction to 8-bit matrix multiplication
- Motivation for Parameter-Efficient Fine-tuning
- Which Quantization Method is Right for You?
- Fine-tuning with LoRA and QLoRA
- Fine-tuning RedPajama on Slack Data
- How is llama.cpp Possible?
- How to beat GPT-4 with a 13B Model
- Efficient LLM Inference on CPUs
- Tiny Language Models Come of Age
- Efficiency LLM Spectrum
- TinyML at MIT
- Building LLM Applications for Production
- Challenges and Applications of Large Language Models
- All the Hard Stuff Nobody talks about when building products with LLMs
- Scaling Kubernetes to run ChatGPT
- Numbers every LLM Developer should know - see the back-of-the-envelope memory sketch after this list
- Against LLM Maximalism
- A Guide to Inference and Performance
- (InThe)WildChat: 570K ChatGPT Interaction Logs In The Wild
- LLM Inference Performance Engineering: Best Practices
- The State of Production LLMs in 2023
- Machine Learning Engineering - an open book on successful training of large language models and multi-modal models
- The Best GPUs for Deep Learning 2023
- Making Deep Learning Go Brrrr From First Principles
- Everything about Distributed Training and Efficient Finetuning
- Training LLMs at Scale with AMD MI250 GPUs
- GPU Programming
- Evaluating ChatGPT
- ChatGPT: Jack of All Trades, Master of None
- What's Going on with the Open LLM Leaderboard
- Challenges in Evaluating AI Systems
- LLM Evaluation Papers
- Evaluating LLMs is a Minefield
- Generative Interfaces Beyond Chat (YouTube)
- Why Chatbots are not the Future
- The Future of Search is Boutique
- As a Large Language Model, I
- Natural Language is an Unnatural Interface
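
Since several of the links above (Self-attention and transformer networks; Keys, Queries, and Values) cover the same core mechanism, here is a minimal sketch of scaled dot-product attention in NumPy. The shapes, random inputs, and function names are illustrative assumptions, not code from any of the linked posts.

```python
# Toy scaled dot-product attention, NumPy only.
# Shapes and random inputs are illustrative; real implementations add
# masking, multiple heads, and learned projection weights.
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)  # subtract max for numerical stability
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def scaled_dot_product_attention(Q, K, V):
    # Q: (seq_q, d_k), K: (seq_k, d_k), V: (seq_k, d_v)
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)      # similarity of each query to each key
    weights = softmax(scores, axis=-1)   # each row is a distribution over keys
    return weights @ V                   # weighted mix of values per query

rng = np.random.default_rng(0)
seq, d_k, d_v = 4, 8, 8
Q = rng.normal(size=(seq, d_k))
K = rng.normal(size=(seq, d_k))
V = rng.normal(size=(seq, d_v))
out = scaled_dot_product_attention(Q, K, V)
print(out.shape)  # (4, 8)
```

Multi-head attention runs several of these in parallel over learned linear projections of the same inputs and concatenates the results; the linked posts cover the details.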
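
And in the spirit of Transformer Math and Numbers every LLM Developer should know, a back-of-the-envelope memory sketch: weight memory is roughly parameter count times bytes per parameter, and the KV cache grows linearly with sequence length. The dimensions below are assumptions, loosely shaped like a 7B Llama-style model; substitute your own.

```python
# Back-of-the-envelope inference memory estimates.
# Model dimensions below are assumptions (roughly 7B Llama-shaped);
# real memory use also includes activations and framework overhead.

def weight_memory_gb(n_params: float, bytes_per_param: float) -> float:
    return n_params * bytes_per_param / 1e9

def kv_cache_gb(n_layers: int, n_kv_heads: int, head_dim: int,
                seq_len: int, batch: int, bytes_per_value: float) -> float:
    # 2x for keys and values, cached at every layer for every token
    return (2 * n_layers * n_kv_heads * head_dim
            * seq_len * batch * bytes_per_value / 1e9)

n_params = 7e9
for name, bytes_per_param in [("fp16", 2), ("int8", 1), ("int4", 0.5)]:
    print(f"{name} weights: ~{weight_memory_gb(n_params, bytes_per_param):.1f} GB")

# Assumed dimensions: 32 layers, 32 KV heads, head_dim 128, fp16 cache
print(f"KV cache, 4k ctx, batch 1: ~{kv_cache_gb(32, 32, 128, 4096, 1, 2):.2f} GB")
```

Those two numbers are most of the story behind why quantization and shorter contexts are the first levers people reach for when fitting a model on a single GPU.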
Thanks to everyone who added suggestions on Twitter, Mastodon, and Bluesky.