ChatGPT is a sibling model to InstructGPT, which is trained to follow an instruction in a prompt and provide a detailed response.
- My Twitter Thread Question on Training Data
- Books1 and Books2 - Books1 Resources
- BookCorpus paper
- What's in My AI? paper, Source
- The model's training data goes up to 2021, and inference runs offline (the model does not access the internet at query time):
We trained this model using Reinforcement Learning from Human Feedback (RLHF), using the same methods as InstructGPT, but with slight differences in the data collection setup.
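In broad strokes, RLHF fine-tunes a pre-trained (or supervised fine-tuned) policy against a learned reward model, with a KL penalty that keeps the policy close to its reference. The toy sketch below illustrates that loop on a five-word "vocabulary" of response styles, with a hard-coded stand-in for the reward model and plain REINFORCE in place of full PPO; none of the names, sizes, or rewards come from OpenAI's actual setup.

```python
# Toy RLHF loop: policy-gradient fine-tuning against a stand-in reward model,
# with a KL-style penalty toward the reference policy. Everything here is an
# illustrative assumption, not OpenAI's training code.
import numpy as np

rng = np.random.default_rng(0)

VOCAB = ["helpful", "verbose", "rude", "concise", "off-topic"]
N = len(VOCAB)

# "Reference" (pre-trained / SFT) policy: fixed logits over responses.
ref_logits = rng.normal(size=N)
# Trainable policy starts as a copy of the reference.
logits = ref_logits.copy()

def softmax(x):
    z = np.exp(x - x.max())
    return z / z.sum()

# Stand-in for a learned reward model: prefers "helpful"/"concise" responses.
reward_model = {"helpful": 1.0, "concise": 0.8, "verbose": -0.2,
                "rude": -1.0, "off-topic": -0.7}

LR, KL_COEF, STEPS, BATCH = 0.5, 0.1, 200, 64

for _ in range(STEPS):
    probs = softmax(logits)
    ref_probs = softmax(ref_logits)
    # Sample a batch of "responses" from the current policy.
    samples = rng.choice(N, size=BATCH, p=probs)
    grad = np.zeros(N)
    for i in samples:
        # Reward from the reward model, minus a KL-style penalty that keeps
        # the policy close to the reference model (the hallmark of RLHF).
        r = reward_model[VOCAB[i]] - KL_COEF * np.log(probs[i] / ref_probs[i])
        # REINFORCE gradient for a categorical policy: (one_hot - probs) * r
        one_hot = np.zeros(N)
        one_hot[i] = 1.0
        grad += (one_hot - probs) * r
    logits += LR * grad / BATCH

# Probability mass should have shifted toward "helpful" and "concise".
print({v: round(p, 3) for v, p in zip(VOCAB, softmax(logits))})
```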
- My Twitter Thread Question on the Model
- Language Models are Few-Shot Learners: GPT3
- Model: reinforcement learning for language models
- Illustrating Reinforcement Learning from Human Feedback (RLHF) Tutorial
- InstructGPT Blog Post
- InstructGPT Model Card
- Models Referred to as GPT-3.5
- OpenAI comes clean about GPT-3.5
- Possibly text-davinci-003
- Azure
- K8s Source here
A large machine learning job spans many nodes and runs most efficiently when it has access to all of the hardware resources on each node. This allows GPUs to cross-communicate directly using NVLink, or GPUs to directly communicate with the NIC using GPUDirect. So for many of our workloads, a single pod occupies the entire node.
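The "single pod occupies the entire node" pattern usually shows up in the manifest as a resource request for every GPU on the machine, so the scheduler cannot bin-pack anything else alongside it. A minimal sketch, assuming 8 GPUs per node and a hypothetical image and names (not OpenAI's actual configuration):

```python
# Sketch of a pod that claims all GPUs on a node, so nothing else can be
# scheduled there. GPU count, image, and names are assumptions.
# kubectl accepts JSON manifests directly.
import json

pod = {
    "apiVersion": "v1",
    "kind": "Pod",
    "metadata": {"name": "train-worker-0", "labels": {"job": "big-train"}},
    "spec": {
        # Host networking and IPC are commonly enabled for MPI/NCCL workloads.
        "hostNetwork": True,
        "hostIPC": True,
        "containers": [{
            "name": "trainer",
            "image": "example.com/trainer:latest",  # hypothetical image
            "command": ["sleep", "infinity"],
            "resources": {
                # Requesting all 8 GPUs means one pod occupies the whole node.
                "limits": {"nvidia.com/gpu": 8}
            },
        }],
    },
}

with open("train-worker-0.json", "w") as f:
    json.dump(pod, f, indent=2)
# Apply with: kubectl apply -f train-worker-0.json
```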
We have very little HTTPS traffic, with no need for A/B testing, blue/green, or canaries. Pods communicate directly with one another on their pod IP addresses with MPI via SSH, not service endpoints. Service “discovery” is limited; we just do a one-time lookup for which pods are participating in MPI at job startup time.
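A sketch of what that one-time lookup can look like with the official kubernetes Python client: list the pods belonging to the job by label, take their pod IPs, and write an MPI hostfile. The namespace, label selector, and slot count are assumptions for illustration:

```python
# One-time lookup of the pods participating in an MPI job, written out as a
# hostfile of pod IPs. Namespace, label selector, and slots are assumptions.
from kubernetes import client, config

config.load_kube_config()  # or config.load_incluster_config() inside the cluster
v1 = client.CoreV1Api()

pods = v1.list_namespaced_pod(
    namespace="training",            # hypothetical namespace
    label_selector="job=big-train",  # hypothetical job label
)

with open("hostfile", "w") as f:
    for pod in pods.items:
        if pod.status.phase == "Running" and pod.status.pod_ip:
            # MPI ranks reach each other over these pod IPs via SSH.
            f.write(f"{pod.status.pod_ip} slots=8\n")

# Launch with e.g.: mpirun --hostfile hostfile -np <world_size> python train.py
```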
- Code completion
- Semantic search (see the sketch below)
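For the semantic-search use case, the core mechanic is embedding the query and the documents into the same vector space and ranking by cosine similarity. In the sketch below, embed() is a deliberately crude placeholder (averaged random word vectors) standing in for whichever embedding model you actually use:

```python
# Minimal semantic-search sketch: embed documents and a query into the same
# vector space, then rank by cosine similarity. embed() is a placeholder,
# not a real embedding model.
import numpy as np

rng = np.random.default_rng(0)
DIM = 64
_word_vecs = {}

def embed(text: str) -> np.ndarray:
    """Placeholder embedding: unit-normalized mean of per-word random vectors."""
    vecs = []
    for w in text.lower().split():
        if w not in _word_vecs:
            _word_vecs[w] = rng.normal(size=DIM)
        vecs.append(_word_vecs[w])
    v = np.mean(vecs, axis=0)
    return v / np.linalg.norm(v)

docs = [
    "how to reset a forgotten password",
    "pricing plans and billing questions",
    "reset your password from the login screen",
]
doc_vecs = np.stack([embed(d) for d in docs])

query = "I forgot my password"
scores = doc_vecs @ embed(query)  # cosine similarity, since vectors are unit-norm
for i in np.argsort(-scores):
    print(f"{scores[i]:.3f}  {docs[i]}")
```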