ChatGPT Resources

Public Info

Blog Post

Business Context

OpenAI in 2019

Training Data

Models

My Twitter Thread Question on the Model
Language Models are Few-Shot Learners: GPT3
Model- reinforcement learning for language models

We trained this model using Reinforcement Learning from Human Feedback (RLHF), using the same methods as InstructGPT, but with slight differences in the data collection setup.

Infra

Azure
K8s Source here A large machine learning job spans many nodes and runs most efficiently when it has access to all of the hardware resources on each node. This allows GPUs to cross-communicate directly using NVLink, or GPUs to directly communicate with the NIC using GPUDirect. So for many of our workloads, a single pod occupies the entire node.

We have very little HTTPS traffic, with no need for A/B testing, blue/green, or canaries. Pods communicate directly with one another on their pod IP addresses with MPI via SSH, not service endpoints. Service “discovery” is limited; we just do a one-time lookup for which pods are participating in MPI at job startup time.
Terraform, Python, Chef, GPU workloads on 500+ node clusters

cedrickchee/chatgpt.md

ChatGPT Resources

Public Info

Business Context

Training Data

Models

Infra

My Attempts

wd021 commented Jul 10, 2025

Uh oh!