Skip to content

Instantly share code, notes, and snippets.

@AlanHowlett
Forked from veekaybee/chatgpt.md
Created December 12, 2022 16:02
Show Gist options
  • Save AlanHowlett/bfeb4057f82b50ecfd62cc5c04b3b18a to your computer and use it in GitHub Desktop.
Save AlanHowlett/bfeb4057f82b50ecfd62cc5c04b3b18a to your computer and use it in GitHub Desktop.
Everything I understand about chatgpt

ChatGPT Resources

Public Info

ChatGPT is a sibling model to InstructGPT, which is trained to follow an instruction in a prompt and provide a detailed response. We trained this model using Reinforcement Learning from Human Feedback (RLHF), using the same methods as InstructGPT, but with slight differences in the data collection setup. ChatGPT is fine-tuned from a model in the GPT-3.5 series

text-davinci-003 is an improvement on text-davinci-002 Screen Shot 2022-12-10 at 2 10 54 PM

Business Context

High-level overview:

Training Data

Screen Shot 2022-12-10 at 2 13 51 PM

The model was trained on

Screen Shot 2022-12-08 at 2 49 16 PM

Models

Model Evaluation

Market

Infra

A large machine learning job spans many nodes and runs most efficiently when it has access to all of the hardware resources on each node. This allows GPUs to cross-communicate directly using NVLink, or GPUs to directly communicate with the NIC using GPUDirect. So for many of our workloads, a single pod occupies the entire node.

We have very little HTTPS traffic, with no need for A/B testing, blue/green, or canaries. Pods communicate directly with one another on their pod IP addresses with MPI via SSH, not service endpoints. Service “discovery” is limited; we just do a one-time lookup for which pods are participating in MPI at job startup time.

Use-Cases

  • Code completion
  • Semantic search

My Attempts

Screen Shot 2022-12-08 at 11 45 43 AM

Screen Shot 2022-12-08 at 4 10 38 PM

Screen Shot 2022-12-08 at 4 11 07 PM

Screen Shot 2022-12-08 at 4 22 22 PM

Screen Shot 2022-12-08 at 4 23 07 PM

Screen Shot 2022-12-08 at 4 24 17 PM

Screen Shot 2022-12-10 at 10 18 14 AM

Screen Shot 2022-12-10 at 1 38 37 PM

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment