Meowu Meowu

Meowu / grpo_demo.py
Created February 11, 2025 03:57 — forked from willccbb/grpo_demo.py
GRPO Llama-1B
# train_grpo.py
import re
import torch
from datasets import load_dataset, Dataset
from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import LoraConfig
from trl import GRPOConfig, GRPOTrainer
# Load and prep dataset
Meowu / RELEASE-NOTES.md
Created July 26, 2022 03:18 — forked from andreasonny83/RELEASE-NOTES.md
Release Notes Template

Release Notes Template

Based off https://palantir.quip.com/pzRwAVr1bpzf

Pro-tip: look through the GitHub diff between the previous release and this one to see what has changed. The commit titles should give an outline of what happened.
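
As a rough illustration of the pro-tip, the commit titles between two releases can also be collected locally with `git log`. This is a minimal sketch, assuming both releases are tagged and that `git` is on the PATH; the helper names `commit_titles` and `as_outline` and the tag names in the usage comment are hypothetical, not part of the template.

```python
import subprocess


def commit_titles(prev_tag: str, new_tag: str, repo_dir: str = ".") -> list[str]:
    """Return commit subject lines reachable from new_tag but not prev_tag, oldest first."""
    result = subprocess.run(
        ["git", "log", "--reverse", "--pretty=format:%s", f"{prev_tag}..{new_tag}"],
        cwd=repo_dir,
        capture_output=True,
        text=True,
        check=True,  # raise if git fails, e.g. on an unknown tag
    )
    return [line for line in result.stdout.splitlines() if line.strip()]


def as_outline(titles: list[str]) -> str:
    """Format commit titles as a bulleted outline to start the release notes from."""
    return "\n".join(f"  • {title}" for title in titles)


# Hypothetical usage, with assumed tag names:
# print(as_outline(commit_titles("v1.2.0", "v1.3.0")))
```

The outline is only a starting point; commit titles still need to be grouped and rewritten for readers of the release notes.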

Upgrade Steps

  • List, as concretely as possible, any steps users have to take when they upgrade beyond just bumping the dependency.
  • Write pseudocode that highlights what code should change and how.