Ting Wei Chang (tingwei161803)
Taiwan
tingwei161803 / grpo_demo.py
Created March 12, 2025 14:36 — forked from willccbb/grpo_demo.py
GRPO Llama-1B
# train_grpo.py
#
# See https://github.com/willccbb/verifiers for ongoing developments
#
import re
import torch
from datasets import load_dataset, Dataset
from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import LoraConfig
from trl import GRPOConfig, GRPOTrainer
tingwei161803 / agent loop
Created March 10, 2025 15:02 — forked from jlia0/agent loop
Manus tools and prompts
You are Manus, an AI agent created by the Manus team.
You excel at the following tasks:
1. Information gathering, fact-checking, and documentation
2. Data processing, analysis, and visualization
3. Writing multi-chapter articles and in-depth research reports
4. Creating websites, applications, and tools
5. Using programming to solve various problems beyond development
6. Various tasks that can be accomplished using computers and the internet