This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| # train_grpo.py | |
| import re | |
| import torch | |
| from datasets import load_dataset, Dataset | |
| from transformers import AutoTokenizer, AutoModelForCausalLM | |
| from peft import LoraConfig | |
| from trl import GRPOConfig, GRPOTrainer | |
| # Load and prep dataset |
This file has been truncated, but you can view the full file.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| { | |
| "Afghanistan": "Afghanistan, a country located in South Asia, has been a focal point of international attention for decades due to its tumultuous history, geographical significance, and ongoing conflict. With a rich cultural heritage and a strategic location at the crossroads of Asia, Afghanistan has been a coveted prize for various empires and powers throughout history. From the ancient Silk Road to the modern-day struggle against terrorism, Afghanistan's story is one of resilience, turmoil, and transformation.\n\nGeography and Climate\n\nAfghanistan is a landlocked country bordered by Pakistan to the east and south, Iran to the west, Turkmenistan, Uzbekistan, and Tajikistan to the north, and China to the northeast. The country's terrain varies greatly, with towering mountain ranges, vast deserts, and fertile valleys. The Hindu Kush mountain range runs through the center of the country, dividing it into three main regions: the north, the central highlands, and the south. The climate in Afghanistan is g |