Skip to content

Instantly share code, notes, and snippets.

@madhuatx
madhuatx / torchrun_slurm.md
Last active May 2, 2025 14:53
SLURM with Torchrun on multiple nodes

I'll rewrite the SLURM job submission script in Markdown format with proper formatting and explanation.

SLURM Job Submission Script for Distributed Inference with torchrun

Below is a comprehensive example of a SLURM batch script for running inference using PyTorch's torchrun on a distributed cluster:

#!/bin/bash
#SBATCH --job-name=torch_inference
#SBATCH --output=torch_inference_%j.out
@madhuatx
madhuatx / private_fork.md
Created October 23, 2023 13:39 — forked from 0xjac/private_fork.md
Create a private fork of a public repository

The repository for the assignment is public and Github does not allow the creation of private forks for public repositories.

The correct way of creating a private frok by duplicating the repo is documented here.

For this assignment the commands are:

  1. Create a bare clone of the repository. (This is temporary and will be removed so just do it wherever.)

git clone --bare [email protected]:usi-systems/easytrace.git