msaroufim

🤖

Putting the finishing touches on my robot army

Mark Saroufim msaroufim

🤖

Putting the finishing touches on my robot army

CUDA uninЇsțållåțîön fāīłüřęđ. Płēȃšę čøñțàçț șūppørt før åššīštåñćē

977 followers · 0 following

Achievements

x3 x3 x3

Achievements

x3 x3 x3

Organizations

Lists (1)

Sort

✨ Inspiration

1 repository

Stars

johnrobinsn / claude-watch

A terminal UI dashboard for monitoring multiple Claude Code sessions in tmux

TypeScript 2 1 Updated Feb 2, 2026

changjonathanc / flex-nano-vllm

FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.

Python 334 20 Updated Nov 2, 2025

NVlabs / NVBit

304 26 Updated Sep 23, 2025

cornserve-ai / cornserve

Easy, Fast, and Scalable Multimodal AI

Python 107 6 Updated Feb 2, 2026

aisa-group / PostTrainBench

PostTrainBench measures how well CLI agents like Claude Code or Codex CLI can post-train base LLMs on a single H100 GPU in 10 hours

Python 129 13 Updated Feb 2, 2026

eligotts / legos

Python 21 Updated Jan 22, 2026

NVIDIA / pyxis

Container plugin for Slurm Workload Manager

C 412 39 Updated Jan 9, 2026

google / crosvm

The Chrome OS Virtual Machine Monitor - Mirror of https://chromium.googlesource.com/crosvm/crosvm/

Rust 1,141 136 Updated Feb 4, 2026

daytonaio / daytona

Daytona is a Secure and Elastic Infrastructure for Running AI-Generated Code

TypeScript 52,775 4,962 Updated Feb 4, 2026

wafer-ai / gpu-perf-engineering-resources

A curriculum for learning about gpu performance engineering, from scratch to what the frontier AI labs do

342 30 Updated Jan 13, 2026

facebookresearch / mosaic

Post processing library used to analyze memory snapshots

Python 19 6 Updated Jan 12, 2026

nlohmann / json

JSON for Modern C++

C++ 48,764 7,309 Updated Feb 3, 2026

google / gvisor

Application Kernel for Containers

Go 17,654 1,503 Updated Feb 4, 2026

HSA-Libraries / Bolt

Bolt is a C++ template library optimized for GPUs. Bolt provides high-performance library implementations for common algorithms such as scan, reduce, transform, and sort.

C++ 379 63 Updated Feb 11, 2016

alexzhang13 / rlm

General plug-and-play inference library for Recursive Language Models (RLMs), supporting various sandboxes.

Python 1,972 384 Updated Feb 1, 2026

NVIDIA / cuda-tile

CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-based computation patterns and optimizations targeting NVIDIA te…

MLIR 818 60 Updated Jan 14, 2026