Skip to content
View msaroufim's full-sized avatar
🤖
Putting the finishing touches on my robot army
🤖
Putting the finishing touches on my robot army

Organizations

@facebookresearch @pytorch @fairinternal @mlcommons @Hugging-Face-Supporter @meta-pytorch @llm-efficiency-challenge @gpu-mode

Block or report msaroufim

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A terminal UI dashboard for monitoring multiple Claude Code sessions in tmux

TypeScript 2 1 Updated Feb 2, 2026

FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.

Python 334 20 Updated Nov 2, 2025
304 26 Updated Sep 23, 2025

Easy, Fast, and Scalable Multimodal AI

Python 107 6 Updated Feb 2, 2026

PostTrainBench measures how well CLI agents like Claude Code or Codex CLI can post-train base LLMs on a single H100 GPU in 10 hours

Python 129 13 Updated Feb 2, 2026
Python 21 Updated Jan 22, 2026

Container plugin for Slurm Workload Manager

C 412 39 Updated Jan 9, 2026

The Chrome OS Virtual Machine Monitor - Mirror of https://chromium.googlesource.com/crosvm/crosvm/

Rust 1,141 136 Updated Feb 4, 2026

Daytona is a Secure and Elastic Infrastructure for Running AI-Generated Code

TypeScript 52,775 4,962 Updated Feb 4, 2026

A curriculum for learning about gpu performance engineering, from scratch to what the frontier AI labs do

342 30 Updated Jan 13, 2026

Post processing library used to analyze memory snapshots

Python 19 6 Updated Jan 12, 2026

JSON for Modern C++

C++ 48,764 7,309 Updated Feb 3, 2026

Application Kernel for Containers

Go 17,654 1,503 Updated Feb 4, 2026

Bolt is a C++ template library optimized for GPUs. Bolt provides high-performance library implementations for common algorithms such as scan, reduce, transform, and sort.

C++ 379 63 Updated Feb 11, 2016

General plug-and-play inference library for Recursive Language Models (RLMs), supporting various sandboxes.

Python 1,972 384 Updated Feb 1, 2026

CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-based computation patterns and optimizations targeting NVIDIA te…

MLIR 818 60 Updated Jan 14, 2026

Convert nvprof profiles into about:tracing compatible JSON files

Python 73 13 Updated Apr 9, 2021

A Python script to convert the output of NVIDIA Nsight Systems (in SQLite format) to JSON in Google Chrome Trace Event Format.

Python 49 6 Updated Aug 5, 2025

dictate anywhere in Linux

C 42 3 Updated Nov 9, 2025

Effortlessly migrate GitHub repositories to Codeberg! Seamlessly transfer project. Powered by Bash, curl, and jq.

Shell 57 5 Updated Dec 30, 2025

Code for the paper “Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling”

Python 119 6 Updated Feb 4, 2026

A good looking terminal emulator which mimics the old cathode display...

QML 25,069 950 Updated Jan 22, 2026

slime is an LLM post-training framework for RL Scaling.

Python 3,664 492 Updated Feb 3, 2026

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 831 96 Updated Feb 4, 2026

PyTorch-native post-training at scale

Python 611 84 Updated Feb 4, 2026

VLA-0: Building State-of-the-Art VLAs with Zero Modification

Python 436 22 Updated Jan 8, 2026

An early research stage expert-parallel load balancer for MoE models based on linear programming.

Python 495 33 Updated Nov 19, 2025

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 1,241 89 Updated Aug 28, 2025

Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.

Python 287 32 Updated Dec 30, 2025

https://github.com/eunomia-bpf homepage, documents and blogs

HTML 175 34 Updated Feb 3, 2026
Next