UranusSeven's starred repositories

A lightweight vLLM simulator for mocking out replicas.

Go 96 63 Updated Mar 5, 2026
Rust 20 6 Updated Mar 6, 2026

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 956 123 Updated Mar 7, 2026

slime is an LLM post-training framework for RL Scaling.

Python 4,609 610 Updated Mar 7, 2026

Open-source, secure environment with real-world tools for enterprise-grade agents.

MDX 11,171 793 Updated Mar 6, 2026

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

Python 753 60 Updated Aug 6, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,832 422 Updated Mar 7, 2026

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 662 36 Updated Mar 5, 2026

A Survey of Efficient Attention Methods: Hardware-efficient, Sparse, Compact, and Linear Attention

285 5 Updated Dec 1, 2025

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 2,113 286 Updated Mar 7, 2026

Python framework for creating, editing, and running Noisy Intermediate-Scale Quantum (NISQ) circuits.

Python 4,883 1,184 Updated Mar 7, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,882 2,055 Updated Jan 13, 2026

PennyLane is an open-source quantum software platform for quantum computing, quantum machine learning, and quantum chemistry. Create meaningful quantum algorithms, from inspiration to implementation.

Python 3,096 750 Updated Mar 7, 2026

A debugging and profiling tool that can trace and visualize python code execution

Python 7,569 471 Updated Feb 16, 2026

Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels

Python 5,330 471 Updated Mar 7, 2026

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 96,806 12,005 Updated Mar 7, 2026

A Quirky Assortment of CuTe Kernels

Python 846 87 Updated Mar 7, 2026

📄 Configuration files that enhance the Cursor AI editor experience with custom rules and behaviors

MDX 38,304 3,240 Updated Oct 24, 2025

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 2,149 182 Updated Feb 23, 2026

kernels, of the mega variety

Python 687 45 Updated Mar 6, 2026

A stand-alone implementation of several NumPy dtype extensions used in machine learning.

C++ 332 55 Updated Mar 3, 2026

DeepSeek-V3/R1 inference performance simulator

Jupyter Notebook 180 28 Updated Mar 27, 2025

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Python 3,422 477 Updated Mar 7, 2026

The official repository for the gem5 computer-system architecture simulator.

C++ 2,493 1,731 Updated Mar 6, 2026

A lightweight design for computation-communication overlap.

Python 223 14 Updated Jan 20, 2026

A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.

Python 123 15 Updated Dec 25, 2025

LLM training in simple, raw C/CUDA

Cuda 29,045 3,411 Updated Jun 26, 2025