Skip to content
View Vibashan's full-sized avatar
💥
💥

Block or report Vibashan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code for the Molmo2 Vision-Language Model

153 4 Updated Dec 16, 2025

AGENTS.md — a simple, open format for guiding coding agents

TypeScript 17,030 1,201 Updated Dec 19, 2025

Dream-VL and Dream-VLA, a diffusion VLM and a diffusion VLA.

Python 101 4 Updated Jan 14, 2026

InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation​

Jupyter Notebook 326 20 Updated Feb 3, 2026

NanoGPT (124M) in 2 minutes

Python 4,591 612 Updated Feb 1, 2026

Galaxea's first VLA release

Python 508 34 Updated Jan 17, 2026

World Modeling by Forecasting Vision Foundation Model Features

Jupyter Notebook 33 Updated Jan 7, 2026

Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations

22 Updated Dec 24, 2025

[CVPR 2025 Highlight] Truncated Diffusion Model for Real-Time End-to-End Autonomous Driving

Python 1,273 121 Updated Dec 8, 2025

[ICLR 2025 Oral] The official implementation of "Diffusion-Based Planning for Autonomous Driving with Flexible Guidance"

Python 857 126 Updated Oct 28, 2025

A framework for efficient model inference with omni-modality models

Python 2,654 395 Updated Feb 7, 2026

MiMo-Embodied

Python 349 13 Updated Nov 21, 2025

The WeightWatcher tool for predicting the accuracy of Deep Neural Networks

Python 1,718 145 Updated Dec 12, 2025

Devkit and documentation for the NVIDIA Physical AI Autonomous Vehicles Dataset

Python 278 24 Updated Nov 29, 2025

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 7,590 998 Updated Feb 3, 2026

Official Implementation of "MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation"

Python 289 8 Updated Jan 29, 2026

dLLM: Simple Diffusion Language Modeling

Python 1,713 171 Updated Feb 6, 2026

A character-level language diffusion model trained on Tiny Shakespeare

Python 850 81 Updated Jan 16, 2026

Code release for Ming-UniVision: Joint Image Understanding and Geneation with a Continuous Unified Tokenizer

Python 136 5 Updated Oct 14, 2025

[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding

Python 507 11 Updated Nov 14, 2025

A minimal implementation of DeepMind's Genie world model

Python 1,149 92 Updated Nov 22, 2025

RynnVLA-002: A Unified Vision-Language-Action and World Model

Python 875 50 Updated Dec 2, 2025

Training framework with a goal to explore the frontier of sample efficiency of small language models

Python 96 10 Updated Jan 25, 2026

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,753 65 Updated Jan 20, 2026

The best ChatGPT that $100 can buy.

Python 42,490 5,487 Updated Feb 6, 2026

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 3,122 238 Updated Sep 12, 2025

SEED-Voken: A Series of Powerful Visual Tokenizers

Python 992 40 Updated Nov 25, 2025

Implementation of MagViT2 Tokenizer in Pytorch

Python 661 34 Updated Jan 12, 2025

HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo

Python 1,777 186 Updated May 20, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,699 1,189 Updated Nov 21, 2025
Next