Starred repositories

The absolute trainer to light up AI agents.

Python 15,550 1,333 Updated Feb 28, 2026

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 84,763 9,812 Updated Mar 26, 2026

A version of verl to support diverse tool use

Python 931 78 Updated Mar 2, 2026

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 1,311 86 Updated Mar 25, 2026

Splices the SmolVLM2 vision head onto the Qwen3-0.6B model and fine-tunes the combined model.

Python 562 53 Updated Sep 8, 2025

TensorDict is a tensor container dedicated to PyTorch.

Python 1,018 111 Updated Mar 27, 2026

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Python 3,361 448 Updated Mar 27, 2026

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Python 4,285 620 Updated Mar 25, 2026

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 27,004 1,945 Updated Jan 9, 2026

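[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats. AI-based full-text bilingual translation of PDF documents that fully preserves the layout; supports services such as Google, DeepL, Ollama, and OpenAI; available via CLI, GUI, MCP, Docker, and Zotero.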

Python 32,522 2,930 Updated Mar 24, 2026

Fully open reproduction of DeepSeek-R1

Python 25,970 2,414 Updated Nov 24, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,896 379 Updated Mar 12, 2026

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 6,150 346 Updated Mar 27, 2026

A very simple GRPO implementation for reproducing R1-like LLM thinking.

Python 1,621 130 Updated Nov 21, 2025

🚀🚀 Train a small 64M-parameter GPT completely from scratch in just 2 hours! 🌏

Python 44,208 5,304 Updated Mar 27, 2026

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,728 674 Updated Mar 27, 2026

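"A Guide to Consuming Open-Source Large Models": a tutorial tailored for Chinese users on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large language models (MLLMs) in a Linux environment.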

Jupyter Notebook 29,317 2,883 Updated Mar 27, 2026

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,786 1,698 Updated Jan 30, 2026

Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models.

Python 6,653 587 Updated Oct 24, 2024

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 11,718 1,332 Updated Mar 26, 2026

Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw

Python 598 103 Updated Dec 6, 2024

A family of lightweight multimodal models.

Python 1,054 77 Updated Nov 18, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 55,693 9,494 Updated Nov 12, 2025

A Framework of Small-scale Large Multimodal Models

Python 968 100 Updated Mar 26, 2026

Reproductions of large-model-related algorithms, along with some study notes.

Python 3,167 424 Updated Mar 21, 2026

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 17,715 1,422 Updated Mar 27, 2026

Natural Language Processing Best Practices & Examples

Python 6,444 912 Updated Aug 30, 2022

✨✨Latest Advances on Multimodal Large Language Models

17,529 1,119 Updated Mar 20, 2026

Memray is a memory profiler for Python

Python 14,970 438 Updated Mar 27, 2026

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 89,358 13,641 Updated Mar 26, 2026