GeekTemo

Follow

GeekPower GeekTemo

Follow

7 followers · 18 following

Starred repositories

microsoft / agent-lightning

The absolute trainer to light up AI agents.

Python 15,550 1,333 Updated Feb 28, 2026

browser-use / browser-use

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 84,763 9,812 Updated Mar 26, 2026

TIGER-AI-Lab / verl-tool

A version of verl to support diverse tool use

Python 931 78 Updated Mar 2, 2026

AgentR1 / Agent-R1

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 1,311 86 Updated Mar 25, 2026

ShaohonChen / Qwen3-SmVL

将SmolVLM2的视觉头与Qwen3-0.6B模型进行了拼接微调

Python 562 53 Updated Sep 8, 2025

pytorch / tensordict

TensorDict is a pytorch dedicated tensor container.

Python 1,018 111 Updated Mar 27, 2026

pytorch / rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Python 3,361 448 Updated Mar 27, 2026

going-doer / Paper2Code

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Python 4,285 620 Updated Mar 25, 2026

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 27,004 1,945 Updated Jan 9, 2026

PDFMathTranslate / PDFMathTranslate

[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译，支持 Google/DeepL/Ollama/OpenAI 等服务，提供 CLI/GUI/MCP/Docker/Zotero

Python 32,522 2,930 Updated Mar 24, 2026

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,970 2,414 Updated Nov 24, 2025

om-ai-lab / VLM-R1

Solve Visual Understanding with Reinforced VLMs

Python 5,896 379 Updated Mar 12, 2026

datajuicer / data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 6,150 346 Updated Mar 27, 2026

lsdefine / simple_GRPO

A very simple GRPO implement for reproducing r1-like LLM thinking.

Python 1,621 130 Updated Nov 21, 2025

jingyaogong / minimind

🚀🚀 「大模型」2小时完全从0训练64M的小参数GPT！🌏 Train a 64M-parameter GPT from scratch in just 2h!

Python 44,208 5,304 Updated Mar 27, 2026

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,728 674 Updated Mar 27, 2026

datawhalechina / self-llm

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调（全参数/Lora）、部署国内外开源大模型（LLM）/多模态大模型（MLLM）教程

Jupyter Notebook 29,317 2,883 Updated Mar 27, 2026

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,786 1,698 Updated Jan 30, 2026

yangjianxin1 / Firefly

Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 6,653 587 Updated Oct 24, 2024

google / sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 11,718 1,332 Updated Mar 26, 2026

hkproj / pytorch-paligemma

Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw

Python 598 103 Updated Dec 6, 2024

BAAI-DCAI / Bunny

A family of lightweight multimodal models.

Python 1,054 77 Updated Nov 18, 2024

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 55,693 9,494 Updated Nov 12, 2025

TinyLLaVA / TinyLLaVA_Factory

A Framework of Small-scale Large Multimodal Models

Python 968 100 Updated Mar 26, 2026

wyf3 / llm_related

复现大模型相关算法及一些学习记录

Python 3,167 424 Updated Mar 21, 2026

openai / tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 17,715 1,422 Updated Mar 27, 2026

microsoft / nlp-recipes

Natural Language Processing Best Practices & Examples

Python 6,444 912 Updated Aug 30, 2022

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

17,529 1,119 Updated Mar 20, 2026

bloomberg / memray

Memray is a memory profiler for Python

Python 14,970 438 Updated Mar 27, 2026

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 89,358 13,641 Updated Mar 26, 2026

Starred topics

natural-language-processing

machine-learning-algorithms