Stars
Persist and reuse KV Cache to speed up your LLM.
Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!
Achieve state-of-the-art inference performance with modern accelerators on Kubernetes
A starter project template for Wechaty that works out of the box
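🤖 A WeChat bot built on WeChaty and AI services such as ChatGPT / Claude / Kimi / DeepSeek / Ollama. It can automatically reply to WeChat messages, manage WeChat groups and friends, detect zombie followers, and more...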
Video translation and dubbing tool powered by LLMs. The video translator offers 100 language translations and one-click full-process deployment. The video translation output is optimized for platfo…
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
Community-maintained hardware plugin for vLLM on Ascend
An LLM agent that conducts deep research (local and web) on any given topic and generates a long report with citations.
Official Implementation of FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acceleration
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous …
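This project aims to share the technical principles behind large models and hands-on experience with them (large-model engineering and putting large-model applications into production)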
Insight for Community Operations Management
Composable building blocks to build LLM Apps
A cross-platform, OpenGL terminal emulator.
High Performance ServiceMesh Data Plane Based on eBPF and Programmable Kernel
AgentScope: Agent-Oriented Programming for Building LLM Applications
Docker images for compiling static Rust binaries using musl-cross
Fast package resolver written in Rust (CDCL based SAT solving)
Mega is a Git-compatible, petabyte-scale monorepo engine for large-scale codebases, enabling atomic changes, consistent builds, and AI-native engineering workflows
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …
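"Building Large Language Model Applications: Application Development and Architecture Design" (《构筑大语言模型应用:应用开发与架构设计》): an open-source e-book about LLMs in real-world applications. It introduces the fundamentals and applications of large language models and how to build your own model, covering prompt writing, development, and management, what the best large language models can offer, and the patterns and architecture design of LLM application development.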
Large Language Model Text Generation Inference
Overview and tutorial of the LangChain Library
An introductory LLM tutorial for developers: the Chinese edition of Andrew Ng's large-model course series




