Skip to content
View TommyLike's full-sized avatar
🚑
building
🚑
building
  • Huawei
  • Sichuan,China

Block or report TommyLike

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Persist and reuse KV Cache to speedup your LLM.

Python 241 58 Updated Jan 22, 2026
76 4 Updated Nov 23, 2025

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

Python 8,829 695 Updated Jan 22, 2026

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,382 298 Updated Jan 22, 2026

A Starter Project Template for Wechaty works out-of-the-box

JavaScript 842 360 Updated May 10, 2024

🤖一个基于 WeChaty 结合 ChatGPT / Claude / Kimi / DeepSeek / Ollama等Ai服务实现的微信机器人 ,可以用来帮助你自动回复微信消息,或者管理微信群/好友,检测僵尸粉等...

JavaScript 9,651 1,133 Updated Jan 8, 2026

Video translation and dubbing tool powered by LLMs. The video translator offers 100 language translations and one-click full-process deployment. The video translation output is optimized for platfo…

Go 9,297 796 Updated Dec 8, 2025

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 14,373 1,350 Updated Oct 28, 2025

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

Jupyter Notebook 19,958 4,736 Updated Dec 17, 2025

Community maintained hardware plugin for vLLM on Ascend

Python 1,588 760 Updated Jan 22, 2026

An LLM agent that conducts deep research (local and web) on any given topic and generates a long report with citations.

Python 24,958 3,310 Updated Jan 21, 2026

Official Implementation of FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acceleration

Python 29 4 Updated Nov 22, 2025

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous …

Python 32,251 1,933 Updated Jan 6, 2026

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 22,907 2,660 Updated Dec 30, 2025

Insight for Community Operations Management

Python 1 Updated Nov 26, 2024

Composable building blocks to build LLM Apps

Python 8,243 1,250 Updated Jan 22, 2026

A cross-platform, OpenGL terminal emulator.

Rust 62,065 3,282 Updated Jan 14, 2026

High Performance ServiceMesh Data Plane Based on eBPF and Programmable Kernel

Go 700 142 Updated Jan 8, 2026

AgentScope: Agent-Oriented Programming for Building LLM Applications

Python 15,812 1,384 Updated Jan 21, 2026

Make emojis for slack using AI

Elixir 652 58 Updated Mar 28, 2024

Docker images for compiling static Rust binaries using musl-cross

Shell 724 78 Updated Jan 21, 2026

Fast package resolver written in Rust (CDCL based SAT solving)

Rust 206 26 Updated Jan 19, 2026

Mega is a Git-compatible, petabyte-scale monorepo engine for large-scale codebases, enabling atomic changes, consistent builds, and AI-native engineering workflows

TypeScript 387 119 Updated Jan 22, 2026

Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …

Python 38,874 3,722 Updated Jul 9, 2025

《构筑大语言模型应用:应用开发与架构设计》一本关于 LLM 在真实世界应用的开源电子书,介绍了大语言模型的基础知识和应用,以及如何构建自己的模型。其中包括Prompt的编写、开发和管理,探索最好的大语言模型能带来什么,以及LLM应用开发的模式和架构设计。

Rust 1,625 183 Updated Jan 23, 2024

OpenAI API Free Reverse Proxy

TypeScript 5,852 1,017 Updated Aug 23, 2024

Large Language Model Text Generation Inference

Python 10,735 1,253 Updated Jan 8, 2026

Overview and tutorial of the LangChain Library

Jupyter Notebook 7,365 2,044 Updated Aug 5, 2024

面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版

Jupyter Notebook 23,071 2,802 Updated Jun 12, 2025
Next