Pull requests: ModelCloud/GPTQModel

Pull requests list

WIP: compile foward
#1276 opened Feb 13, 2025 by CSY-ModelCloud Draft updated Feb 19, 2025
[MODEL] support minicpm-o 2.6
#1116 opened Jan 20, 2025 by ZX-ModelCloud updated Feb 19, 2025
tokenicer.save()
#1270 opened Feb 12, 2025 by CL-ModelCloud updated Feb 19, 2025
use second gpu for add_batch and self.H sum
#1194 opened Feb 1, 2025 by Qubitium Draft updated Feb 19, 2025
reduce memory
#1195 opened Feb 1, 2025 by Qubitium Draft updated Feb 19, 2025
add faster packing
#1464 opened Mar 14, 2025 by Qubitium Draft updated Mar 14, 2025
Llama 4 Support
#1508 opened Apr 6, 2025 by Qubitium Draft updated Apr 6, 2025
Google TPU Support
#1532 opened Apr 10, 2025 by Qubitium updated Apr 11, 2025
Fix v2 for MoE
#1548 opened Apr 17, 2025 by Qubitium Draft updated Apr 17, 2025
Mistral3 Support
#1563 opened Apr 29, 2025 by Qubitium Draft updated Apr 29, 2025
[KERNEL] machete
#1597 opened May 7, 2025 by LRL-ModelCloud Draft updated May 9, 2025
[MODEL] Intern vl2 support
#970 opened Dec 25, 2024 by ZX-ModelCloud Draft updated Jul 17, 2025