-
Notifications
You must be signed in to change notification settings - Fork 333
Pull requests: ModelTC/LightLLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: add fused moe shared-expert and add-rmsnorm optimization
#1353
opened Jun 15, 2026 by
blueswhen
Collaborator
Loading…
feat: enable prefill cudagraph by default
#1352
opened Jun 15, 2026 by
sufubao
Collaborator
Loading…
test(sampling_params): repair broken test collection and add verify() coverage
#1350
opened Jun 13, 2026 by
SuperMarioYL
Contributor
Loading…
perf(qwen3next): drop q/k/v/a/b contiguous copies in GDN fused_recurrent decode
#1349
opened Jun 13, 2026 by
sufubao
Collaborator
Loading…
fix(visualserver): contain visual worker failures
#1347
opened Jun 13, 2026 by
sufubao
Collaborator
Loading…
feat(qwen3_5_mtp): Qwen3.5 / Qwen3.5-MoE MTP speculative decoding
#1338
opened Jun 9, 2026 by
sufubao
Collaborator
Loading…
feat: add multi-platform support with ascend and maca
#1335
opened Jun 8, 2026 by
zhangts20
Loading…
feat: update disk cache params and benchmark_multiturn.py
#1333
opened Jun 8, 2026 by
blueswhen
Collaborator
Loading…
fix: replace pickle deserialization with RestrictedUnpickler in PD WebSocket endpoints (CVE-2026-26220)
#1306
opened May 11, 2026 by
nexadodigital
Loading…
Propagate FINISHED_ERROR from detokenization init failure
#1299
opened May 9, 2026 by
sufubao
Collaborator
Loading…
6 tasks
import flashqla and support cudagraph for gdn
#1292
opened May 6, 2026 by
WANDY666
Contributor
Loading…
ViT/multimodal token-budget admission + max_pixels clamp
#1290
opened May 6, 2026 by
sufubao
Collaborator
Loading…
3 of 5 tasks
Logging colorization + access middleware cleanup + windowed cache stats
#1289
opened May 6, 2026 by
sufubao
Collaborator
Loading…
6 tasks done
fix(api): forward extra_body.chat_template_kwargs on /v1/messages
#1276
opened Apr 18, 2026 by
sufubao
Collaborator
Loading…
2 of 3 tasks
Previous Next
ProTip!
Follow long discussions with comments:>50.