Skip to content

Pull requests: NVIDIA/Model-Optimizer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

OMNIML-5128 Capture Docker experiment id
#1840 opened Jun 27, 2026 by ChenhanYu Collaborator Loading…
Fix Nemotron-H PTQ failure on Transformers 5.x with --trust_remote_code (moe_latent_size AttributeError) cherry-pick-0.45.0 After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1839 opened Jun 26, 2026 by Fridah-nv Contributor Loading…
specdec(recipe): add MiniMax-M2.7-DFlash streaming multi-node pipeline
#1835 opened Jun 26, 2026 by yeyu-nvidia Contributor Loading…
3 tasks
Add quant+sparse attention for vLLM serving
#1832 opened Jun 25, 2026 by kaix-nv Contributor Draft
Fix weight-only prequant layernorm export
#1825 opened Jun 25, 2026 by meenchen Contributor Draft
Fix AutoQuantize causal LM score scaling
#1810 opened Jun 23, 2026 by realAsma Contributor Draft
Add NVFP4 Conv3d export for diffusers VAE (Wan 2.2)
#1809 opened Jun 23, 2026 by jingyu-ml Contributor Loading…
Support FP8 per block (weight + dynamic per token activation) export
#1807 opened Jun 23, 2026 by sugunav14 Contributor Loading…
MiniMax-M3 mixed MXFP8-base + NVFP4-experts PTQ export
#1806 opened Jun 23, 2026 by chadvoegele Contributor Loading…
Puzzletron tutorial fixes for runtime optimization
#1803 opened Jun 23, 2026 by grzegorz-k-karch Contributor Loading…
Add puzzletron eval skill
#1802 opened Jun 23, 2026 by danielkorzekwa Contributor Loading…
Support INT block scale learning
#1795 opened Jun 22, 2026 by realAsma Contributor Draft
[OMNIML-5060] cell_t0_d7
#1789 opened Jun 22, 2026 by ChenhanYu Collaborator Draft
[OMNIML-5084] cell_t0_d7
#1788 opened Jun 22, 2026 by ChenhanYu Collaborator Draft
Create adding_new_model_tutorial.md
#1784 opened Jun 22, 2026 by danielkorzekwa Contributor Loading…
Add: suppot trt-rtx-abi ep
#1783 opened Jun 22, 2026 by haoxiz-nvidia Contributor Loading…
Add: support input_shape_profile for trt-rtx ep
#1782 opened Jun 22, 2026 by haoxiz-nvidia Contributor Loading…
[Chore]: Update Dflash recipes to use dpace
#1775 opened Jun 20, 2026 by h-guo18 Contributor Draft
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.