Uh oh!

There was an error while loading. Please reload this page.

NVIDIA / Model-Optimizer Public

Notifications You must be signed in to change notification settings
Fork 459
Star 3k

Code
Issues 74
Pull requests 200
Actions
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Security and quality
Insights

Pull requests: NVIDIA/Model-Optimizer

Labels 33 Milestones 0

New pull request New

200 Open 1,242 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

OMNIML-5128 Capture Docker experiment id

#1840 opened Jun 27, 2026 by ChenhanYu Collaborator

Loading…

Fix Nemotron-H PTQ failure on Transformers 5.x with --trust_remote_code (moe_latent_size AttributeError) cherry-pick-0.45.0

After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc

#1839 opened Jun 26, 2026 by Fridah-nv Contributor

Loading…

specdec(recipe): add MiniMax-M2.7-DFlash streaming multi-node pipeline

#1835 opened Jun 26, 2026 by yeyu-nvidia Contributor

Loading…

3 tasks

feat(export): quant-aware reverse weight conversion for unified HF export

#1833 opened Jun 26, 2026 by Edwardf0t1 Contributor • Draft

Add quant+sparse attention for vLLM serving

#1832 opened Jun 25, 2026 by kaix-nv Contributor • Draft

Fix lm_eval_hf freezing issue on multi-gpu slurm interactive node

#1831 opened Jun 25, 2026 by danielkorzekwa Contributor

Loading…

[Refactor] Extract model specific logics in export lib

#1828 opened Jun 25, 2026 by h-guo18 Contributor • Draft

Add Qwen-Image DMD2 PTQ support; save quantizer state (amax) without weights

#1827 opened Jun 25, 2026 by jingyu-ml Contributor

Loading…

Fix weight-only prequant layernorm export

#1825 opened Jun 25, 2026 by meenchen Contributor • Draft

Emit VisualGen-compatible sparse_attention_config for diffusion skip-softmax export

#1816 opened Jun 24, 2026 by jingyu-ml Contributor

Loading…

Fix AutoQuantize causal LM score scaling

#1810 opened Jun 23, 2026 by realAsma Contributor • Draft

Add NVFP4 Conv3d export for diffusers VAE (Wan 2.2)

#1809 opened Jun 23, 2026 by jingyu-ml Contributor

Loading…

Support FP8 per block (weight + dynamic per token activation) export

#1807 opened Jun 23, 2026 by sugunav14 Contributor

Loading…

MiniMax-M3 mixed MXFP8-base + NVFP4-experts PTQ export

#1806 opened Jun 23, 2026 by chadvoegele Contributor

Loading…

Puzzletron tutorial fixes for runtime optimization

#1803 opened Jun 23, 2026 by grzegorz-k-karch Contributor

Loading…

Add puzzletron eval skill

#1802 opened Jun 23, 2026 by danielkorzekwa Contributor

Loading…

Support INT block scale learning

#1795 opened Jun 22, 2026 by realAsma Contributor • Draft

Add VLM pruning and PTQ with image-text calibration (Megatron-Bridge)

#1792 opened Jun 22, 2026 by kevalmorabia97 Collaborator

Loading…

[OMNIML-5060] cell_t0_d7

#1789 opened Jun 22, 2026 by ChenhanYu Collaborator • Draft

[OMNIML-5084] cell_t0_d7

#1788 opened Jun 22, 2026 by ChenhanYu Collaborator • Draft

Create adding_new_model_tutorial.md

#1784 opened Jun 22, 2026 by danielkorzekwa Contributor

Loading…

Add: suppot trt-rtx-abi ep

#1783 opened Jun 22, 2026 by haoxiz-nvidia Contributor

Loading…

Add: support input_shape_profile for trt-rtx ep

#1782 opened Jun 22, 2026 by haoxiz-nvidia Contributor

Loading…

Fix low_memory_mode meta-device crash on fused-MoE models

#1781 opened Jun 21, 2026 by abatilo

Loading…

[Chore]: Update Dflash recipes to use dpace

#1775 opened Jun 20, 2026 by h-guo18 Contributor • Draft

Previous 1 2 3 4 5 6 7 8 Next

Previous Next

ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!