Add MLX format export support for Apple Silicon and support VLM in AutoScheme #1732
Open
wenhuach21 wants to merge 37 commits into main
Conversation
- Support W2/W3/W4/W8 quantized model export to MLX format
- Compatible with mlx-lm for inference on Apple Silicon
- Handle cross-word bit packing for 3-bit quantization
- Flatten rope_parameters for mlx-lm compatibility
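The cross-word packing case arises because 32 is not divisible by 3, so a 3-bit value can straddle two 32-bit storage words. As an illustrative sketch only (not the PR's actual exporter code), contiguous little-endian 3-bit packing and unpacking might look like:

```python
def pack_3bit(values):
    """Pack 3-bit integers contiguously into 32-bit words.

    Because 32 is not divisible by 3, a value can straddle two words
    ("cross-word" packing): its low bits land at the end of one word
    and its high bits at the start of the next.
    """
    words = []
    acc = 0      # bit accumulator (arbitrary-precision int)
    nbits = 0    # number of valid bits currently in acc
    for v in values:
        assert 0 <= v < 8, "value must fit in 3 bits"
        acc |= v << nbits
        nbits += 3
        if nbits >= 32:
            words.append(acc & 0xFFFFFFFF)  # flush one full word
            acc >>= 32
            nbits -= 32
    if nbits:
        words.append(acc & 0xFFFFFFFF)      # flush the partial tail
    return words


def unpack_3bit(words, count):
    """Inverse of pack_3bit: recover `count` 3-bit values."""
    acc = 0
    nbits = 0
    out = []
    it = iter(words)
    for _ in range(count):
        while nbits < 3:                    # refill across word boundaries
            acc |= next(it) << nbits
            nbits += 32
        out.append(acc & 0x7)
        acc >>= 3
        nbits -= 3
    return out
```

The real exporter operates on tensors rather than Python lists, but the bit arithmetic is the essence of the cross-word case.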
Pull request overview
Note: Copilot was unable to run its full agentic suite in this review.
Adds MLX export + Apple Silicon inference support by introducing an MLX backend, an MLX-format exporter, and MLX-specific quantized linear layers, plus tests covering export/inference behavior (including mixed-bit and VLM config validation).
Changes:
- Introduces the mlx output format and an MLX exporter (export_to_mlx) that writes MLX-compatible config.json quantization blocks.
- Adds an MLX inference backend and QuantLinearMLX (including GPTQ→MLX post-init repacking on macOS).
- Adds pytest coverage for MLX/native + auto_round flows and helper paths for Qwen models.
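For context on the "MLX-compatible config.json quantization blocks": mlx-lm conventionally reads quantization settings from a `quantization` entry in config.json (global `bits`/`group_size`, with per-module overrides for mixed-bit models). The exact block this exporter emits is not shown in this thread; the following is a hypothetical sketch of that general shape, with illustrative layer names:

```python
import json

# Hypothetical mlx-lm-style quantization block for a mixed-bit model.
# Field names follow the common mlx-lm convention (global group_size/bits
# plus per-module overrides); they are not taken from this PR's output.
quantization = {
    "group_size": 64,
    "bits": 4,
    # example override: keep one projection at 8 bits
    "model.layers.0.self_attn.o_proj": {"group_size": 64, "bits": 8},
}
config = {"model_type": "qwen2", "quantization": quantization}
print(json.dumps(config, indent=2))
```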
Reviewed changes
Copilot reviewed 13 out of 14 changed files in this pull request and generated 7 comments.
Summary per file:
| File | Description |
|---|---|
| auto_round/export/export_to_mlx/export.py | Implements MLX packing + config.json generation (mixed-bit + VLM handling). |
| auto_round/formats.py | Registers mlx / auto_round:mlx formats and routes saving through the MLX exporter. |
| auto_round/inference/backend.py | Adds the mlx backend and OS-based backend filtering; adjusts backend requirements. |
| auto_round/inference/convert_model.py | Adds MPS device support and macOS GPTQ→MLX post-init conversion. |
| auto_round_extension/mlx/qlinear_mlx.py | Adds QuantLinearMLX with MLX-kernel forward + GPTQ→MLX repacking logic. |
| auto_round_extension/torch/qlinear_mlx.py | Adds a backward-compat shim to the new MLX module location. |
| auto_round/utils/common.py | Extends the supported format list with mlx and auto_round:mlx. |
| auto_round/schemes.py | Adds W5A16 and W6A16 preset schemes. |
| test/test_mlx/test_mlx_format.py | Adds a comprehensive MLX export/inference pytest suite (incl. mixed-bit + VLM config assertions). |
| test/helpers.py | Adds a new helper path variable for a Qwen3 VL 9B model. |
| test_mlx_export.py | Adds a standalone MLX export test script. |
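The PR summary also mentions flattening rope_parameters for mlx-lm compatibility: some Hugging Face configs nest rope settings under a `rope_parameters` key, while mlx-lm generally reads fields such as `rope_theta` from the top level of the config. A minimal sketch of what such a flattening step could look like (key names are illustrative, not taken from the PR's code):

```python
def flatten_rope_parameters(config: dict) -> dict:
    """Hoist nested rope settings to the top level of the config.

    If the config nests rope settings under "rope_parameters", copy each
    entry to the top level (without overwriting existing keys) and drop
    the nested dict, so consumers that expect flat keys can read them.
    """
    cfg = dict(config)  # shallow copy; leave the input untouched
    rope = cfg.pop("rope_parameters", None)
    if isinstance(rope, dict):
        for key, value in rope.items():
            cfg.setdefault(key, value)
    return cfg
```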
Comments suppressed due to low confidence (1)
auto_round/inference/backend.py:1
- The auto_awq:gemm backend no longer declares its dependency requirement, but dynamic_import_inference_linear(...) still imports awq.modules.linear.WQLinear_GEMM for AWQ backends. Without requirements=["autoawq"] (or the correct package name used in your environment), backend selection may succeed and then fail at runtime with an ImportError. Re-add an explicit requirement for the AWQ package so compatibility checks prevent selecting this backend when the dependency is missing.
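The failure mode this comment describes, selection succeeding and the import failing only at dispatch time, can be illustrated with a toy registry (not auto_round's actual backend API) that gates selection on declared requirements:

```python
import importlib.util

# Toy backend registry: each backend declares the packages it needs.
# If "requirements" is omitted, availability checks pass vacuously and
# the ImportError surfaces later, at kernel import time.
BACKENDS = {
    "auto_awq:gemm": {"requirements": ["awq"]},  # dependency declared up front
    "mlx": {"requirements": ["mlx"]},
}


def is_available(name: str) -> bool:
    """A backend is selectable only if all declared requirements import."""
    reqs = BACKENDS[name].get("requirements", [])
    return all(importlib.util.find_spec(r) is not None for r in reqs)
```

With an empty requirements list, `is_available` returns True even when the kernel package is absent, which is exactly the gap the review comment flags.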
wenhuach21 (Author): The visual module's grad is 0 in AutoScheme.
hshen14 approved these changes on Apr 24, 2026
Description
Tested with qwen3.5-4b and qwen3-0.6b. Since there are no devices available for testing, the out-of-the-box (OOB) support may not be sufficient.
Type of Change
Related Issues
Fixes or relates to #
Checklist Before Submitting