@nastya236 commented Jan 20, 2026

  1. Add a per-tensor scale for nvfp4 quantization on CUDA and CPU.

`qqmm`, `quantize`, and `dequantize` now take an optional 1D float32 array (`global_scale`) when `mode == "nvfp4"`.

The tensor-wide scale helps with small inputs:

```python
import mlx.core as mx

x = mx.random.uniform(shape=(2, 16)) / 1e5

# Without a global scale, the block scales underflow to zero:
xq_ns, scales_ns = mx.quantize(x, mode="nvfp4")

# With a per-tensor scale, the block scales stay representable:
global_scale = mx.absmax(x).astype(mx.float32)
xq_s, scales_s = mx.quantize(x, mode="nvfp4", global_scale=global_scale)

print(mx.allclose(scales_ns, mx.zeros_like(scales_ns)))  # True
print(mx.allclose(scales_s, mx.zeros_like(scales_s)))    # False
```
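To make the underflow concrete, here is a minimal NumPy sketch assuming the usual two-level NVFP4 scheme (FP4 E2M1 values with a max magnitude of 6.0, FP8 E4M3 block scales with a smallest positive subnormal of 2^-9). The constants and the `block_scale` helper are illustrative and are not the MLX implementation:

```python
import numpy as np

# Illustrative sketch (not the MLX kernel) of why tiny inputs need a
# per-tensor scale in nvfp4. Assumed format parameters:
FP4_MAX = 6.0            # max magnitude of FP4 E2M1
FP8_MAX = 448.0          # max magnitude of FP8 E4M3
FP8_MIN_SUBNORMAL = 2.0 ** -9

def block_scale(block_amax, global_scale=None):
    """Decoded scale a dequantizer would see for one block of values."""
    if global_scale is None:
        s = block_amax / FP4_MAX
        # Underflows to zero once it drops below what FP8 can represent.
        return s if s >= FP8_MIN_SUBNORMAL else 0.0
    # Two-level scheme: encode the block scale relative to a
    # tensor-wide scale derived from the global amax.
    tensor_scale = global_scale / (FP4_MAX * FP8_MAX)
    s = block_amax / FP4_MAX / tensor_scale
    s = s if s >= FP8_MIN_SUBNORMAL else 0.0
    return s * tensor_scale

amax = 1e-5                     # typical magnitude of x / 1e5 above
print(block_scale(amax))        # 0.0 -> the whole block dequantizes to zero
print(block_scale(amax, amax))  # non-zero -> the values survive
```

Only the underflow behavior is modeled here; the real encoder also rounds the block scale to an FP8 value.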
  2. `AbsMax` reduction type and `mx.absmax` op

For nvfp4 training we will compute the absolute maximum (amax) often, so this adds a new reduction type that applies `abs` inside the `all_reduce` kernel instead of materializing `|x|` first. There may be a better way to do this.

```
x.shape = (4 * 4096, 11008)
mx.absmax(x):   0.000166 s
x.abs().max():  0.000284 s
```

TODO: we probably want to support `global_scale` in Metal as well, but that requires changing all the quantized operations.

@nastya236 closed this Jan 20, 2026
@nastya236 reopened this Jan 20, 2026
@nastya236 changed the title Tensor scale nvfp4 [WIP] Tensor scale nvfp4 Jan 20, 2026
@nastya236 changed the title [WIP] Tensor scale nvfp4 Tensor scale nvfp4 Jan 23, 2026
@nastya236 marked this pull request as ready for review January 23, 2026 22:35