Conversation

@jfsantos
Contributor

Optimizes grouped convolutions by pre-computing weight block indices and unrolling loops for common group counts. Also adds a performance benchmark for Conv1D and Conv1x1.

Other potential updates (still not implemented):

  • Store weights as block-diagonal sparse matrix, do one matmul instead of G matmuls
  • Templated convolutions with compile-time static shapes to leverage specific compile-time optimizations
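The block-diagonal idea in the first bullet can be sketched as follows. Names and memory layouts here are illustrative, not the repo's actual API: scattering the G per-group weight blocks onto the diagonal of a full (C_out × C_in) matrix lets a single dense matmul compute all groups at once, at the cost of multiplying the off-diagonal zeros.

```cpp
#include <cassert>
#include <vector>

// Naive dense matmul: (m x k) * (k x n) -> (m x n).
// A real implementation would call into BLAS here.
std::vector<float> matmul(const std::vector<float>& A, const std::vector<float>& B,
                          int m, int k, int n) {
    std::vector<float> C(m * n, 0.0f);
    for (int i = 0; i < m; ++i)
        for (int p = 0; p < k; ++p)
            for (int j = 0; j < n; ++j)
                C[i * n + j] += A[i * k + p] * B[p * n + j];
    return C;
}

// Reference: grouped 1x1 conv as G small per-group products.
// w holds G blocks of shape (cout/G x cin/G), x is (cin x T).
std::vector<float> conv1x1_grouped(const std::vector<float>& w,
                                   const std::vector<float>& x,
                                   int cin, int cout, int G, int T) {
    int ci = cin / G, co = cout / G;
    std::vector<float> y(cout * T, 0.0f);
    for (int g = 0; g < G; ++g)
        for (int i = 0; i < co; ++i)
            for (int p = 0; p < ci; ++p)
                for (int t = 0; t < T; ++t)
                    y[(g * co + i) * T + t] +=
                        w[(g * co + i) * ci + p] * x[(g * ci + p) * T + t];
    return y;
}

// Same conv via one (cout x cin) block-diagonal matrix and a single matmul.
std::vector<float> conv1x1_blockdiag(const std::vector<float>& w,
                                     const std::vector<float>& x,
                                     int cin, int cout, int G, int T) {
    int ci = cin / G, co = cout / G;
    // Mostly zeros; only the G diagonal blocks are populated.
    std::vector<float> W(cout * cin, 0.0f);
    for (int g = 0; g < G; ++g)
        for (int i = 0; i < co; ++i)
            for (int p = 0; p < ci; ++p)
                W[(g * co + i) * cin + (g * ci + p)] = w[(g * co + i) * ci + p];
    return matmul(W, x, cout, cin, T);
}
```

The trade-off is visible in the structure: one big GEMM gives BLAS more work to schedule, but (G - 1)/G of the multiplications are against zeros, which is why this only pays off when the matmul is large enough to amortize them.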

João Felipe Santos and others added 4 commits January 28, 2026 12:49
…chmarking tool for convolution performance.
… for Conv1D

Conv1x1: Use explicit group loop with groups=1 fast path. For small channel
counts (2-8), this avoids the overhead of zero multiplications in block-diagonal
matrices that BLAS cannot optimize efficiently.

Conv1D: Keep block-diagonal approach (single matmul per kernel position) which
shows 1.5-1.9x speedup for grouped convolutions. The multiple kernel positions
amortize the overhead, making this approach beneficial.
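For reference, the Conv1x1 structure described above might look like the sketch below; names are hypothetical, and the inner `gemm` stands in for whatever BLAS call the implementation actually uses. With groups == 1 it degenerates to a single dense matmul with no group bookkeeping, and for G > 1 each group gets its own small matmul, so no zero blocks are ever touched.

```cpp
#include <cassert>
#include <vector>

// Stand-in for a BLAS gemm: accumulates (m x k) * (k x n) into C,
// which the caller must zero-initialize.
void gemm(const float* A, const float* B, float* C, int m, int k, int n) {
    for (int i = 0; i < m; ++i)
        for (int p = 0; p < k; ++p)
            for (int j = 0; j < n; ++j)
                C[i * n + j] += A[i * k + p] * B[p * n + j];
}

// Conv1x1 with an explicit group loop and a groups == 1 fast path.
// w: G blocks of (cout/G x cin/G), x: (cin x T), y: (cout x T), pre-zeroed.
void conv1x1(const float* w, const float* x, float* y,
             int cin, int cout, int G, int T) {
    if (G == 1) {
        // Fast path: one dense matmul, no per-group offsets.
        gemm(w, x, y, cout, cin, T);
        return;
    }
    int ci = cin / G, co = cout / G;
    for (int g = 0; g < G; ++g)
        // One small matmul per group; input/output/weight blocks are
        // contiguous, so each group is just a pointer offset.
        gemm(w + g * co * ci, x + g * ci * T, y + g * co * T, co, ci, T);
}
```

For small channel counts the per-group matmuls are tiny, which is exactly the regime where the commit message reports that the block-diagonal form loses: the zero multiplications dominate and BLAS cannot skip them.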

Removed pre-computed GroupBlock structs as they are no longer needed with
these simplified implementations.

Updated benchmark tool to test channels 2-8 for detailed comparison.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>