Skip to content

[WAN] Use different sharding strategy for self and cross attention.

3866671
Select commit
Loading
Failed to load commit list.
Open

[WAN] Use different sharding strategy for self and cross attention. #250

[WAN] Use different sharding strategy for self and cross attention.
3866671
Select commit
Loading
Failed to load commit list.
Google CLA / cla/google succeeded Sep 18, 2025 in 2s

✅ All contributors are covered under a CLA with Google

See https://cla.developers.google.com/ for more info about Google's Contributor License Agreement (CLA).

ℹ️ Googlers: Go here to view more details and manage scans for this pull request.

Details

The following contributors were found for this pull request:

3866671 Author: @hyeygit <hy****t​@gmail.com>

(Only the first commit for a unique contributor is listed.)