
All gather and reduce scatter on SM80 #3

Merged
merged 1 commit into from
Jun 6, 2024

@zheng-ningxin (Collaborator) commented Jun 6, 2024

The current implementation supports two operations on the SM80 architecture:

  1. Allgather followed by GEMM (General Matrix-Matrix Multiplication)
  2. GEMM followed by Reduce-Scatter
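For readers unfamiliar with the two patterns, here is a minimal single-process sketch of what each one computes when run unfused. It is not code from this PR: ranks are simulated as list entries, and the helper names (`matmul`, `all_gather`, `reduce_scatter`, `allgather_gemm`, `gemm_reduce_scatter`) and the sharding layout (A sharded by rows for the first pattern, A and B sharded along K for the second) are assumptions chosen for illustration.

```python
# Single-process simulation; each "rank" is one entry of a Python list.
# All names and the sharding layout are hypothetical, not taken from the PR.

def matmul(a, b):
    """Plain matrix product of two lists of lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def all_gather(shards):
    """Every rank ends up with the row-wise concatenation of all shards."""
    full = [row for shard in shards for row in shard]
    return [full for _ in shards]

def reduce_scatter(partials):
    """Sum the per-rank matrices elementwise, then hand rank r the r-th row block.

    Assumes the number of rows is divisible by the number of ranks.
    """
    world = len(partials)
    rows, cols = len(partials[0]), len(partials[0][0])
    total = [[sum(p[i][j] for p in partials) for j in range(cols)] for i in range(rows)]
    block = rows // world
    return [total[r * block:(r + 1) * block] for r in range(world)]

def allgather_gemm(a_shards, b_local):
    """Pattern 1: rank r computes all_gather(A) @ B_r."""
    gathered = all_gather(a_shards)
    return [matmul(gathered[r], b_local[r]) for r in range(len(a_shards))]

def gemm_reduce_scatter(a_local, b_local):
    """Pattern 2: rank r computes A_r @ B_r, then the partial sums are reduce-scattered."""
    partials = [matmul(a, b) for a, b in zip(a_local, b_local)]
    return reduce_scatter(partials)
```

With two simulated ranks, `allgather_gemm` takes one row shard of A per rank plus each rank's own B, and `gemm_reduce_scatter` takes one K-slice of A and B per rank and returns each rank's row block of the summed product.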

The fused operations perform better than invoking the GEMM and the communication operation separately. This optimization is crucial for high-performance computing workloads, especially LLM training and inference.
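One way to see why fusing can help: in the Allgather + GEMM case, each block of output rows depends on only one shard of A, so the GEMM on a shard can start as soon as that shard arrives instead of waiting for the whole gather. The sketch below illustrates that property in a single process. It makes no claim about how this PR implements the fusion; the function names and the ring arrival order are assumptions.

```python
# Illustration only: shows that per-shard GEMMs reproduce the unfused result.
# Function names and the ring arrival order are hypothetical.

def matmul(a, b):
    """Plain matrix product of two lists of lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def unfused_allgather_gemm(a_shards, b_local):
    """Wait for every shard of A, then run one large GEMM."""
    gathered = [row for shard in a_shards for row in shard]
    return matmul(gathered, b_local)

def chunked_allgather_gemm(a_shards, b_local, rank):
    """Run one small GEMM per shard in the order shards would arrive.

    The local shard is processed first, then the others in ring order.
    On real hardware each GEMM could overlap with the transfer of the
    next shard; here the loop only demonstrates that the results match.
    """
    world = len(a_shards)
    out_blocks = [None] * world
    for step in range(world):
        src = (rank + step) % world          # shard assumed to arrive at this step
        out_blocks[src] = matmul(a_shards[src], b_local)
    return [row for block in out_blocks for row in block]
```

For any rank, `chunked_allgather_gemm(a_shards, b_local, rank)` returns the same matrix as `unfused_allgather_gemm(a_shards, b_local)`; only the order in which the row blocks are produced differs.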

We sincerely appreciate all contributors, including but not limited to @kongroo @wenlei-bao @houqi @Meteorix @liwenchangbdbz @ZihengJiang @eric-haibin-lin.

@zheng-ningxin requested review from liwenchangbdbz and wenlei-bao and removed request for liwenchangbdbz June 6, 2024 09:46
…ecture:

1. Allgather followed by GEMM (General Matrix-Matrix Multiplication)
2. GEMM followed by Reduce-Scatter

The fused operations demonstrate improved performance compared to invoking GEMM
and communication operations separately. This optimization is crucial for high-performance
computing tasks, especially for LLM training or inference.

Co-authored-by: Chengquan Jiang <[email protected]>
Co-authored-by: Wenlei Bao <[email protected]>
Co-authored-by: Ningxin Zheng <[email protected]>
Co-authored-by: Qi Hou <[email protected]>
Co-authored-by: Ziheng Jiang <[email protected]>
Co-authored-by: Xin Liu <[email protected]>
Co-authored-by: Liwen Chang <[email protected]>
Co-authored-by: Haibin Lin <[email protected]>