Performance Issues with MPIRun Due to Virtual Network Interfaces #13161

Open
GeofferyGeng opened this issue Mar 25, 2025 · 1 comment

@GeofferyGeng

Thank you for taking the time to submit an issue!

Background information

What version of Open MPI are you using?

v4.1.7rc1

Describe how Open MPI was installed

installed by MLNX_OFED

Please describe the system on which you are running

  • Operating system/version: Ubuntu 22.04
  • Computer hardware: Intel(R) Xeon(R) Platinum 8480+
  • Network type: Ethernet and Mellanox

Details of the problem

I have a server with a single physical network card on which more than 200 virtual network interfaces have been created. This causes significant delays with mpirun: the process hangs for a long time. With UCX debug logging enabled, I found that most of the delay occurs around the bridge network interfaces.

Is there a solution for this issue? Any recommendations on how to optimize or configure the network interfaces to improve the performance of mpirun? Thank you!

UCX log

[1742871925.548610] [pod-hpc-02:1702645:0]       tcp_iface.c:945  UCX  DEBUG filtered out bridge device virbr0
[1742872077.918760] [pod-hpc-02:1702645:0]       tcp_iface.c:945  UCX  DEBUG filtered out bridge device wlan
@bosilca
Member

bosilca commented Mar 25, 2025

UCX_NET_DEVICES is your friend here. Set it to the interface you do intend to use, and this issue shall go away.
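
For anyone landing here with the same setup, here is a rough sketch of one way to apply that suggestion when there are hundreds of interfaces to sift through. It is not from the Open MPI or UCX maintainers. It assumes a Linux host where sysfs is mounted at /sys, and it uses a heuristic: an interface is treated as a candidate only if it has a `device` entry and no `bridge` entry under /sys/class/net. The helper names `physical_interfaces` and `mpirun_command`, the process count, and `./my_mpi_app` are all hypothetical. Note that SR-IOV virtual functions can also have a `device` entry, so the list may still need trimming by hand, and for RDMA devices UCX usually expects names of the form `mlx5_0:1` rather than the netdev name.

```python
import os
import shlex


def physical_interfaces(sysfs_net="/sys/class/net"):
    """Heuristic: list interfaces that have a backing device and are not bridges."""
    names = []
    for name in sorted(os.listdir(sysfs_net)):
        path = os.path.join(sysfs_net, name)
        # Bridge devices (e.g. virbr0) expose a "bridge" directory.
        if os.path.isdir(os.path.join(path, "bridge")):
            continue
        # Purely virtual interfaces (veth, lo, ...) have no "device" entry.
        if not os.path.exists(os.path.join(path, "device")):
            continue
        names.append(name)
    return names


def mpirun_command(devices, nprocs, program):
    """Build an mpirun argument list that exports UCX_NET_DEVICES to all ranks."""
    return [
        "mpirun",
        "-np", str(nprocs),
        "-x", "UCX_NET_DEVICES=" + ",".join(devices),
        program,
    ]


if __name__ == "__main__":
    candidates = physical_interfaces()
    print("candidate interfaces:", candidates)
    if candidates:
        # Assumption: the first candidate is the one you want; check before using.
        cmd = mpirun_command(candidates[:1], 4, "./my_mpi_app")
        print(shlex.join(cmd))
```

The script only prints a command line; nothing is launched. Review the candidate list, pick the interface you actually want, and run the printed command (or simply `export UCX_NET_DEVICES=<name>` before calling mpirun).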
