perf: avoid redundant dp.compute() across MPI ranks by curry-sthuc · Pull Request #7524 · deepmodeling/abacus-develop

curry-sthuc · 2026-06-26T08:37:02Z

Problem

In ESolver_DP::runner(), every MPI rank calls dp.compute() independently with identical input, producing the same energy, forces, and virial. With N ranks, this results in N-fold redundant computation, and multiple concurrent deepmd inference calls cause CPU contention.

Solution

Only rank 0 calls dp.compute(), then broadcasts results via MPI_Bcast. Implemented with #ifdef __MPI guards — serial builds are unaffected.

Performance (864 Al atoms, 100 MD steps)

np	Before	After
1	34s	33s
2	39s	33s
4	55s	23s
8	102s	28s

Checklist

No linked issue (standalone optimization)
No new tests needed — computation results are mathematically identical
No behavioral changes
No core module changes

Only rank 0 calls dp.compute() and broadcasts results to all ranks via MPI_Bcast, avoiding N-fold redundant computation and CPU contention.

Only rank 0 needs coord and cell vectors for dp.compute(). Non-rank-0 ranks receive results via MPI_Bcast and never use these vectors.

curry-sthuc added 2 commits June 26, 2026 16:23

perf: avoid redundant dp.compute() across MPI ranks

305e2b5

Only rank 0 calls dp.compute() and broadcasts results to all ranks via MPI_Bcast, avoiding N-fold redundant computation and CPU contention.

perf: skip coord/cell construction on non-rank-0 MPI ranks

3a89f1b

Only rank 0 needs coord and cell vectors for dp.compute(). Non-rank-0 ranks receive results via MPI_Bcast and never use these vectors.

mohanchen requested a review from 19hello June 27, 2026 07:46

mohanchen assigned 19hello Jun 27, 2026

mohanchen added the project_learning label Jun 27, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

perf: avoid redundant dp.compute() across MPI ranks#7524

perf: avoid redundant dp.compute() across MPI ranks#7524
curry-sthuc wants to merge 2 commits into
deepmodeling:developfrom
curry-sthuc:feature/mpi-openmp-cuda-accel

curry-sthuc commented Jun 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

curry-sthuc commented Jun 26, 2026

Problem

Solution

Performance (864 Al atoms, 100 MD steps)

Checklist

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants