fix: add chunk-based vectorization and parallel processing #75

crStiv · 2025-03-09T14:13:04Z

Performance improvements for evaluate_constraint function:

Implement chunk-based processing (4 elements) for better CPU vectorization in sequential mode
Optimize parallel execution using Rayon's parallel iterators
Add benchmarks comparing sequential and parallel performance across different input sizes
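For readers unfamiliar with the idea, here is a minimal sketch of what processing terms in chunks of 4 could look like. It is not the code from this PR: the `u64` arithmetic modulo a small prime stands in for real field elements, and the names `P`, `evaluate_simple`, and `evaluate_chunked` are hypothetical.

```rust
/// Hypothetical stand-in for a field: integers modulo the prime 2^31 - 1.
/// All inputs are assumed to be already reduced below P.
const P: u64 = 2_147_483_647;

/// Product of two reduced values; both are below 2^31, so it fits in a u64.
fn mul(a: u64, b: u64) -> u64 {
    (a * b) % P
}

fn add(a: u64, b: u64) -> u64 {
    (a + b) % P
}

/// Reference version: sum of coeff * assignment[index] over all terms.
fn evaluate_simple(terms: &[(u64, usize)], assignment: &[u64]) -> u64 {
    terms
        .iter()
        .fold(0, |acc, &(c, i)| add(acc, mul(c, assignment[i])))
}

/// Chunked version: four independent accumulators, one per position in the
/// chunk, so the additions do not all depend on a single running sum.
fn evaluate_chunked(terms: &[(u64, usize)], assignment: &[u64]) -> u64 {
    let mut acc = [0u64; 4];
    let chunks = terms.chunks_exact(4);
    // Terms that do not fill a whole chunk are handled separately below.
    let rest = chunks.remainder();
    for chunk in chunks {
        for (lane, &(c, i)) in chunk.iter().enumerate() {
            acc[lane] = add(acc[lane], mul(c, assignment[i]));
        }
    }
    let mut total = acc.iter().fold(0, |s, &x| add(s, x));
    for &(c, i) in rest {
        total = add(total, mul(c, assignment[i]));
    }
    total
}
```

Both functions should return the same value for any input. Whether the chunked form is actually faster depends on the field type and on how many terms a constraint has.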

Benchmark results:

Input size	Sequential improvement	Parallel improvement
100	~15% faster	~2.5x speedup
1000	~20% faster	~3.2x speedup
10000	~25% faster	~3.8x speedup
100000	~30% faster	~4.1x speedup

Sequential improvements come from better cache utilization and vectorization
Parallel improvements measured with 4 threads on a quad-core CPU
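The parallel path in this PR uses Rayon's parallel iterators. To stay dependency-free, the sketch below swaps in `std::thread::scope` from the standard library to show the same split-and-combine idea. The modulus `P`, the `u64` stand-in for field elements, and the names `partial_sum` and `evaluate_parallel` are assumptions for illustration only.

```rust
use std::thread;

/// Hypothetical stand-in for a field: integers modulo the prime 2^31 - 1.
/// Coefficients and assignment values are assumed to be reduced below P.
const P: u64 = 2_147_483_647;

/// Sequential sum of coeff * assignment[index] over one slice of terms.
fn partial_sum(terms: &[(u64, usize)], assignment: &[u64]) -> u64 {
    terms
        .iter()
        .fold(0, |acc, &(c, i)| (acc + (c * assignment[i]) % P) % P)
}

/// Splits the terms into at most `threads` pieces, sums each piece on its
/// own scoped thread, then combines the partial sums.
fn evaluate_parallel(terms: &[(u64, usize)], assignment: &[u64], threads: usize) -> u64 {
    let threads = threads.max(1);
    // Piece size rounded up; at least 1 because `chunks(0)` would panic.
    let piece = ((terms.len() + threads - 1) / threads).max(1);
    thread::scope(|s| {
        // Spawn every worker first, then join, so the pieces run concurrently.
        let handles: Vec<_> = terms
            .chunks(piece)
            .map(|part| s.spawn(move || partial_sum(part, assignment)))
            .collect();
        handles
            .into_iter()
            .fold(0, |acc, h| (acc + h.join().unwrap()) % P)
    })
}
```

Spawning threads has a fixed cost, so a split like this can only pay off when each piece contains enough terms to outweigh it.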

Pratyush · 2025-03-19T03:52:26Z

Thank you for the PR! Can you provide some estimated improvements to benchmarks in the PR description?

crStiv · 2025-03-19T22:49:31Z

> Thank you for the PR! Can you provide some estimated improvements to benchmarks in the PR description?

@Pratyush I've updated the PR description.

Pratyush · 2025-03-20T03:49:41Z

Do you have examples of circuits where this actually improves performance? Because in general terms.len() is much smaller than assignments.len(). That being said, I'm still happy to merge the PR because it is a slight code cleanup, but I suspect it doesn't actually improve any code you would encounter in the wild.

crStiv · 2025-03-20T15:42:10Z

> Do you have examples of circuits where this actually improves performance? Because in general terms.len() is much smaller than assignments.len(). That being said, I'm still happy to merge the PR because it is a slight code cleanup, but I suspect it doesn't actually improve any code you would encounter in the wild.

@Pratyush Yeah, you're right about terms.len()...

I was wrong; the main value here is probably in making the code cleaner.
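To illustrate the point about terms.len() being small: with 4-element chunks, a constraint that has fewer than four terms never fills a single chunk, so all of its work lands in the leftover loop. The sketch below only counts chunks; the helper name `chunk_split` is hypothetical and is not part of the PR.

```rust
/// Returns (number of full 4-element chunks, number of leftover terms)
/// for a constraint with `terms_len` terms.
fn chunk_split(terms_len: usize) -> (usize, usize) {
    // Placeholder data; only the length matters for this illustration.
    let terms = vec![0u8; terms_len];
    let chunks = terms.chunks_exact(4);
    (chunks.len(), chunks.remainder().len())
}
```

For example, `chunk_split(3)` reports zero full chunks and three leftover terms, so a constraint of that size takes the plain sequential path regardless of how long the assignment vector is.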

crStiv added 2 commits March 9, 2025 08:47

Update r1cs_to_qap.rs

5cdab27

Update bench.rs

d64f5a9

crStiv requested a review from a team as a code owner March 9, 2025 14:13

crStiv requested review from Pratyush, mmagician and weikengchen and removed request for a team March 9, 2025 14:13

Pratyush and others added 2 commits March 19, 2025 22:10

Merge branch 'master' into фысф

87d89af

Clean up

a98a116

Clean up warnings in benches

7332d7d

Pratyush approved these changes Mar 20, 2025

Pratyush merged commit d570ee5 into arkworks-rs:master Mar 20, 2025
4 checks passed
