First attempt at embarrasingly parallel execution to compute head grids. by dbrakenhoff · Pull Request #140 · timflow-org/timflow

dbrakenhoff · 2026-06-18T14:16:43Z

Early result suggests ~10x speedup on my machine.

Todo

- add parallel submodule - define numba tuples containing relevant data for computations for Model, Aquifer and Elements (LineSink and Well) for now. Each class gets to_numba_tuple() method to collect data. - add integer mappings for elements and boundary conditions for identifying computation path - gather element data in structured arrays - write fast versions of potinf and potential - parallelize on x,y pts

dbrakenhoff · 2026-06-18T15:33:06Z

Parallel example (based on example from Discussion #115)

parallel_example.zip

Note that first run of numba code triggers the compilation step, which means it runs in about ~15s on my machine. The next run takes about ~3s. Normal timflow (using parallel=True, referring to multithreading) is about ~30s.

EDIT: example won't run until #139 is merged into dev and subsequently this branch.

eriktoller · 2026-06-18T21:10:58Z

Cool work and awesome to see such speed-up already!

Some thoughts:

It could be a good idea to skip fastmath=True, at least when still developing the code
it would be interesting to look at the difference if you change results_arr=bessellsv2(...) to bessellsv2(restuls_arr,...) and reuse a work array rather than creating a new one per linesink per point (300 linesink for 100 by 100 grid will be a lot of time for memory allocation).
it would also be cool if you could time the pre-processing, computations and post-processing of the numba approach to see where the majority of the effort goes into.

I will follow the progress with great interest, great work Davíd 🎉

dbrakenhoff · 2026-06-22T16:10:31Z

@eriktoller Thanks for the early thoughts!

It could be a good idea to skip fastmath=True, at least when still developing the code

Good point, I did test it for my little example and it gave the same results, but better to develop it without for now and compare at the end. It also didn't really generate any speedup in my current example.

it would be interesting to look at the difference if you change results_arr=bessellsv2(...) to bessellsv2(restuls_arr,...) and reuse a work array rather than creating a new one per linesink per point (300 linesink for 100 by 100 grid will be a lot of time for memory allocation).

Good suggestions, I left the existing numba code alone for now, but an (optional) work array output seems like a good idea.

it would also be cool if you could time the pre-processing, computations and post-processing of the numba approach to see where the majority of the effort goes into.

The pre-processing is currently only 0.05% of the total computation time in my current example. Maybe it will be a bit more if I include all the code up to the first call to the numba optimized potential computations, but for now it seems negligible. But it will become more as we start adding in support for more elements probably. So good to keep an eye on that.

So >99% of the time is taken by the actual potential computations, and this is how my example script scales with the number of threads on my laptop (averages of 3 runs). The base case (1 thread) runs in ~35s.

eriktoller · 2026-06-22T19:53:59Z

@dbrakenhoff

So >99% of the time is taken by the actual potential computations, and this is how my example script scales with the number of threads on my laptop (averages of 3 runs). The base case (1 thread) runs in ~35s.

That is really impressive and shows great potential! For larger timflow models this will be a substantial upgrade when it come to plotting.

Did @mbakker7 have a look at this too?

- avoid memory assignment in loops

dbrakenhoff · 2026-06-23T15:00:34Z

My early attempt at parallel computations within regular timflow using multithreading was a bit misguided. I now modified it to use multiprocessing instead of multithreading (#139) and this is the result when compared to the fully optimized numba in this PR.

mbakker7 · 2026-06-24T10:16:48Z

@dbrakenhoff

So >99% of the time is taken by the actual potential computations, and this is how my example script scales with the number of threads on my laptop (averages of 3 runs). The base case (1 thread) runs in ~35s.

That is really impressive and shows great potential! For larger timflow models this will be a substantial upgrade when it come to plotting.

Did @mbakker7 have a look at this too?

Yes, @eriktoller , I am in the loop. Mostly simply talking to @dbrakenhoff rather than giving comments here. Thanks for all your suggestions. Keep them coming!

dbrakenhoff self-assigned this Jun 18, 2026

dbrakenhoff added the enhancement New feature or request label Jun 18, 2026

dbrakenhoff added 3 commits June 18, 2026 17:01

fix inits

cc64336

add docstrings and ruffit

dff7abd

please ruff

4912f7c

dbrakenhoff added 3 commits June 22, 2026 12:11

Merge remote-tracking branch 'origin/dev' into parallel_numba

1370af7

make FASTMATH a constant

436605b

use no. of cores as thread limit

241589d

add optional work arrays to besselnumba

c24c16a

- avoid memory assignment in loops

dbrakenhoff added 3 commits June 23, 2026 17:07

fix bug introduced when introducing optional work arrays

da00322

move division

2b1ad24

reuse nterms

fd09b9a

mbakker7 mentioned this pull request Jun 24, 2026

add head_array() methods for unstructured computation of heads #139

Merged

dbrakenhoff added 2 commits June 24, 2026 13:14

Merge remote-tracking branch 'origin/dev' into parallel_numba

e1cf4b9

Merge remote-tracking branch 'origin/dev' into parallel_numba

133db05

Base automatically changed from dev to main June 26, 2026 07:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

First attempt at embarrasingly parallel execution to compute head grids.#140

First attempt at embarrasingly parallel execution to compute head grids.#140
dbrakenhoff wants to merge 14 commits into
mainfrom
parallel_numba

dbrakenhoff commented Jun 18, 2026 •

edited

Loading

Uh oh!

dbrakenhoff commented Jun 18, 2026 •

edited

Loading

Uh oh!

eriktoller commented Jun 18, 2026

Uh oh!

dbrakenhoff commented Jun 22, 2026

Uh oh!

eriktoller commented Jun 22, 2026

Uh oh!

dbrakenhoff commented Jun 23, 2026

Uh oh!

mbakker7 commented Jun 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

dbrakenhoff commented Jun 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Todo

Uh oh!

dbrakenhoff commented Jun 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

eriktoller commented Jun 18, 2026

Uh oh!

dbrakenhoff commented Jun 22, 2026

Uh oh!

eriktoller commented Jun 22, 2026

Uh oh!

dbrakenhoff commented Jun 23, 2026

Uh oh!

mbakker7 commented Jun 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

dbrakenhoff commented Jun 18, 2026 •

edited

Loading

dbrakenhoff commented Jun 18, 2026 •

edited

Loading