Remove a register by lemire · Pull Request #3 · fastfloat/int_serialization_benchmark

lemire · 2026-02-13T19:46:17Z

We can simplify the algorithm. This should not affect the performance, although it might be slightly beneficial because we need one fewer register.

The gist of it is that we replace

_mm512_madd52lo_epu64(zmmzero, bcstq_h, ifma_const);

by

_mm512_madd52lo_epu64(ifma_const, bcstq_h, ifma_const);

So instead of loading zmmzero and ifma_const, we just need ifma_const.
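To make the relation between the two calls concrete, here is a minimal scalar sketch of one 64-bit lane of `_mm512_madd52lo_epu64(a, b, c)`, following Intel's documented semantics: the low 52 bits of `b` and `c` are multiplied, and the low 52 bits of that product are added to the 64-bit accumulator `a`. The names `madd52lo_lane`, `old_form`, `new_form`, and `check_relation` are hypothetical and do not come from the repository. The sketch only shows that the new call equals the old call plus `ifma_const` in every lane; it does not show why that extra term is acceptable in the actual algorithm.

```cpp
#include <cassert>
#include <cstdint>

// Mask selecting the low 52 bits of a 64-bit lane.
constexpr std::uint64_t kMask52 = (std::uint64_t{1} << 52) - 1;

// Scalar model of a single lane of _mm512_madd52lo_epu64(a, b, c):
// a + low52( low52(b) * low52(c) ), with 64-bit wraparound on the add.
constexpr std::uint64_t madd52lo_lane(std::uint64_t a, std::uint64_t b,
                                      std::uint64_t c) {
    // A wrapping 64-bit multiply already gives the correct low 52 bits
    // of the full 104-bit product, so no 128-bit type is needed.
    const std::uint64_t lo = ((b & kMask52) * (c & kMask52)) & kMask52;
    return a + lo;
}

// Old form (hypothetical name): accumulator is zero.
constexpr std::uint64_t old_form(std::uint64_t h, std::uint64_t k) {
    return madd52lo_lane(0, h, k);
}

// New form (hypothetical name): the constant doubles as the accumulator.
constexpr std::uint64_t new_form(std::uint64_t h, std::uint64_t k) {
    return madd52lo_lane(k, h, k);
}

// The two forms differ by exactly the constant k (modulo 2^64).
inline void check_relation(std::uint64_t h, std::uint64_t k) {
    assert(new_form(h, k) == old_form(h, k) + k);
}
```

Calling `check_relation(h, k)` with any pair of values exercises the identity. In the vector code, the practical effect is that only `ifma_const` has to be materialized in a register.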

I am also trimming out shift_and_insert_dot as it is not needed.

jaja360

Very cool !!!

I ran some benchmarks to see if it impacts performance. It doesn't at all with clang. With g++, however, I get that the new version is 7% faster on the Twitter dataset, but 15% slower on Uniform-8 (it goes from 4.01 to 4.62 c/n). g++ has always given us suboptimal results (it generates 30 instructions instead of 27 for clang, on Uniform-8), so it seems more like a GCC codegen quirk than a problem on our side.

I think we can merge !

lemire · 2026-02-16T01:04:13Z

@jaja360 Logically, it should not make a difference... We save one named register, but that's not a limitation. We also save loading the register, which might help a little bit (maybe save one instruction) but it was almost surely not a bottleneck.

Of course, the main point is to simplify the algorithm and the code.

Merging.

jaja360 · 2026-02-16T01:07:27Z

Indeed, it should not make a difference (and it does not with clang!).
I think g++ just has some code alignment issues (which we already observed earlier), this time due to the static const variables we removed.

lemire added 2 commits February 13, 2026 13:31

simplify the algorithm, remove a register

50f2f0d

added explanation

804acef

lemire requested a review from jaja360 February 13, 2026 19:46

jaja360 approved these changes Feb 16, 2026

lemire merged commit 14f15ea into main Feb 16, 2026
4 checks passed

jaja360 deleted the remove_a_register branch February 16, 2026 01:07
