
Norm-extracted vectors#7107

Open
connortsui20 wants to merge 9 commits into develop from ct/norm

Conversation

@connortsui20
Contributor

Summary

Tracking Issue: #6865

Related PR: #7018

Adds a new encoding specific to the vector extension type.

Note that we cannot actually add this to the compressor until we make compression pluggable (see #7018). When that eventually lands, we can simply create a NormVectorScheme that uses the NormVectorArray.

Implementation

We currently do not have a good way of broadcasting a multiplication onto a FixedSizeList, so this implementation hand-rolls the norm multiplication. Additionally, we still do not have #6717, so the scalar_at implementation is also slow.
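For concreteness, the hand-rolled norm multiplication boils down to the following plain-Rust sketch over the flattened FixedSizeList elements. This is a hypothetical illustration, not the PR's code: `normalize_vectors`, the `f32` element type, and the zero-norm handling are all assumptions.

```rust
/// Hypothetical sketch: normalize each fixed-size vector in a flat element
/// buffer by its L2 norm, returning (norms, normalized elements).
/// Zero-norm vectors are left untouched so the encoding stays lossless.
fn normalize_vectors(elements: &[f32], list_size: usize) -> (Vec<f32>, Vec<f32>) {
    assert_eq!(elements.len() % list_size, 0);
    let mut norms = Vec::with_capacity(elements.len() / list_size);
    let mut normalized = Vec::with_capacity(elements.len());
    for vector in elements.chunks_exact(list_size) {
        // L2 norm: sqrt of the sum of squares.
        let norm = vector.iter().map(|x| x * x).sum::<f32>().sqrt();
        norms.push(norm);
        if norm == 0.0 {
            // A zero vector cannot be normalized; store it as-is.
            normalized.extend_from_slice(vector);
        } else {
            normalized.extend(vector.iter().map(|x| x / norm));
        }
    }
    (norms, normalized)
}

fn main() {
    let (norms, normalized) = normalize_vectors(&[3.0, 4.0, 0.0, 0.0], 2);
    assert_eq!(norms, vec![5.0, 0.0]);
    assert_eq!(normalized, vec![0.6, 0.8, 0.0, 0.0]);
}
```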

API Changes

Adds a new encoding type NormVector

Testing

Some simple tests including roundtrips.

@codspeed-hq

codspeed-hq bot commented Mar 24, 2026

Merging this PR will degrade performance by 10.2%

❌ 1 regressed benchmark
✅ 1105 untouched benchmarks
⏩ 1522 skipped benchmarks¹

⚠️ Please fix the performance issues or acknowledge them on CodSpeed.

Performance Changes

| Mode | Benchmark | BASE | HEAD | Efficiency |
| --- | --- | --- | --- | --- |
| Simulation | map_each[BufferMut<i32>, 128] | 770.6 ns | 858.1 ns | -10.2% |

Comparing ct/norm (e288bb1) with develop (ec2c602)


Footnotes

  1. 1522 benchmarks were skipped, so their baseline results were used instead. If they were deleted from the codebase, archive them on CodSpeed to remove them from the performance reports.

Contributor Author

Note that once we get ScalarValue::Array (#6717), this will be a lot less ugly.

@connortsui20 connortsui20 marked this pull request as ready for review March 24, 2026 15:03
@connortsui20 connortsui20 enabled auto-merge (squash) March 24, 2026 15:05
// also be nullable (null vectors produce null norms).
let storage = extension_storage(&vector_array)?;
let l2_norm_expr =
Expression::try_new(ScalarFn::new(L2Norm, EmptyOptions).erased(), [root()])?;
Contributor

A shortcut would be L2Norm.new_expr(...)

Contributor
Even then, we should have a nicer way of doing this!!

let norms_array = norms_prim.clone().into_array();

// Extract flat elements from the (always non-nullable) storage for normalization.
let flat = extract_flat_elements(&storage, list_size, ctx)?;
Contributor

What is FlatElements?

Contributor Author

It's a helper function I use for all of the other tensor types to make sure we can deal with ConstantArray correctly:

/// Extracts the flat primitive elements from a tensor storage array (FixedSizeList).
///
/// When the input is a [`ConstantArray`] (e.g., a literal query vector), only a single row is
/// materialized to avoid expanding it to the full column length.
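The constant-array fast path that doc comment describes can be illustrated with a hypothetical sketch. The `TensorStorage` enum and this `extract_flat_elements` signature are invented for illustration; the real helper operates on the library's array types.

```rust
/// Hypothetical model of tensor storage, invented for illustration only.
enum TensorStorage {
    /// One `list_size`-length chunk of elements per row.
    Flat(Vec<f32>),
    /// A single vector logically repeated `row_count` times.
    Constant(Vec<f32>, usize),
}

/// Sketch of the fast path: for a constant array (e.g. a literal query
/// vector), materialize only one row's elements instead of expanding it
/// to the full column length.
fn extract_flat_elements(storage: &TensorStorage, list_size: usize) -> Vec<f32> {
    match storage {
        TensorStorage::Flat(elements) => {
            assert_eq!(elements.len() % list_size, 0);
            elements.clone()
        }
        TensorStorage::Constant(vector, _row_count) => {
            assert_eq!(vector.len(), list_size);
            vector.clone() // one row, regardless of `_row_count`
        }
    }
}

fn main() {
    let constant = TensorStorage::Constant(vec![1.0, 2.0], 1_000_000);
    assert_eq!(extract_flat_elements(&constant, 2), vec![1.0, 2.0]);
}
```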

let storage = extension_storage(&vector_array)?;
let l2_norm_expr =
Expression::try_new(ScalarFn::new(L2Norm, EmptyOptions).erased(), [root()])?;
let norms_prim: PrimitiveArray = vector_array.apply(&l2_norm_expr)?.execute(ctx)?;
Contributor

You said the vector array can be nullable, but you don't check validity when iterating below

Contributor Author

whoops

// Extract flat elements from the (always non-nullable) storage for normalization.
let flat = extract_flat_elements(&storage, list_size, ctx)?;

match_each_float_ptype!(flat.ptype(), |T| {
Contributor

This all seems a bit convoluted? Can you just run-end encode the norms by the vector lengths, then use a divide function?

Contributor Author

@connortsui20 commented Mar 25, 2026

So this is what the decompress path would look like (the compress path would be similar):

        // We need to multiply each vector element with its respective norm. We do not have any kind
        // of "broadcast" expression to each of the `FixedSizeList` elements, so we can mimic this
        // by multiplying the normalized vector array with a `RunEnd(Sequence)` array.
        let base: PValue = list_size.into();
        let multiplier: PValue = base;
        let ends_ptype = base.ptype();
        let ends_nullability = Nullability::NonNullable;
        let ends =
            SequenceArray::try_new(base, multiplier, ends_ptype, ends_nullability, num_vectors)?;

        let runend = RunEndArray::try_new(ends.into_array(), self.norms.clone())?;

        let storage = extension_storage(&self.vector_array)?;
        let fsl: FixedSizeListArray = storage.execute(ctx)?;
        let elements = fsl.elements();
        debug_assert!(elements.dtype().is_primitive());

        let decompress = elements.binary(runend.into_array(), Operator::Mul)?;

        let denormalized_elements: PrimitiveArray = decompress.execute(ctx)?;

        // SAFETY: TODO
        let fsl = unsafe {
            FixedSizeListArray::new_unchecked(
                denormalized_elements.into_array(),
                list_size,
                self.validity()?,
                num_vectors,
            )
        };

        Ok(ExtensionArray::new(ext.clone(), fsl.into_array()).into_array())
    }

I don't think this is actually less convoluted, but maybe we should do it regardless, since the optimizer might be able to do something with it.
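The RunEnd(Sequence) broadcast above reduces to "repeat each norm list_size times, then multiply elementwise." A hypothetical plain-Rust sketch of just the index arithmetic that the run-end array encodes (no run-end machinery, and the function name is invented):

```rust
/// Hypothetical sketch of the RunEnd(Sequence) broadcast: norm `i` covers the
/// run of elements [i * list_size, (i + 1) * list_size), so element `j` is
/// multiplied by norms[j / list_size]. A run-end array encodes exactly this
/// mapping without materializing the repeated norms.
fn runend_broadcast_mul(elements: &[f32], norms: &[f32], list_size: usize) -> Vec<f32> {
    assert_eq!(elements.len(), norms.len() * list_size);
    elements
        .iter()
        .enumerate()
        .map(|(j, x)| x * norms[j / list_size])
        .collect()
}

fn main() {
    // Two vectors of length 2, denormalized by their norms 2.0 and 4.0.
    let out = runend_broadcast_mul(&[1.5, 2.5, 0.5, 0.25], &[2.0, 4.0], 2);
    assert_eq!(out, vec![3.0, 5.0, 2.0, 1.0]);
}
```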

Signed-off-by: Connor Tsui <connor.tsui20@gmail.com>
@connortsui20
Contributor Author

ok just realized that a bunch of stuff was wrong because I changed from non-nullable to nullable halfway through, will fix everything

@connortsui20
Contributor Author

Also @gatesn, the .binary(???, Operator::Div) won't work because we need a safe divide-by-0, and Operator::Div will return an ArrowError that I can't easily intercept.

Do you think it is fine to just have this broadcast logic for decompression and not compression? (see the latest commit)
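The safe divide the compress path would need could be sketched like this. This is hypothetical plain Rust, not the library's kernel; the convention of leaving a zero-norm vector's elements unchanged is an assumption.

```rust
/// Hypothetical "safe divide" for the compress path: a zero norm means the
/// vector is all zeros, so its elements are kept unchanged rather than
/// producing a divide-by-zero error (or Inf/NaN).
fn safe_div_by_norms(elements: &[f32], norms: &[f32], list_size: usize) -> Vec<f32> {
    assert_eq!(elements.len(), norms.len() * list_size);
    elements
        .iter()
        .enumerate()
        .map(|(j, x)| {
            let norm = norms[j / list_size];
            if norm == 0.0 { *x } else { x / norm }
        })
        .collect()
}

fn main() {
    // Second vector has norm 0.0 and passes through unchanged.
    let out = safe_div_by_norms(&[3.0, 4.0, 0.0, 0.0], &[2.0, 0.0], 2);
    assert_eq!(out, vec![1.5, 2.0, 0.0, 0.0]);
}
```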


Labels

changelog/feature: A new feature
