SciSharp · Nucs · Feb 14, 2026 · Feb 21, 2026 · Mar 7, 2026 · Mar 8, 2026
diff --git a/.claude/SESSION_HISTORY_REPORT.md b/.claude/SESSION_HISTORY_REPORT.md
diff --git a/.claude/plans/numpy-alignment-audit.md b/.claude/plans/numpy-alignment-audit.md
diff --git a/.claude/settings.local.json b/.claude/settings.local.json
@@ -0,0 +1 @@
+{}
diff --git a/benchmark/results-20260214-backup.tar.gz b/benchmark/results-20260214-backup.tar.gz
diff --git a/docs/INT64_DEVELOPER_GUIDE.md b/docs/INT64_DEVELOPER_GUIDE.md
@@ -0,0 +1,337 @@
+# Int64 Indexing Migration - Developer Guide
+
+This guide provides patterns and rules for developers continuing the int32 to int64 indexing migration.
+
+---
+
+## Core Principle
+
+**Think before casting.** The goal is to use `long` everywhere indices, sizes, strides, and offsets are involved. Only cast to `int` when absolutely required by external APIs.
+
+---
+
+## Decision Tree: Should This Be `long`?
+
+```
+Is it an index, size, stride, offset, or count?
+├── YES → Use `long`
+│   └── Exception: Does external API require int?
+│       ├── YES → Cast at the boundary, document why
+│       └── NO → Keep as `long`
+└── NO → Keep original type
+```
+
+---
+
+## Pattern 1: Loop Counters Over Array Elements
+
+**WRONG:**
+```csharp
+for (int i = 0; i < array.size; i++)  // size is now long
+    Process(array[i]);
+```
+
+**CORRECT:**
+```csharp
+for (long i = 0; i < array.size; i++)
+    Process(array[i]);
+```
+
+**Rule:** When iterating over array indices, use a `long` loop counter.
+
+---
+
+## Pattern 2: Coordinate Arrays
+
+**WRONG:**
+```csharp
+var coords = new int[2];
+coords[0] = i;  // i is long
+coords[1] = j;  // j is long
+array.GetValue(coords);  // GetValue now takes long[]
+```
+
+**CORRECT:**
+```csharp
+var coords = new long[2];
+coords[0] = i;
+coords[1] = j;
+array.GetValue(coords);
+```
+
+**Rule:** Coordinate arrays are `long[]`, not `int[]`.
+
+---
+
+## Pattern 3: Matrix Dimensions (M, K, N)
+
+**WRONG:**
+```csharp
+int M = (int)left.shape[0];   // Defeats the purpose!
+int K = (int)left.shape[1];
+int N = (int)right.shape[1];
+```
+
+**CORRECT:**
+```csharp
+long M = left.shape[0];   // shape[] returns long
+long K = left.shape[1];
+long N = right.shape[1];
+```
+
+**Rule:** Matrix dimensions are `long`. They come from `shape`, which is now `long[]`.
+
+---
+
+## Pattern 4: Pointer Arithmetic (Works Naturally)
+
+Pointer arithmetic already supports `long` offsets:
+
+```csharp
+T* ptr = (T*)Address;
+long offset = 3_000_000_000L;
+T value = ptr[offset];  // OK! Pointer indexing accepts long
+```
+
+**Rule:** Pointer arithmetic is already correct. Focus on the index variables.
+
+---
+
+## Pattern 5: Method Signatures
+
+When updating method signatures, change ALL index-related parameters:
+
+**BEFORE:**
+```csharp
+private static void MatMulCore<T>(NDArray left, NDArray right, T* result, int M, int K, int N)
+```
+
+**AFTER:**
+```csharp
+private static void MatMulCore<T>(NDArray left, NDArray right, T* result, long M, long K, long N)
+```
+
+**Rule:** Update the signature AND all callers simultaneously.
+
+---
+
+## Pattern 6: Unsafe Pointer Parameters
+
+**BEFORE:**
+```csharp
+public static unsafe bool IsContiguous(int* strides, int* shape, int ndim)
+```
+
+**AFTER:**
+```csharp
+public static unsafe bool IsContiguous(long* strides, long* shape, int ndim)
+```
+
+**Note:** `ndim` stays `int` (max ~32 dimensions).
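+
+A minimal sketch of how a contiguity check might look with the migrated signature. This is an illustration, not the actual `StrideDetector` implementation: it assumes strides are expressed in elements (not bytes), C-order layout, and compilation with `AllowUnsafeBlocks` enabled.
+
+```csharp
+using System;
+
+// Hypothetical sketch: C-contiguity check over long* strides and shape.
+static unsafe bool IsContiguous(long* strides, long* shape, int ndim)
+{
+    long expectedStride = 1;                 // product of trailing dimensions
+    for (int d = ndim - 1; d >= 0; d--)      // d indexes dimensions, so int is fine
+    {
+        // Dimensions of length 1 place no constraint on their stride.
+        if (shape[d] != 1 && strides[d] != expectedStride)
+            return false;
+        expectedStride *= shape[d];
+    }
+    return true;
+}
+
+unsafe
+{
+    long* shape = stackalloc long[] { 3, 4 };
+    long* rowMajor = stackalloc long[] { 4, 1 };
+    long* transposed = stackalloc long[] { 1, 3 };
+    Console.WriteLine(IsContiguous(rowMajor, shape, 2));    // True
+    Console.WriteLine(IsContiguous(transposed, shape, 2));  // False
+}
+```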
+
+---
+
+## Pattern 7: Local Variables in Algorithms
+
+**BEFORE:**
+```csharp
+int expectedStride = 1;
+for (int d = ndim - 1; d >= 0; d--)
+{
+    expectedStride *= shape[d];  // shape[d] is now long
+}
+```
+
+**AFTER:**
+```csharp
+long expectedStride = 1;
+for (int d = ndim - 1; d >= 0; d--)  // d stays int (dimension index)
+{
+    expectedStride *= shape[d];
+}
+```
+
+**Rule:** Variables that accumulate products of dimensions must be `long`. Dimension indices (`d`) can stay `int`.
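+
+To see why the accumulator type matters, the sketch below multiplies the dimensions of a hypothetical 65536 x 65536 shape (2^32 elements) with both an `int` and a `long` accumulator. The shape values are illustrative only; `unchecked` is written out to make the wraparound explicit.
+
+```csharp
+using System;
+
+long[] shape = { 65536, 65536 };   // 2^16 x 2^16 = 2^32 elements
+
+int intProduct = 1;
+long longProduct = 1;
+for (int d = shape.Length - 1; d >= 0; d--)
+{
+    intProduct = unchecked(intProduct * (int)shape[d]);  // old int accumulator wraps
+    longProduct *= shape[d];                             // long accumulator is exact
+}
+
+Console.WriteLine(intProduct);   // 0
+Console.WriteLine(longProduct);  // 4294967296
+```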
+
+---
+
+## Valid Exceptions: When int Cast IS Correct
+
+### 1. `Span<T>` Operations
+
+`Span<T>` has a hard `int` length limit:
+
+```csharp
+if (Count > int.MaxValue)
+    throw new InvalidOperationException("Storage size exceeds Span<T> maximum.");
+return new Span<T>(Address, (int)Count);
+```
+
+### 2. Managed Array Allocation
+
+Managed .NET arrays are limited to `int` indexing:
+
+```csharp
+if (size > int.MaxValue)
+    throw new InvalidOperationException("Cannot allocate managed array exceeding int.MaxValue.");
+var array = new T[(int)size];
+```
+
+### 3. Algorithm Complexity Constraints
+
+When O(n*m) complexity makes large arrays impractical anyway:
+
+```csharp
+// Convolution is O(na * nv), so practical limits are well under int.MaxValue
+int na = checked((int)a.size);  // throws OverflowException instead of truncating
+int nv = checked((int)v.size);
+```
+
+**Document these exceptions with comments explaining why the cast is safe.**
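+
+One way to keep boundary casts consistent is to route them through a single helper. The sketch below is an assumption for illustration: the name `ToIntLength` and its message format are hypothetical and do not come from the codebase.
+
+```csharp
+using System;
+
+// Hypothetical helper: narrows a long length to int at an external API boundary,
+// failing loudly instead of truncating.
+static int ToIntLength(long value, string what)
+{
+    if (value < 0 || value > int.MaxValue)
+        throw new OverflowException($"{what} ({value}) does not fit in int.");
+    return (int)value;
+}
+
+long count = 1_000;
+var buffer = new byte[ToIntLength(count, "count")];  // managed arrays need an int length
+Console.WriteLine(buffer.Length);                    // 1000
+
+try
+{
+    ToIntLength(3_000_000_000L, "count");
+}
+catch (OverflowException e)
+{
+    Console.WriteLine(e.Message);
+}
+```
+
+A shared helper also makes the remaining `int` boundaries easy to find with a text search.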
+
+---
+
+## What Stays `int`
+
+| Item | Reason |
+|------|--------|
+| `ndim` | Maximum ~32 dimensions |
+| `Slice.Start/Stop/Step` | Python slice semantics |
+| Dimension indices (`d` in loops) | Iterating over dimensions, not elements |
+| `NPTypeCode` values | Small enum |
+| Vector lane counts | Hardware-limited |
+
+---
+
+## Checklist for Each File
+
+When migrating a file:
+
+1. [ ] **Find all `int` variables** related to indices/sizes/strides/offsets
+2. [ ] **Change to `long`** unless an exception applies
+3. [ ] **Update method signatures** if parameters are index-related
+4. [ ] **Update callers** of changed methods
+5. [ ] **Check loop counters** iterating over array elements
+6. [ ] **Check coordinate arrays** - must be `long[]`
+7. [ ] **Check pointer params** - `int*` → `long*` for strides/shapes
+8. [ ] **Add overflow checks** where external APIs require `int`
+9. [ ] **Document exceptions** with comments
+
+---
+
+## Common Error Patterns
+
+### Error: Cannot convert long to int
+
+```
+error CS0266: Cannot implicitly convert type 'long' to 'int'
+```
+
+**Fix:** Change the receiving variable to `long`, or, if an external API requires `int`, add an explicit cast with an overflow check.
+
+### Error: Argument type mismatch
+
+```
+error CS1503: Argument 1: cannot convert from 'int[]' to 'long[]'
+```
+
+**Fix:** Change the array type at declaration site to `long[]`.
+
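+### Pitfall: Iterator type mismatch (no compiler error)
+
+```csharp
+foreach (int i in indices)  // indices now yields long: compiles, silently truncates
+```
+
+**Fix:** `foreach` inserts an explicit cast to the loop variable's type, so a stale `int` loop variable over a collection that now yields `long` compiles without an error and truncates large values. Check whether each enumerated collection now yields `long` and update the loop variable type (or use `var`).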
+
+---
+
+## File Categories and Priority
+
+### Priority 1: Core Types (Done)
+- Shape.cs - dimensions, strides, offset, size
+- IArraySlice.cs - index parameters
+- UnmanagedStorage.cs - Count field
+- UnmanagedStorage.Getters.cs - index parameters
+- UnmanagedStorage.Setters.cs - index parameters
+
+### Priority 2: Supporting Infrastructure (In Progress)
+- ArraySlice.cs / ArraySlice`1.cs - Allocate count, index operations
+- Incrementors (6 files) - coordinate arrays
+- StrideDetector.cs - pointer parameters
+
+### Priority 3: IL Kernel System (Major Effort)
+- IKernelProvider.cs - interface
+- ILKernelGenerator.*.cs (13 files) - IL emission, delegate signatures
+- SimdKernels.cs, SimdMatMul.cs - SIMD helpers
+
+### Priority 4: DefaultEngine Operations
+- Default.Clip.cs, Default.ATan2.cs
+- Default.Reduction.*.cs
+- Default.NonZero.cs, Default.Transpose.cs
+
+### Priority 5: API Functions
+- np.*.cs files
+- NDArray.*.cs files
+
+---
+
+## Testing Strategy
+
+After each batch of changes:
+
+1. **Build** - Fix all compilation errors
+2. **Run tests** - `dotnet test -- --treenode-filter "/*/*/*/*[Category!=OpenBugs]"`
+3. **Check for regressions** - Compare output with NumPy
+
+---
+
+## Git Commit Guidelines
+
+Commit in logical batches with descriptive messages:
+
+```
+int64 indexing: <component> <what changed>
+
+- <specific change 1>
+- <specific change 2>
+- <specific change 3>
+```
+
+Example:
+```
+int64 indexing: StrideDetector pointer params int* -> long*
+
+- IsContiguous: int* strides/shape -> long* strides/shape
+- IsScalar: int* strides -> long* strides
+- CanSimdChunk: int* params -> long*, innerSize/lhsInner/rhsInner -> long
+- Classify: int* params -> long*
+- expectedStride local -> long
+```
+
+---
+
+## Quick Reference
+
+| Old | New | Notes |
+|-----|-----|-------|
+| `int size` | `long size` | Array/storage size |
+| `int offset` | `long offset` | Memory offset |
+| `int[] dimensions` | `long[] dimensions` | Shape dimensions |
+| `int[] strides` | `long[] strides` | Memory strides |
+| `int[] coords` | `long[] coords` | Index coordinates |
+| `int* shape` | `long* shape` | Unsafe pointer |
+| `int* strides` | `long* strides` | Unsafe pointer |
+| `for (int i` | `for (long i` | Element iteration |
+| `int M, K, N` | `long M, K, N` | Matrix dimensions |
+| `int ndim` | `int ndim` | **KEEP** - dimension count |
+| `int d` (dim index) | `int d` | **KEEP** - dimension loop |
+
+---
+
+## Getting Help
+
+- GitHub Issue: #584
+- Migration Plan: `docs/INT64_INDEX_MIGRATION.md`
+- NumPy Reference: `src/numpy/_core/include/numpy/npy_common.h:217`
diff --git a/docs/issues/issue-0075-implement-numpy.asarray.md b/docs/issues/issue-0075-implement-numpy.asarray.md
@@ -10,7 +10,7 @@
 ## Description
 
 Convert the input to an array.
-https://numpy.org/doc/stable-1.15.0/reference/generated/numpy.asarray.html
+https://docs.scipy.org/doc/numpy-1.15.0/reference/generated/numpy.asarray.html
 
 ## Comments
 

diff --git a/docs/issues/issue-0078-implement-numpy.where.md b/docs/issues/issue-0078-implement-numpy.where.md
@@ -10,7 +10,7 @@
 ## Description
 
 Return elements, either from x or y, depending on condition.
-https://numpy.org/doc/stable-1.13.0/reference/generated/numpy.where.html
+https://docs.scipy.org/doc/numpy-1.13.0/reference/generated/numpy.where.html
 
 ## Comments
 

diff --git a/docs/issues/issue-0105-implement-numpy.vdot.md b/docs/issues/issue-0105-implement-numpy.vdot.md
@@ -11,7 +11,7 @@
 ## Description
 
 Return the dot product of two vectors.
-https://numpy.org/doc/stable-1.15.1/reference/generated/numpy.vdot.html#numpy.vdot
+https://docs.scipy.org/doc/numpy-1.15.1/reference/generated/numpy.vdot.html#numpy.vdot
 
 `ndarray.vdot` should be put in `LinearAlgebra` folder.
 

diff --git a/docs/issues/issue-0106-implement-numpy.inner.md b/docs/issues/issue-0106-implement-numpy.inner.md
@@ -11,6 +11,6 @@
 ## Description
 
 Inner product of two arrays.
-https://numpy.org/doc/stable-1.15.1/reference/generated/numpy.inner.html#numpy.inner
+https://docs.scipy.org/doc/numpy-1.15.1/reference/generated/numpy.inner.html#numpy.inner
 
 `ndarray.inner` should be put in `LinearAlgebra` folder.
diff --git a/docs/issues/issue-0108-implement-numpy.tensordot.md b/docs/issues/issue-0108-implement-numpy.tensordot.md
@@ -11,6 +11,6 @@
 ## Description
 
 Compute tensor dot product along specified axes for arrays >= 1-D.
-https://numpy.org/doc/stable-1.15.1/reference/generated/numpy.tensordot.html#numpy.tensordot
+https://docs.scipy.org/doc/numpy-1.15.1/reference/generated/numpy.tensordot.html#numpy.tensordot
 
 `ndarray.tensordot` should be put in `LinearAlgebra` folder.
diff --git a/docs/issues/issue-0114-implement-numpy.fft.fft.md b/docs/issues/issue-0114-implement-numpy.fft.fft.md
@@ -10,4 +10,4 @@
 ## Description
 
 Compute the one-dimensional discrete Fourier Transform.
-https://numpy.org/doc/stable-1.15.0/reference/generated/numpy.fft.fft.html#numpy.fft.fft
+https://docs.scipy.org/doc/numpy-1.15.0/reference/generated/numpy.fft.fft.html#numpy.fft.fft
diff --git a/docs/issues/issue-0202-implement-np.pad.md b/docs/issues/issue-0202-implement-np.pad.md
@@ -12,7 +12,7 @@
 ## Description
 
 Add a buffer or padding around a numpy array: 
-https://numpy.org/doc/stable/reference/generated/numpy.pad.html 
+https://docs.scipy.org/doc/numpy/reference/generated/numpy.pad.html 
 
 ## Comments