Skip to content

bench(sirun): benchmark harness foundation#8721

Draft
BridgeAR wants to merge 4 commits into
masterfrom
BridgeAR/2026-05-31-benchmark-platform-job
Draft

bench(sirun): benchmark harness foundation#8721
BridgeAR wants to merge 4 commits into
masterfrom
BridgeAR/2026-05-31-benchmark-platform-job

Conversation

@BridgeAR
Copy link
Copy Markdown
Member

@BridgeAR BridgeAR commented May 30, 2026

Summary

Foundation for the split-up sirun benchmark work; the per-bench PRs stack on this and should merge after it.

  1. startup-guard.js — the load-vs-loop share assertion every bench calls.
  2. runall.sh — tolerate a new benchmark failing on the older baseline source (only when it passed on the candidate), and auto-shard from variant count × available cores with an exact fix message when the matrix is too small.
  3. .gitlab/benchmarks.yml — report changes at 2% instead of 5%, and add the dashboard-link variable.
  4. .github/CODEOWNERS — route each benchmark directory to its owning team.

Test plan

  • The benchmark pipeline runs green with the new sharding and threshold.

BridgeAR added 2 commits May 31, 2026 01:55
Lower the report threshold from 5% to 2%. The analyzer only flags a change when
its confidence interval is entirely outside the threshold, so a tight (low
variance) interval is required to surface a 2% move while noisy intervals stay
under the bar. Also add BENCHMARK_DASHBOARD_URL (empty by default) which the
reporter renders as a link in the PR comment when set.
…uite

With BENCHMARKS_FROM=candidate the baseline runs this PR's benchmark code on the
older source. A baseline failure is skipped only when the same variant passed on
the candidate run, confirming the failure is specific to the older source rather
than a broken benchmark. The run still fails when the PR also changes
non-benchmark source (docs, CODEOWNERS, CI config and tests excluded) -- the A/B
comparison is then incomplete and the benchmark should land on its own first.
@codecov
Copy link
Copy Markdown

codecov Bot commented May 30, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 93.02%. Comparing base (03116df) to head (b1d2306).

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #8721      +/-   ##
==========================================
- Coverage   93.23%   93.02%   -0.22%     
==========================================
  Files         860      860              
  Lines       48883    48883              
  Branches     9248     9248              
==========================================
- Hits        45577    45474     -103     
- Misses       3306     3409     +103     
Flag Coverage Δ
aiguard-integration-active 41.50% <ø> (-0.14%) ⬇️
aiguard-integration-latest 41.51% <ø> (-0.14%) ⬇️
aiguard-integration-maintenance 41.68% <ø> (ø)
aiguard-macos 33.38% <ø> (-0.08%) ⬇️
aiguard-ubuntu 33.46% <ø> (-0.08%) ⬇️
aiguard-windows 33.19% <ø> (-0.08%) ⬇️
apm-capabilities-tracing-macos 48.43% <ø> (ø)
apm-capabilities-tracing-ubuntu-active 48.42% <ø> (ø)
apm-capabilities-tracing-ubuntu-latest 48.42% <ø> (?)
apm-capabilities-tracing-ubuntu-maintenance 48.63% <ø> (+0.18%) ⬆️
apm-capabilities-tracing-ubuntu-oldest 48.63% <ø> (ø)
apm-capabilities-tracing-windows ?
apm-integrations-aerospike-18-gte.5.2.0 33.24% <ø> (-0.08%) ⬇️
apm-integrations-aerospike-20-gte.5.5.0 33.26% <ø> (-0.08%) ⬇️
apm-integrations-aerospike-22-gte.5.12.1 33.26% <ø> (-0.08%) ⬇️
apm-integrations-aerospike-22-gte.6.0.0 33.26% <ø> (-0.08%) ⬇️
apm-integrations-aerospike-eol- 33.16% <ø> (-0.08%) ⬇️
apm-integrations-child-process 34.18% <ø> (-0.08%) ⬇️
apm-integrations-confluentinc-kafka-javascript-18 40.27% <ø> (-0.08%) ⬇️
apm-integrations-confluentinc-kafka-javascript-20 40.28% <ø> (-0.08%) ⬇️
apm-integrations-confluentinc-kafka-javascript-22 40.29% <ø> (-0.08%) ⬇️
apm-integrations-confluentinc-kafka-javascript-24 40.24% <ø> (-0.08%) ⬇️
apm-integrations-couchbase-18 33.42% <ø> (+0.09%) ⬆️
apm-integrations-couchbase-eol 33.30% <ø> (-0.08%) ⬇️
apm-integrations-dns 33.17% <ø> (-0.08%) ⬇️
apm-integrations-elasticsearch 34.28% <ø> (-0.08%) ⬇️
apm-integrations-http-latest 41.46% <ø> (-0.07%) ⬇️
apm-integrations-http-maintenance 41.50% <ø> (-0.07%) ⬇️
apm-integrations-http-oldest 41.44% <ø> (-0.06%) ⬇️
apm-integrations-http2 ?
apm-integrations-kafkajs-latest 40.33% <ø> (-0.18%) ⬇️
apm-integrations-kafkajs-oldest 40.37% <ø> (-0.18%) ⬇️
apm-integrations-net 33.87% <ø> (-0.08%) ⬇️
apm-integrations-next-11.1.4 20.07% <ø> (ø)
apm-integrations-next-12.3.7 20.07% <ø> (ø)
apm-integrations-next-13.0.0 29.21% <ø> (-0.12%) ⬇️
apm-integrations-next-13.2.0 29.21% <ø> (-0.12%) ⬇️
apm-integrations-next-13.5.11 29.35% <ø> (-0.08%) ⬇️
apm-integrations-next-14.0.0 29.28% <ø> (-0.08%) ⬇️
apm-integrations-next-14.2.35 29.28% <ø> (-0.08%) ⬇️
apm-integrations-next-14.2.6 29.28% <ø> (-0.08%) ⬇️
apm-integrations-next-14.2.7 29.28% <ø> (-0.08%) ⬇️
apm-integrations-next-15.0.0 29.28% <ø> (-0.08%) ⬇️
apm-integrations-next-15.4.0 29.35% <ø> (-0.08%) ⬇️
apm-integrations-oracledb 34.10% <ø> (-0.08%) ⬇️
apm-integrations-prisma-18-gte.6.16.0.and.lt.7.0.0 34.59% <ø> (-0.08%) ⬇️
apm-integrations-prisma-latest-all 34.38% <ø> (-0.06%) ⬇️
apm-integrations-restify 35.28% <ø> (-0.08%) ⬇️
apm-integrations-sharedb 32.64% <ø> (-0.08%) ⬇️
apm-integrations-tedious 33.57% <ø> (-0.08%) ⬇️
appsec-express 51.34% <ø> (-0.06%) ⬇️
appsec-fastify 48.06% <ø> (-0.06%) ⬇️
appsec-graphql 48.05% <ø> (-0.05%) ⬇️
appsec-integration-active 36.24% <ø> (ø)
appsec-integration-latest 36.24% <ø> (ø)
appsec-integration-maintenance 36.28% <ø> (ø)
appsec-integration-oldest 36.27% <ø> (ø)
appsec-kafka 40.58% <ø> (-0.02%) ⬇️
appsec-ldapjs 39.90% <ø> (-0.07%) ⬇️
appsec-lodash 39.92% <ø> (-0.07%) ⬇️
appsec-macos 57.34% <ø> (-0.14%) ⬇️
appsec-mongodb-core 44.15% <ø> (-0.07%) ⬇️
appsec-mongoose 45.00% <ø> (-0.05%) ⬇️
appsec-mysql 47.23% <ø> (+0.06%) ⬆️
appsec-next-latest-11.1.4 27.46% <ø> (-0.07%) ⬇️
appsec-next-latest-12.3.7 27.66% <ø> (ø)
appsec-next-latest-13.0.0 29.26% <ø> (-0.08%) ⬇️
appsec-next-latest-13.2.0 29.29% <ø> (-0.08%) ⬇️
appsec-next-latest-13.5.11 29.39% <ø> (-0.08%) ⬇️
appsec-next-latest-14.0.0 29.31% <ø> (-0.08%) ⬇️
appsec-next-latest-14.2.35 29.31% <ø> (-0.08%) ⬇️
appsec-next-latest-14.2.6 29.31% <ø> (-0.08%) ⬇️
appsec-next-latest-14.2.7 29.31% <ø> (-0.08%) ⬇️
appsec-next-latest-15.0.0 29.31% <ø> (-0.08%) ⬇️
appsec-next-latest-latest 29.32% <ø> (-0.08%) ⬇️
appsec-next-oldest-11.1.4 27.46% <ø> (-0.07%) ⬇️
appsec-next-oldest-12.3.7 29.26% <ø> (-0.08%) ⬇️
appsec-next-oldest-13.0.0 29.26% <ø> (-0.08%) ⬇️
appsec-next-oldest-13.2.0 29.53% <ø> (-0.08%) ⬇️
appsec-next-oldest-13.5.11 29.64% <ø> (-0.08%) ⬇️
appsec-next-oldest-14.0.0 29.56% <ø> (-0.08%) ⬇️
appsec-next-oldest-14.2.35 29.56% <ø> (-0.08%) ⬇️
appsec-next-oldest-14.2.6 29.56% <ø> (-0.08%) ⬇️
appsec-next-oldest-14.2.7 29.56% <ø> (-0.08%) ⬇️
appsec-next-oldest-15.0.0 29.56% <ø> (-0.08%) ⬇️
appsec-next-oldest-latest 27.77% <ø> (ø)
appsec-node-serialize 39.23% <ø> (-0.07%) ⬇️
appsec-passport 42.84% <ø> (-0.07%) ⬇️
appsec-postgres 46.96% <ø> (-0.06%) ⬇️
appsec-sourcing 38.63% <ø> (-0.07%) ⬇️
appsec-stripe 40.60% <ø> (-0.07%) ⬇️
appsec-template 39.47% <ø> (-0.07%) ⬇️
appsec-ubuntu 57.41% <ø> (-0.06%) ⬇️
appsec-windows ?
debugger-ubuntu-active 43.84% <ø> (ø)
debugger-ubuntu-latest 43.84% <ø> (ø)
debugger-ubuntu-maintenance 43.92% <ø> (ø)
debugger-ubuntu-oldest 44.23% <ø> (ø)
instrumentations-instrumentation-ai 32.35% <ø> (ø)
instrumentations-instrumentation-aws-sdk 35.50% <ø> (?)
instrumentations-instrumentation-bluebird 27.59% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-body-parser 35.80% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-child_process 33.55% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-cookie-parser 29.52% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-couchbase-18 36.81% <ø> (ø)
instrumentations-instrumentation-couchbase-eol 36.81% <ø> (ø)
instrumentations-instrumentation-crypto 27.67% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-express 29.70% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-express-mongo-sanitize 29.62% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-express-multi-version 20.97% <ø> (ø)
instrumentations-instrumentation-express-session 35.56% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-fastify 39.70% <ø> (ø)
instrumentations-instrumentation-fetch 33.05% <ø> (ø)
instrumentations-instrumentation-fs 27.31% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-generic-pool 27.15% <ø> (ø)
instrumentations-instrumentation-hono 28.82% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-http 35.23% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-http-client-options 37.73% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-kafkajs 48.79% <ø> (ø)
instrumentations-instrumentation-knex 27.57% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-light-my-request 35.42% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-mongoose 28.70% <ø> (?)
instrumentations-instrumentation-multer 35.46% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-mysql2 33.52% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-openai-aiguard 42.77% <ø> (ø)
instrumentations-instrumentation-otel-sdk-trace 25.23% <ø> (ø)
instrumentations-instrumentation-passport 39.30% <ø> (-0.07%) ⬇️
instrumentations-instrumentation-passport-http 39.00% <ø> (-0.07%) ⬇️
instrumentations-instrumentation-passport-local 39.46% <ø> (-0.07%) ⬇️
instrumentations-instrumentation-pg 33.23% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-promise 27.53% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-promise-js 27.54% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-q 27.57% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-router 34.98% <ø> (ø)
instrumentations-instrumentation-stripe 28.08% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-url 27.50% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-when 27.55% <ø> (-0.08%) ⬇️
instrumentations-instrumentation-zlib 27.54% <ø> (-0.08%) ⬇️
instrumentations-integration-esbuild-0.16.12-active 18.50% <ø> (ø)
instrumentations-integration-esbuild-0.16.12-latest 18.50% <ø> (ø)
instrumentations-integration-esbuild-0.16.12-maintenance 18.51% <ø> (ø)
instrumentations-integration-esbuild-0.16.12-oldest 18.50% <ø> (ø)
instrumentations-integration-esbuild-latest-active 18.50% <ø> (ø)
instrumentations-integration-esbuild-latest-latest 18.50% <ø> (ø)
instrumentations-integration-esbuild-latest-maintenance 18.51% <ø> (ø)
instrumentations-integration-esbuild-latest-oldest 18.50% <ø> (ø)
llmobs-ai 36.31% <ø> (-0.08%) ⬇️
llmobs-anthropic 36.69% <ø> (-0.08%) ⬇️
llmobs-bedrock 35.41% <ø> (-0.07%) ⬇️
llmobs-google-genai 35.74% <ø> (-0.07%) ⬇️
llmobs-langchain 34.81% <ø> (-0.06%) ⬇️
llmobs-openai-latest 39.18% <ø> (-0.07%) ⬇️
llmobs-openai-oldest 39.21% <ø> (-0.07%) ⬇️
llmobs-sdk-active 43.55% <ø> (-0.08%) ⬇️
llmobs-sdk-latest 43.55% <ø> (-0.08%) ⬇️
llmobs-sdk-maintenance ?
llmobs-sdk-oldest 43.59% <ø> (-0.08%) ⬇️
llmobs-vertex-ai 35.73% <ø> (-0.08%) ⬇️
master-coverage 93.02% <ø> (?)
openfeature-macos 37.69% <ø> (+<0.01%) ⬆️
openfeature-ubuntu 37.76% <ø> (ø)
openfeature-unit-active 50.65% <ø> (ø)
openfeature-unit-latest 50.65% <ø> (ø)
openfeature-unit-maintenance 50.77% <ø> (ø)
openfeature-unit-oldest 50.77% <ø> (ø)
openfeature-windows 37.51% <ø> (-0.06%) ⬇️
platform-core 31.85% <ø> (ø)
platform-esbuild 36.42% <ø> (ø)
platform-instrumentations-misc 30.37% <ø> (ø)
platform-integration-active 46.92% <ø> (?)
platform-integration-latest 46.92% <ø> (ø)
platform-integration-maintenance 46.98% <ø> (+0.05%) ⬆️
platform-integration-oldest 47.08% <ø> (-0.05%) ⬇️
platform-shimmer 40.18% <ø> (ø)
platform-unit-guardrails 32.72% <ø> (ø)
platform-webpack 18.02% <ø> (ø)
plugins-axios 35.58% <ø> (+0.01%) ⬆️
plugins-azure-cosmos 35.84% <ø> (-0.24%) ⬇️
plugins-azure-event-hubs 34.84% <ø> (ø)
plugins-azure-service-bus 35.34% <ø> (ø)
plugins-body-parser 36.55% <ø> (ø)
plugins-bullmq 39.16% <ø> (+0.02%) ⬆️
plugins-cassandra 33.71% <ø> (-0.08%) ⬇️
plugins-cookie 25.16% <ø> (ø)
plugins-cookie-parser 24.91% <ø> (ø)
plugins-crypto 24.70% <ø> (ø)
plugins-dd-trace-api 33.41% <ø> (-0.08%) ⬇️
plugins-express-mongo-sanitize 25.16% <ø> (ø)
plugins-express-session 24.83% <ø> (ø)
plugins-fastify 37.86% <ø> (-0.08%) ⬇️
plugins-fetch 34.14% <ø> (-0.08%) ⬇️
plugins-fs 33.80% <ø> (-0.08%) ⬇️
plugins-generic-pool 23.87% <ø> (ø)
plugins-google-cloud-pubsub 41.54% <ø> (-0.07%) ⬇️
plugins-grpc 36.64% <ø> (-0.08%) ⬇️
plugins-handlebars 25.12% <ø> (ø)
plugins-hapi 35.68% <ø> (-0.08%) ⬇️
plugins-hono 36.02% <ø> (-0.08%) ⬇️
plugins-ioredis 34.26% <ø> (-0.08%) ⬇️
plugins-jest 27.11% <ø> (ø)
plugins-knex 24.84% <ø> (ø)
plugins-langgraph 32.41% <ø> (-0.08%) ⬇️
plugins-ldapjs 22.49% <ø> (ø)
plugins-light-my-request 24.56% <ø> (ø)
plugins-limitd-client 27.89% <ø> (-0.15%) ⬇️
plugins-lodash 24.06% <ø> (ø)
plugins-mariadb 35.23% <ø> (-0.08%) ⬇️
plugins-memcached 33.78% <ø> (-0.08%) ⬇️
plugins-microgateway-core 34.77% <ø> (-0.08%) ⬇️
plugins-modelcontextprotocol-sdk 32.37% <ø> (-0.08%) ⬇️
plugins-moleculer 36.70% <ø> (?)
plugins-mongodb 35.84% <ø> (-0.08%) ⬇️
plugins-mongodb-core 35.51% <ø> (-0.08%) ⬇️
plugins-mongoose 34.58% <ø> (+0.08%) ⬆️
plugins-multer 24.87% <ø> (ø)
plugins-mysql 34.69% <ø> (-0.09%) ⬇️
plugins-mysql2 34.97% <ø> (-0.08%) ⬇️
plugins-nats 36.43% <ø> (-0.08%) ⬇️
plugins-node-serialize 25.20% <ø> (ø)
plugins-opensearch 33.66% <ø> (-0.08%) ⬇️
plugins-passport-http 24.80% <ø> (ø)
plugins-pino ?
plugins-postgres 34.53% <ø> (-0.08%) ⬇️
plugins-process 24.70% <ø> (ø)
plugins-pug 25.16% <ø> (ø)
plugins-redis 34.31% <ø> (-0.09%) ⬇️
plugins-router 38.12% <ø> (-0.24%) ⬇️
plugins-sequelize 23.75% <ø> (ø)
plugins-test-and-upstream-amqp10 33.75% <ø> (-0.25%) ⬇️
plugins-test-and-upstream-amqplib 39.14% <ø> (-0.08%) ⬇️
plugins-test-and-upstream-apollo 34.89% <ø> (-0.07%) ⬇️
plugins-test-and-upstream-avsc 33.81% <ø> (-0.08%) ⬇️
plugins-test-and-upstream-bunyan 29.03% <ø> (-0.08%) ⬇️
plugins-test-and-upstream-connect 36.32% <ø> (-0.08%) ⬇️
plugins-test-and-upstream-graphql 36.11% <ø> (-0.08%) ⬇️
plugins-test-and-upstream-koa 35.86% <ø> (-0.08%) ⬇️
plugins-test-and-upstream-protobufjs 34.05% <ø> (-0.08%) ⬇️
plugins-test-and-upstream-rhea 39.27% <ø> (-0.08%) ⬇️
plugins-undici 34.65% <ø> (-0.08%) ⬇️
plugins-url 24.70% <ø> (ø)
plugins-valkey 33.88% <ø> (-0.07%) ⬇️
plugins-vm 24.70% <ø> (ø)
plugins-winston 29.70% <ø> (-0.08%) ⬇️
plugins-ws 37.16% <ø> (-0.08%) ⬇️
profiling-macos 43.28% <ø> (-0.07%) ⬇️
profiling-ubuntu ?
profiling-windows 41.02% <ø> (-0.07%) ⬇️
serverless-aws-sdk-latest-aws-sdk 33.52% <ø> (-0.07%) ⬇️
serverless-aws-sdk-latest-bedrockruntime 31.74% <ø> (-0.07%) ⬇️
serverless-aws-sdk-latest-client 20.28% <ø> (ø)
serverless-aws-sdk-latest-dynamodb 34.29% <ø> (-0.07%) ⬇️
serverless-aws-sdk-latest-eventbridge 27.10% <ø> (-0.07%) ⬇️
serverless-aws-sdk-latest-kinesis 37.49% <ø> (-0.07%) ⬇️
serverless-aws-sdk-latest-lambda 34.83% <ø> (-0.07%) ⬇️
serverless-aws-sdk-latest-s3 32.68% <ø> (-0.07%) ⬇️
serverless-aws-sdk-latest-serverless-peer-service 39.76% <ø> (-0.08%) ⬇️
serverless-aws-sdk-latest-sns 38.63% <ø> (-0.07%) ⬇️
serverless-aws-sdk-latest-sqs 38.12% <ø> (-0.07%) ⬇️
serverless-aws-sdk-latest-stepfunctions 33.39% <ø> (-0.07%) ⬇️
serverless-aws-sdk-latest-util 47.00% <ø> (?)
serverless-aws-sdk-oldest-aws-sdk 33.60% <ø> (-0.06%) ⬇️
serverless-aws-sdk-oldest-bedrockruntime 32.00% <ø> (-0.07%) ⬇️
serverless-aws-sdk-oldest-client 20.67% <ø> (ø)
serverless-aws-sdk-oldest-dynamodb 34.39% <ø> (-0.07%) ⬇️
serverless-aws-sdk-oldest-eventbridge 27.14% <ø> (-0.07%) ⬇️
serverless-aws-sdk-oldest-kinesis 37.61% <ø> (-0.07%) ⬇️
serverless-aws-sdk-oldest-lambda 34.89% <ø> (-0.07%) ⬇️
serverless-aws-sdk-oldest-s3 32.77% <ø> (-0.07%) ⬇️
serverless-aws-sdk-oldest-serverless-peer-service 39.80% <ø> (-0.08%) ⬇️
serverless-aws-sdk-oldest-sns 38.81% <ø> (+0.05%) ⬆️
serverless-aws-sdk-oldest-sqs 38.10% <ø> (+0.07%) ⬆️
serverless-aws-sdk-oldest-stepfunctions 33.44% <ø> (-0.07%) ⬇️
serverless-aws-sdk-oldest-util 47.26% <ø> (ø)
serverless-azure-durable-functions 36.86% <ø> (ø)
serverless-azure-functions-eventhubs 38.47% <ø> (ø)
serverless-azure-functions-servicebus 38.53% <ø> (ø)
serverless-lambda 34.60% <ø> (-0.10%) ⬇️
test-optimization-cucumber-latest-7.0.0 50.44% <ø> (+0.11%) ⬆️
test-optimization-cucumber-latest-latest 53.24% <ø> (+0.11%) ⬆️
test-optimization-cucumber-oldest-7.0.0 50.48% <ø> (+0.11%) ⬆️
test-optimization-cypress-latest-12.0.0-commonJS 48.86% <ø> (-0.70%) ⬇️
test-optimization-cypress-latest-12.0.0-esm 49.66% <ø> (+0.07%) ⬆️
test-optimization-cypress-latest-14.5.4-commonJS 49.20% <ø> (-0.20%) ⬇️
test-optimization-cypress-latest-14.5.4-esm 49.50% <ø> (+0.07%) ⬆️
test-optimization-cypress-latest-latest-commonJS 49.96% <ø> (+0.07%) ⬆️
test-optimization-cypress-latest-latest-esm 50.00% <ø> (+0.19%) ⬆️
test-optimization-cypress-oldest-12.0.0-commonJS 49.67% <ø> (+0.07%) ⬆️
test-optimization-cypress-oldest-12.0.0-esm 49.70% <ø> (+0.07%) ⬆️
test-optimization-cypress-oldest-14.5.4-commonJS 49.51% <ø> (+0.34%) ⬆️
test-optimization-cypress-oldest-14.5.4-esm 49.41% <ø> (-0.06%) ⬇️
test-optimization-jest-latest-latest 55.57% <ø> (+1.98%) ⬆️
test-optimization-jest-latest-oldest 54.42% <ø> (+4.41%) ⬆️
test-optimization-jest-oldest-latest 55.58% <ø> (+0.08%) ⬆️
test-optimization-jest-oldest-oldest 54.45% <ø> (+0.08%) ⬆️
test-optimization-mocha-latest-latest 53.76% <ø> (+0.08%) ⬆️
test-optimization-mocha-latest-oldest 51.44% <ø> (+0.08%) ⬆️
test-optimization-mocha-oldest-latest 53.82% <ø> (+0.08%) ⬆️
test-optimization-mocha-oldest-oldest 51.40% <ø> (+0.10%) ⬆️
test-optimization-playwright-latest-latest-playwright-active-test-span 44.36% <ø> (+0.13%) ⬆️
test-optimization-playwright-latest-latest-playwright-atr 43.16% <ø> (?)
test-optimization-playwright-latest-latest-playwright-efd 43.57% <ø> (+0.09%) ⬆️
test-optimization-playwright-latest-latest-playwright-final-status 43.61% <ø> (+0.08%) ⬆️
test-optimization-playwright-latest-latest-playwright-impacted-tests 43.10% <ø> (ø)
test-optimization-playwright-latest-latest-playwright-reporting 43.21% <ø> (+0.11%) ⬆️
test-optimization-playwright-latest-latest-playwright-test-management 44.78% <ø> (+0.10%) ⬆️
test-optimization-playwright-latest-oldest-playwright-active-test-span 44.31% <ø> (?)
test-optimization-playwright-latest-oldest-playwright-atr 43.24% <ø> (+0.10%) ⬆️
test-optimization-playwright-latest-oldest-playwright-efd 43.47% <ø> (+0.06%) ⬆️
test-optimization-playwright-latest-oldest-playwright-final-status 43.54% <ø> (?)
test-optimization-playwright-latest-oldest-playwright-impacted-tests 43.01% <ø> (ø)
test-optimization-playwright-latest-oldest-playwright-reporting 42.99% <ø> (+0.06%) ⬆️
test-optimization-playwright-latest-oldest-playwright-test-management 44.72% <ø> (+0.10%) ⬆️
test-optimization-playwright-oldest-latest-playwright-active-test-span 44.54% <ø> (+0.11%) ⬆️
test-optimization-playwright-oldest-latest-playwright-atr 43.19% <ø> (+0.09%) ⬆️
test-optimization-playwright-oldest-latest-playwright-efd 43.58% <ø> (+0.09%) ⬆️
test-optimization-playwright-oldest-latest-playwright-final-status 43.67% <ø> (+0.09%) ⬆️
test-optimization-playwright-oldest-latest-playwright-impacted-tests 43.13% <ø> (ø)
test-optimization-playwright-oldest-latest-playwright-reporting 43.21% <ø> (+0.09%) ⬆️
test-optimization-playwright-oldest-latest-playwright-test-management 44.82% <ø> (+0.09%) ⬆️
test-optimization-playwright-oldest-oldest-playwright-active-test-span 44.34% <ø> (+0.27%) ⬆️
test-optimization-playwright-oldest-oldest-playwright-atr 43.27% <ø> (+0.10%) ⬆️
test-optimization-playwright-oldest-oldest-playwright-efd 43.51% <ø> (+0.09%) ⬆️
test-optimization-playwright-oldest-oldest-playwright-final-status 43.57% <ø> (+0.08%) ⬆️
test-optimization-playwright-oldest-oldest-playwright-impacted-tests 43.05% <ø> (ø)
test-optimization-playwright-oldest-oldest-playwright-reporting 43.03% <ø> (+0.09%) ⬆️
test-optimization-playwright-oldest-oldest-playwright-test-management 44.76% <ø> (+0.10%) ⬆️
test-optimization-selenium-latest 45.53% <ø> (+0.06%) ⬆️
test-optimization-selenium-oldest 44.99% <ø> (-0.20%) ⬇️
test-optimization-testopt-active 48.66% <ø> (+0.12%) ⬆️
test-optimization-testopt-latest 48.66% <ø> (+0.12%) ⬆️
test-optimization-testopt-maintenance 48.70% <ø> (+0.12%) ⬆️
test-optimization-testopt-oldest 49.64% <ø> (+0.10%) ⬆️
test-optimization-vitest-latest 51.01% <ø> (+1.19%) ⬆️
test-optimization-vitest-oldest 48.15% <ø> (+0.32%) ⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
  • 📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

@dd-octo-sts
Copy link
Copy Markdown
Contributor

dd-octo-sts Bot commented May 30, 2026

Overall package size

Self size: 6.06 MB
Deduped: 7.1 MB
No deduping: 7.1 MB

Dependency sizes | name | version | self size | total size | |------|---------|-----------|------------| | import-in-the-middle | 3.0.1 | 82.56 kB | 817.39 kB | | opentracing | 0.14.7 | 194.81 kB | 194.81 kB | | dc-polyfill | 0.1.11 | 25.74 kB | 25.74 kB |

🤖 This report was automatically generated by heaviest-objects-in-the-universe

@datadog-official
Copy link
Copy Markdown

datadog-official Bot commented May 30, 2026

Pipelines  Tests

Fix all issues with BitsAI

⚠️ Warnings

🚦 4 Pipeline jobs failed

Project | lint   View in Datadog   GitHub Actions

🔧 Fix in code (Fix with Cursor). Job failed due to linting errors. Missing path warnings: 15.

All Green | all-green   View in Datadog   GitHub Actions

🔄 Retry job. This looks flaky and may succeed on retry. One or more jobs failed during the workflow execution after retries.

CodeQL | Analyze (javascript)   View in Datadog   GitHub Actions

🔄 Retry job. This looks flaky and may succeed on retry. API rate limit exceeded for installation. Request ID: 1C10:129148:38DAA81:D9A9417:6A1C8AE6

View all 4 failed jobs.

ℹ️ Info

No other issues found (see more)

🧪 All tests passed
❄️ No new flaky tests detected

🎯 Code Coverage (details)
Patch Coverage: 100.00%
Overall Coverage: 86.62% (-0.01%)

Useful? React with 👍 / 👎

This comment will be updated automatically if new data arrives.
🔗 Commit SHA: b1d2306 | Docs | Datadog PR Page | Give us feedback!

@pr-commenter
Copy link
Copy Markdown

pr-commenter Bot commented May 31, 2026

Benchmarks

Benchmark execution time: 2026-05-31 19:35:56

Comparing candidate commit b1d2306 in PR branch BridgeAR/2026-05-31-benchmark-platform-job with baseline commit 03116df in branch master.

Found 1 performance improvements and 1 performance regressions! Performance is the same for 1485 metrics, 106 unstable metrics.

scenario:plugin-bluebird-with-tracer-20

  • 🟩 max_rss_usage [-4.740MB; -3.104MB] or [-5.821%; -3.811%]

scenario:scope-manager-scope_enabled-20

  • 🟥 instructions [+117.1M instructions; +254.4M instructions] or [+4.573%; +9.937%]

…hip)

Foundation the per-bench PRs stack on:
1. startup-guard.js: the load-vs-loop share assertion every bench calls.
2. runall.sh: auto-shard from variant count x available cores, failing with the
   exact SPLITS to configure.
3. CODEOWNERS: route each benchmark directory to its owning team.
@BridgeAR BridgeAR changed the title ci(benchmarks): report at 2% and tolerate new benchmarks on the baseline bench(sirun): benchmark harness foundation May 31, 2026
Their benches still run on this branch, so narrowing the PLUGINS install set must
pair with removing the benches -- which happens in the integrations PR, not the
harness. Restores the master plugin list to avoid a missing-module failure.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant