gc: priority scheduling with dual watermarks and cross-scan quota by xiaoxichen · Pull Request #436 · eBay/HomeObject

xiaoxichen · 2026-06-17T08:23:44Z

Sort eligible chunks by garbage ratio (desc) before submission so the most garbage-heavy chunks are always GC'd first
Add gc_garbage_rate_threshold_low (default 30%) as a low watermark; chunks between the two watermarks consume at most half the quota
Track m_pending_normal_gc_task_count in pdev_gc_actor to reflect tasks queued or running in m_gc_executor across scan cycles; previous code only capped submissions per scan, allowing unbounded queue growth
scan_chunks_for_gc now skips a pdev entirely when already at quota, and derives low_tier_cap proportionally from remaining_capacity
Add ADR docs/adr/gc-priority-scheduling.md
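The selection rules in the description can be sketched roughly as follows. This is a minimal illustration, not the code in gc_manager.cpp: `select_for_gc`, `Watermarks`, the layout of `ChunkGCInfo`, the 80% high watermark used in the usage note, inclusive threshold comparisons, and rounding "half" down are all assumptions made for the example. Only the 30% low-watermark default and the names `remaining_capacity` and `low_tier_cap` come from the PR text.

```cpp
#include <algorithm>
#include <cstdint>
#include <vector>

// Hypothetical per-chunk record; only the type name appears in the review thread.
struct ChunkGCInfo {
    uint32_t chunk_id;
    uint32_t ratio_pct;  // garbage ratio in percent
};

struct Watermarks {
    uint32_t high_pct;  // assumed: the PR text gives no name or default for the high watermark
    uint32_t low_pct;   // gc_garbage_rate_threshold_low, default 30 per the PR description
};

// Returns the chunks to submit in this scan, most garbage-heavy first.
// `pending` models m_pending_normal_gc_task_count (tasks queued or running).
std::vector<ChunkGCInfo> select_for_gc(std::vector<ChunkGCInfo> chunks, Watermarks wm,
                                       uint32_t max_task_num, uint32_t pending) {
    std::vector<ChunkGCInfo> picked;
    if (pending >= max_task_num) return picked;  // already at quota: skip this pdev entirely

    const uint32_t remaining_capacity = max_task_num - pending;
    const uint32_t low_tier_cap = remaining_capacity / 2;  // assumption: "half" rounds down

    // Chunks below the low watermark are not eligible at all.
    chunks.erase(std::remove_if(chunks.begin(), chunks.end(),
                                [&wm](const ChunkGCInfo& c) { return c.ratio_pct < wm.low_pct; }),
                 chunks.end());

    // Highest garbage ratio first.
    std::sort(chunks.begin(), chunks.end(),
              [](const ChunkGCInfo& a, const ChunkGCInfo& b) { return a.ratio_pct > b.ratio_pct; });

    uint32_t low_used = 0;
    for (const auto& c : chunks) {
        if (picked.size() >= remaining_capacity) break;
        if (c.ratio_pct < wm.high_pct) {
            // Between the two watermarks: limited to low_tier_cap slots.
            // The list is sorted, so every later chunk is low-tier as well.
            if (low_used >= low_tier_cap) break;
            ++low_used;
        }
        picked.push_back(c);
    }
    return picked;
}
```

With watermarks of 80/30, `max_task_num = 4` and nothing pending, a pdev holding only mid-range chunks (say 50%, 40% and 35%) gets two tasks rather than three, because the low tier may use at most half of the four free slots.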

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Signed-off-by: Xiaoxi Chen <xiaoxchen@ebay.com>

codecov-commenter · 2026-06-17T15:48:11Z

Codecov Report

❌ Patch coverage is 66.00000% with 17 lines in your changes missing coverage. Please review.
⚠️ Please upload report for BASE (stable/v4.x@e1c23e1). Learn more about missing BASE report.

Files with missing lines	Patch %	Lines
src/lib/homestore_backend/gc_manager.cpp	64.58%	13 Missing and 4 partials ⚠️

Additional details and impacted files

@@              Coverage Diff               @@
##             stable/v4.x     #436   +/-   ##
==============================================
  Coverage               ?   54.23%           
==============================================
  Files                  ?       36           
  Lines                  ?     5423           
  Branches               ?      686           
==============================================
  Hits                   ?     2941           
  Misses                 ?     2175           
  Partials               ?      307

JacksonYao287 · 2026-06-18T09:56:46Z

+            eligible.push_back({chunk_id, ratio_pct});
+        }
+
+        // Sort eligible chunks by garbage ratio descending so the most garbage-heavy chunks are GC'd first.


This is a top-K problem.
I suggest using a heap (priority_queue) with a capacity of remaining_capacity, so that we hold at most remaining_capacity ChunkGCInfo entries in memory. For each new ChunkGCInfo, we then only need to compare it with the top of the heap.

xiaoxichen · Jun 21, 2026

IMO it isn't worth those lines of code, considering the maximum chunk count is 32K.

JacksonYao287 · Jun 23, 2026

Better to have it, since it is not complicated and will also make the code simpler; for example, the loop over the entire chunk collection will no longer be involved.
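For readers following the thread, the bounded-heap idea can be sketched as below. It is an illustration only: `top_k_by_garbage`, `MinRatioOnTop`, and the fields of `ChunkGCInfo` are hypothetical names chosen for the example, and the real change may differ in detail. A min-heap keeps the weakest retained candidate on top, so each new chunk needs one comparison and memory stays at O(K).

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <queue>
#include <vector>

// Hypothetical per-chunk record; only the type name appears in the review thread.
struct ChunkGCInfo {
    uint32_t chunk_id;
    uint32_t ratio_pct;  // garbage ratio in percent
};

// Comparator that turns std::priority_queue (a max-heap by default) into a
// min-heap on ratio_pct: the smallest retained ratio sits at top().
struct MinRatioOnTop {
    bool operator()(const ChunkGCInfo& a, const ChunkGCInfo& b) const {
        return a.ratio_pct > b.ratio_pct;
    }
};

// Keeps at most k candidates while scanning, then returns them sorted by
// garbage ratio, highest first.
std::vector<ChunkGCInfo> top_k_by_garbage(const std::vector<ChunkGCInfo>& chunks, std::size_t k) {
    std::vector<ChunkGCInfo> out;
    if (k == 0) return out;

    std::priority_queue<ChunkGCInfo, std::vector<ChunkGCInfo>, MinRatioOnTop> heap;
    for (const auto& c : chunks) {
        if (heap.size() < k) {
            heap.push(c);
        } else if (c.ratio_pct > heap.top().ratio_pct) {
            heap.pop();  // evict the weakest retained candidate
            heap.push(c);
        }
    }

    out.reserve(heap.size());
    while (!heap.empty()) {  // pops come out in ascending ratio order
        out.push_back(heap.top());
        heap.pop();
    }
    std::reverse(out.begin(), out.end());  // highest ratio first
    return out;
}
```

Each push or pop costs O(log K), so a scan over N chunks is O(N log K) instead of the O(N log N) of a full sort. At 32K chunks either is cheap, which is the trade-off the two commenters are weighing.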

Signed-off-by: Xiaoxi Chen <xiaoxchen@ebay.com>

- scan_chunks_for_gc: replace std::vector + std::sort with a bounded std::priority_queue (min-heap) of capacity max_task_num. Memory is now O(K) per pdev regardless of chunk count. K is fixed at max_task_num so we always retain enough candidates if capacity opens up later in the scan; submission is still gated by the dynamic remaining_capacity / low_tier_cap.
- add_gc_task: move m_pending_normal_gc_task_count.fetch_add(1) BEFORE m_gc_executor->add() so an immediately-completing task (which decrements via ~gc_task_guard) cannot underflow the counter.
- process_gc_task: when we early-return because the chunk is not in GC state, gc_task_guard is never constructed, so its decrement never fires. Decrement m_pending_normal_gc_task_count for normal-priority tasks here to mirror the guard's bookkeeping and prevent a permanent quota leak.
- docs/adr: update ADR to describe top-K heap selection instead of full sort.
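The two counter fixes in this commit message can be illustrated with a small model. Everything here is a sketch under assumptions: `PdevGcActorSketch`, `TaskGuard`, and the executor passed as a `std::function` are hypothetical stand-ins for pdev_gc_actor, gc_task_guard, and m_gc_executor, and only normal-priority tasks are modelled.

```cpp
#include <atomic>
#include <cstdint>
#include <functional>

// Hypothetical model of the pending-task bookkeeping described above.
struct PdevGcActorSketch {
    std::atomic<uint32_t> pending{0};  // stands in for m_pending_normal_gc_task_count

    // Stand-in for gc_task_guard: releases one quota slot when the task ends.
    struct TaskGuard {
        std::atomic<uint32_t>& counter;
        explicit TaskGuard(std::atomic<uint32_t>& c) : counter(c) {}
        ~TaskGuard() { counter.fetch_sub(1); }
        TaskGuard(const TaskGuard&) = delete;
        TaskGuard& operator=(const TaskGuard&) = delete;
    };

    bool process_gc_task(bool chunk_in_gc_state) {
        if (!chunk_in_gc_state) {
            // Early return: the guard is never constructed, so the slot must be
            // released by hand or it would stay counted forever.
            pending.fetch_sub(1);
            return false;
        }
        TaskGuard guard{pending};
        // ... the actual GC work would run here ...
        return true;
    }

    // `exec` models the executor; it may run the job inline or defer it.
    void add_gc_task(const std::function<void(std::function<void()>)>& exec,
                     bool chunk_in_gc_state) {
        // Count the task BEFORE handing it off. If the increment came after,
        // a job that completes immediately would decrement first and wrap the
        // unsigned counter below zero.
        pending.fetch_add(1);
        exec([this, chunk_in_gc_state] { process_gc_task(chunk_in_gc_state); });
    }
};
```

With an executor that runs jobs inline, the counter returns to zero after every call on both the normal and the early-return path. With a deferring executor it stays at one until the job runs, which is what lets the next scan see the queued task.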

xiaoxichen · 2026-06-23T09:11:37Z

@JacksonYao287 addressed

xiaoxichen requested review from Besroy and JacksonYao287 June 17, 2026 08:25

xiaoxichen force-pushed the gc-sort branch from 6d93085 to 6fca0fe Compare June 17, 2026 15:03

JacksonYao287 reviewed Jun 21, 2026

Address review

6513ed1

Signed-off-by: Xiaoxi Chen <xiaoxchen@ebay.com>

JacksonYao287 reviewed Jun 23, 2026

Comment thread src/lib/homestore_backend/gc_manager.cpp

Comment thread src/lib/homestore_backend/gc_manager.cpp Outdated

xiaoxichen force-pushed the gc-sort branch from 9ff8b3a to 27d3b42 Compare June 23, 2026 08:39

xiaoxichen wants to merge 3 commits into eBay:stable/v4.x from xiaoxichen:gc-sort

3 participants
