Use compressed sort on unsorted chunks if possible #9133

natalya-aksman · 2026-01-15T18:45:06Z

Fixes #9116

Use compressed sort on unordered chunks when we sort on segmentby columns only.

Implemented in a simpler, different way from #9128 after discussions on that PR.

github-actions · 2026-01-15T18:45:55Z

@melihmutlu, @akuzm: please review this pull request.

Powered by pull-review

codecov · 2026-01-15T18:54:34Z

Codecov Report

❌ Patch coverage is 88.88889% with 1 line in your changes missing coverage. Please review.
✅ Project coverage is 82.53%. Comparing base (d347098) to head (8082a05).

Files with missing lines	Patch %	Lines
tsl/src/nodes/columnar_scan/columnar_scan.c	87.50%	0 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #9133      +/-   ##
==========================================
+ Coverage   82.45%   82.53%   +0.08%     
==========================================
  Files         243      243              
  Lines       47938    47917      -21     
  Branches    12234    12232       -2     
==========================================
+ Hits        39525    39547      +22     
- Misses       3544     3556      +12     
+ Partials     4869     4814      -55

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

svenklemm · 2026-01-15T19:23:47Z

tsl/test/expected/compress_unordered_sort.out

+ d2     | A
+ d2     | C
+
+:PREFIX select device, sensor, count(*), max(sensor) from metrics group by device, sensor order by 1,2;


This is expected not to use skipscan, right? Could you add a comment describing expected behaviour

Oh nvm this is not only about SkipScan, would be good to have some negative example where we cant use the optimization atm because of filters on time column

We can use this optimization with filters on time column though.
Because if we only care about sorting on segmentby columns, it doesn't matter if we have unsorted time or other non-segmentby columns in the result or during the aggregation or SkipScan.

@akuzm pointed it out in #9128 (comment), and it is actually correct.

I.e. we don't have any limits on using compressed sort on unordered chunks if a sort is on segmentby columns only. The only limit is when the sort is on other than segmentby columns.

akuzm · 2026-01-16T11:12:50Z

tsl/test/sql/compress_unordered_sort.sql

+SET enable_bitmapscan=0;
+SET enable_seqscan=0;
+
+SET timezone TO PST8PDT;


can we use the default test timezone for this?

Will do. It's the same timezone for the column time, so the column time can also be created with default timezone.

akuzm · 2026-01-16T11:13:21Z

tsl/test/sql/compress_unordered_sort.sql

+
+SET max_parallel_workers_per_gather = 0;
+SET enable_bitmapscan=0;
+SET enable_seqscan=0;


This is not strictly tied to index scans, so many test queries would be just as fine with sort over seq scan I think.

SkipScan is tied to index scans, that was the original issue i.e. we could not use compressed sort on unordered chunks, therefore could not use IndexScan and therefore could not use SkipScan.

So here we show that we can use IndexScan and not do any extra sorting.

akuzm · 2026-01-16T11:14:52Z

tsl/src/nodes/columnar_scan/columnar_scan.c

+			/* Can use compressed sort on segmentby cols for unordered chunks as well,
+			 * unless this option is turned OFF
+			 */
+			if (!ts_guc_enable_compressed_unordered_sort && ts_chunk_is_unordered(chunk))


This change looks simple and obviously correct, I think it might be OK w/o a GUC.

Sounds good, will remove the guc.

akuzm · 2026-01-16T11:18:01Z

tsl/src/nodes/columnar_scan/columnar_scan.c

+		}
+
+		/*
+		 * Cannot push down sort on (segmentby + non-segmentby) columns if the chunk is unordered


Might be good to extend the commend slightly, because the name "unordered" is somewhat confusing. The reason why we can't use sort pushdown is that the "unordered" chunks have batches that overlap on the orderby columns axis.

Will clarify. Basically the only difference between pushing down sort into ordered vs unordered chunks is that for unordered chunk we cannot push down sort with keys on orderby columns.

natalya-aksman requested a review from a team January 15, 2026 18:45

github-actions bot assigned natalya-aksman Jan 15, 2026

github-actions bot requested review from akuzm and melihmutlu January 15, 2026 18:45

natalya-aksman requested review from antekresic and svenklemm and removed request for melihmutlu January 15, 2026 18:46

Use compressed sort on unsorted chunks if possible

8082a05

natalya-aksman force-pushed the use_compressed_sort_on_unordered_chunks branch from a99c727 to 8082a05 Compare January 15, 2026 19:05

natalya-aksman added this to the v2.25.0 milestone Jan 15, 2026

natalya-aksman added enhancement An enhancement to an existing feature for functionality Columnstore Related to the column store / compression skip-scan labels Jan 15, 2026

svenklemm reviewed Jan 15, 2026

View reviewed changes

akuzm reviewed Jan 16, 2026

View reviewed changes

akuzm approved these changes Jan 16, 2026

View reviewed changes

akuzm reviewed Jan 16, 2026

View reviewed changes

Use compressed sort on unsorted chunks if possible #9133

Are you sure you want to change the base?

Use compressed sort on unsorted chunks if possible #9133

Conversation

natalya-aksman commented Jan 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Jan 15, 2026

Uh oh!

codecov bot commented Jan 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

natalya-aksman commented Jan 15, 2026 •

edited

Loading

codecov bot commented Jan 15, 2026 •

edited

Loading