Skip to content

Introduce table scan#96

Open
luoyuxia wants to merge 1 commit intoapache:mainfrom
luoyuxia:introduce-table-scan
Open

Introduce table scan#96
luoyuxia wants to merge 1 commit intoapache:mainfrom
luoyuxia:introduce-table-scan

Conversation

@luoyuxia
Copy link
Contributor

@luoyuxia luoyuxia commented Feb 15, 2026

Purpose

Linked issue: close #105

Brief change log

Tests

API and Format

Documentation

@luoyuxia luoyuxia force-pushed the introduce-table-scan branch from 91f2bda to b6b9f76 Compare March 3, 2026 23:22
@luoyuxia luoyuxia force-pushed the introduce-table-scan branch from b6b9f76 to 065c688 Compare March 3, 2026 23:30
bucket: i32,
bucket_path: String,
total_buckets: Option<i32>,
total_buckets: i32,
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

java use nuaable Integer for potential null value. But I think it won't be null in new version paimon. So, I remove Option.
For reference: apache/paimon#5537 may be a related pr.

let base_path = table_path;
let mut splits = Vec::new();

for ((_partition, bucket), group_entries) in groups {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@luoyuxia Hi, for partitioned tables, _partition is dropped right after grouping, but the split is later built with BinaryRow::new(0) and "{table_path}/bucket-{bucket}". This is fine for unpartitioned tables, but for partitioned tables, it loses partition identity, and bucket_path also misses the partition directory prefix (k=v/...). It would be better to reconstruct the partition from the grouped partition bytes, build splits with the real partition, and generate bucket_path as partition_path/bucket-{bucket}. What do you think of this?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

introduce table scan

2 participants