[spark] Add startup mode for batch read #2532
Conversation
@YannByron @wuchong please help take a look, thank you!
wuchong
left a comment
Thanks, @Yohahaha! It looks like no new tests have been added; should we include some tests for the new configuration option?
Also, as a best practice, we recommend first creating a dedicated issue to describe the feature and proposed APIs before submitting a pull request. The PR can then be linked to that issue. This helps us better track progress and maintain visibility across all subtasks of the umbrella initiative.
```scala
ConfigBuilder
  .key("scan.startup.mode")
  .stringType()
  .defaultValue(StartUpMode.LATEST.toString)
```
Should we use the default FULL mode to stay aligned with the Flink connector?
Using LATEST by default may result in empty results if the user doesn’t explicitly specify a startup mode for the query.
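The suggested change could look like this (a sketch reusing the `ConfigBuilder` chain from the diff; the `SCAN_STARTUP_MODE` field name is an assumption, only the default value changes):

```scala
// Default to FULL to stay aligned with the Flink connector; with LATEST a
// batch query that doesn't set the option would silently return no rows.
val SCAN_STARTUP_MODE = ConfigBuilder
  .key("scan.startup.mode")
  .stringType()
  .defaultValue(StartUpMode.FULL.toString)
```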
```scala
val scanRecords = logScanner.poll(POLL_TIMEOUT)
if ((scanRecords == null || scanRecords.isEmpty) && currentOffset < flussPartition.stopOffset) {
  throw new IllegalStateException(s"No more data from fluss server," +
    s" but current offset $currentOffset not reach the stop offset ${flussPartition.stopOffset}")
```
logScanner.poll() may return empty results when the Fluss server is undergoing recovery, restart, or rebalance. Given that the current POLL_TIMEOUT is set to a very short duration (100ms), this scenario is highly likely to occur.
Currently, the source immediately throws an exception if logScanner.poll() returns no records, which makes it unstable during Fluss server failover events.
A straightforward fix is to increase the POLL_TIMEOUT to 60 seconds. This means the source will wait up to 60 seconds for data during transient server unavailability. If the Fluss server still hasn’t recovered within that window, we can then throw an exception to alert users.
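One way to implement that suggestion is a deadline loop rather than a single long poll, so short polls keep retrying until either data arrives or the 60-second window expires. This is only a sketch: `logScanner`, `POLL_TIMEOUT`, `currentOffset`, and `flussPartition` come from the diff above, while `PollDeadlineMs`, the `pollWithDeadline` name, and the `ScanRecords` return type are assumptions.

```scala
// Assumed deadline, following the review suggestion of waiting up to 60s.
private val PollDeadlineMs = 60000L

private def pollWithDeadline(): ScanRecords = {
  val deadline = System.currentTimeMillis() + PollDeadlineMs
  var records = logScanner.poll(POLL_TIMEOUT)
  // Ride out transient unavailability (recovery, restart, rebalance) instead
  // of failing on the first empty 100ms poll.
  while ((records == null || records.isEmpty) && System.currentTimeMillis() < deadline) {
    records = logScanner.poll(POLL_TIMEOUT)
  }
  if ((records == null || records.isEmpty) && currentOffset < flussPartition.stopOffset) {
    throw new IllegalStateException(
      s"No data from Fluss server within ${PollDeadlineMs}ms, but current offset " +
        s"$currentOffset has not reached the stop offset ${flussPartition.stopOffset}")
  }
  records
}
```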
```scala
if (currentRecords.hasNext) {
  val scanRecord = currentRecords.next()
  currentRow = convertToSparkRow(scanRecord)
  currentOffset += 1
```
We can't simply increment by 1; we should use `currentOffset = scanRecord.logOffset() + 1` instead, because some record batches advance log offsets without containing any records. Simply adding 1 can lead to missing data.
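Applied to the snippet above, the suggested fix would look like this (a sketch over the same names from the diff; `logOffset()` on the scan record is the accessor the reviewer cites):

```scala
if (currentRecords.hasNext) {
  val scanRecord = currentRecords.next()
  currentRow = convertToSparkRow(scanRecord)
  // Advance to one past the record's own log offset: record batches can
  // consume offsets without carrying records, so `+= 1` would fall behind
  // the real position and skip data.
  currentOffset = scanRecord.logOffset() + 1
```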
```scala
logScanner.subscribeFromBeginning(bucketId)
logScanner.subscribe(bucketId, flussPartition.startOffset)
}
pollMoreRecords()
```
When both the start and end offsets are set to LATEST, the stop offset may be less than or equal to the start offset. In such cases, attempting to poll records will cause pollMoreRecords() to throw an exception.
To avoid this, we should explicitly validate that the start offset is strictly less than the stop offset before initiating polling.
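A sketch of that guard, using the names from the diff (`finished` is a hypothetical flag marking the partition as exhausted; the real reader may signal emptiness differently):

```scala
// Only subscribe and poll when the offset range is non-empty; when both
// start and stop resolved to LATEST, startOffset >= stopOffset and polling
// would make pollMoreRecords() throw.
if (flussPartition.startOffset < flussPartition.stopOffset) {
  logScanner.subscribe(bucketId, flussPartition.startOffset)
  pollMoreRecords()
} else {
  finished = true // hypothetical: nothing to read for this partition
}
```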
```scala
tablePath: TablePath,
tableInfo: TableInfo,
readSchema: StructType,
startOffsetsInitializer: OffsetsInitializer,
```
The startOffsetsInitializer is not used in FlussUpsertBatch.
```scala
override def toBatch: Batch = {
  new FlussUpsertBatch(tablePath, tableInfo, readSchema, options, flussConfig)
  val startOffsetsInitializer = FlussScan.startOffsetsInitializer(options)
```
The startOffsetsInitializer is not used in FlussUpsertBatch. For FlussUpsertBatch (primary key tables), we currently only support the FULL startup mode, which reads the kv snapshot first and then switches to the corresponding log offsets. So we should check that the startup mode is FULL and otherwise throw an unsupported exception.
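A sketch of the guard the reviewer describes. The diff only shows `startOffsetsInitializer`, so the `FlussScan.startupMode(options)` accessor and the exact `StartUpMode` comparison are assumptions about this PR's API:

```scala
override def toBatch: Batch = {
  // Primary key tables read the kv snapshot first, then switch to log
  // offsets, so only FULL startup mode is meaningful for batch read.
  val mode = FlussScan.startupMode(options) // hypothetical accessor
  if (mode != StartUpMode.FULL) {
    throw new UnsupportedOperationException(
      s"Primary key tables only support FULL startup mode for batch read, but got $mode")
  }
  new FlussUpsertBatch(tablePath, tableInfo, readSchema, options, flussConfig)
}
```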
Purpose
Add a new option `start.up.mode` to read from different offsets in Fluss. This PR only changes batch-read related classes.
Linked issue: close #2549
Brief change log
Tests
API and Format
Documentation