@Fokko Fokko commented Apr 8, 2025

With apache/iceberg#12593, row lineage is required, and Spark uses Iceberg Java 1.8.0, which does not write the required `first-row-id` field in the snapshot.

Therefore, I think it would be good to use the 1.9.0-SNAPSHOT build for now until 1.9.0 is released. We can easily revert the PR once that's done.

Closes #1898

@Fokko Fokko merged commit aeb4493 into apache:main Apr 8, 2025
7 checks passed
Fokko commented Apr 8, 2025

Thanks @nastra


# Download AWS bundle
- RUN curl --retry 5 -s https://repo1.maven.org/maven2/org/apache/iceberg/iceberg-aws-bundle/${ICEBERG_VERSION}/iceberg-aws-bundle-${ICEBERG_VERSION}.jar \
+ RUN curl --retry 5 -s https://repository.apache.org/content/groups/snapshots/org/apache/iceberg/iceberg-aws-bundle/1.9.0-SNAPSHOT/iceberg-aws-bundle-1.9.0-20250408.002722-86.jar \
ah, this doesn't exist anymore: https://repository.apache.org/content/groups/snapshots/org/apache/iceberg/iceberg-aws-bundle/1.9.0-SNAPSHOT/

which causes CI to fail with

IllegalArgumentException: Cannot initialize FileIO implementation org.apache.iceberg.aws.s3.S3FileIO: Cannot find constructor for interface org.apache.iceberg.io.FileIO
	Missing org.apache.iceberg.aws.s3.S3FileIO [java.lang.NoClassDefFoundError: software/amazon/awssdk/services/s3/model/S3Exception]
make: *** [Makefile:61: test-integration] Error 1
Error: Process completed with exit code 2.

cc @Fokko
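
One possible way to catch this kind of failure before the integration tests run is to check that the downloaded file is a readable JAR containing the class named in the `NoClassDefFoundError`. The sketch below is only an illustration: the helper name `jar_contains` is hypothetical, and it assumes the bundle ships the AWS SDK classes under their original package path, as the error message suggests.

```python
import io
import zipfile

# Class named in the NoClassDefFoundError above, written as a path inside the JAR.
REQUIRED_ENTRY = "software/amazon/awssdk/services/s3/model/S3Exception.class"


def jar_contains(jar_bytes: bytes, entry: str = REQUIRED_ENTRY) -> bool:
    """Return True if jar_bytes is a readable ZIP archive that contains entry.

    Hypothetical helper for illustration; not part of the PR.
    """
    buffer = io.BytesIO(jar_bytes)
    if not zipfile.is_zipfile(buffer):
        # For example, an HTML error page that was saved under a .jar name.
        return False
    with zipfile.ZipFile(buffer) as jar:
        return entry in jar.namelist()
```

A build step could read the downloaded file with `pathlib.Path(...).read_bytes()` and stop early when `jar_contains` returns `False`, instead of failing later inside Spark.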

interestingly, iceberg-spark-runtime contains both 20250408 and 20250409 but iceberg-aws-bundle only has 20250409

@Fokko Fokko Apr 13, 2025

> interestingly, iceberg-spark-runtime contains both 20250408 and 20250409 but iceberg-aws-bundle only has 20250409

Not sure what's going on there 🤔

The snapshot JARs are cleaned up weekly. Since 1.9.0 is currently being voted on, I suggest bumping it to a version that is still available. I've created a PR here: #1907
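
Because timestamped snapshot JARs are removed over time, a hard-coded file name such as `iceberg-aws-bundle-1.9.0-20250408.002722-86.jar` eventually stops resolving. A minimal sketch of an alternative, assuming the snapshot repository publishes a standard `maven-metadata.xml` in the version directory: read the latest timestamp and build number from that file and build the JAR URL from them. The function name `latest_snapshot_jar_url` is hypothetical, and fetching the metadata is left out so the example stays offline.

```python
import xml.etree.ElementTree as ET

# Snapshot repository root taken from the Dockerfile line under review.
SNAPSHOT_ROOT = "https://repository.apache.org/content/groups/snapshots"
SNAPSHOT_SUFFIX = "-SNAPSHOT"


def latest_snapshot_jar_url(metadata_xml: str, root: str = SNAPSHOT_ROOT) -> str:
    """Build the URL of the newest timestamped JAR from maven-metadata.xml.

    Hypothetical helper; assumes the standard Maven snapshot metadata layout
    (groupId, artifactId, version, versioning/snapshot/timestamp, buildNumber).
    """
    meta = ET.fromstring(metadata_xml)
    group = meta.findtext("groupId")
    artifact = meta.findtext("artifactId")
    version = meta.findtext("version")  # e.g. "1.9.0-SNAPSHOT"
    timestamp = meta.findtext("versioning/snapshot/timestamp")
    build = meta.findtext("versioning/snapshot/buildNumber")
    if None in (group, artifact, version, timestamp, build):
        raise ValueError("metadata is missing a required snapshot field")
    if not version.endswith(SNAPSHOT_SUFFIX):
        raise ValueError(f"not a snapshot version: {version}")

    # Timestamped files replace the "-SNAPSHOT" suffix with "<timestamp>-<build>".
    base = version[: -len(SNAPSHOT_SUFFIX)]
    directory = f"{root}/{group.replace('.', '/')}/{artifact}/{version}"
    return f"{directory}/{artifact}-{base}-{timestamp}-{build}.jar"
```

The trade-off is reproducibility: resolving the newest snapshot at build time avoids dead links, but two builds on different days may pick up different JARs.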

@Fokko Fokko deleted the fd-fix-ci branch April 13, 2025 07:00
Fokko added a commit to Fokko/iceberg-python that referenced this pull request Apr 28, 2025
gabeiglio pushed a commit to Netflix/iceberg-python that referenced this pull request Aug 13, 2025
Successfully merging this pull request may close these issues.

Failing CI because of first-row-id
