-
Notifications
You must be signed in to change notification settings - Fork 307
use the latest release apache-datasketches-bigquery-1.1.1 #492
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
use the latest release apache-datasketches-bigquery-1.1.1 #492
Conversation
/gcbrun |
@AlexanderSaydakov apache/datasketches-bigquery#142 introduced breaking changes to the build process for this repo. Can you revert that PR and instead specify that JS_BUCKET values must not end in forward slash? You can have JS_BUCKET specify a path to a folder but you just need to leave out the last forward slash. |
I think I tried that and it did not work, but the modified Makefiles worked for both cases (bucket and folder). I will revisit this. |
The build pipeline for datasketches in bigquery-utils repo passes a folder to the JS_BUCKET variable so I'm certain it works with folders (leaving out the trailing forward slash). Let me know if you can reproduce the issue you saw that led to the PR, I can help debug. |
I tested the makefiles when I made the change and re-tested now with JS_BUCKET not having the trailing slash in both cases: gs://bucket and gs://bucket/folder. Works for me. |
You'll need to now revert the changes from apache/datasketches-bigquery#142 in order to publish via this repo |
Could you clarify why do you think we need to revert the change? I have just re-tested the release 1.1.0 setting JS_BUCKET=gs://bucketname and JS_BUCKET=gs://bucketname/foldername (without slash at the end). Both ways work. |
PR#142 removed the forward slash from the following command:
which results in cp command thinking that the gs:// path is to a file instead of a folder. Here is an excerpt of the build logs from this repo (note how the makefiles all write/overwrite the same filename "2eaa735":
|
I guess that in your case the specified folder "2eaa735" does not exist. Is that right? Do you expect the copy command to create the folder? |
Correct, the specified folder doesn't exist nor should it need to since folders aren't actually real objects by default in Cloud Storage (see https://cloud.google.com/storage/docs/objects#flat-namespace for more details). |
/gcbrun |
@AlexanderSaydakov The unit tests are not passing. There are 117 failures (I've attached a file with error details). Were you able to make the unit tests pass in your own BigQuery project or is this the first time the unit tests are run in a BigQuery project? |
Tests pass for me and for several others who voted on the release. |
It seems to me that you did not rerun the tests after the version change to 1.1.1 |
How are you running the tests? |
https://github.com/GoogleCloudPlatform/bigquery-utils/pull/492/checks?check_run_id=43226504649 |
I am running 'make', 'make install' and 'make test' |
I ran these exact commands in a clean cloud shell instance for datasketches tag v1.1.1 and got the same failures. |
I used emscripten 4.0.7 (installed by brew on MacOS). |
…es its submodule dependency
I've added a not-so-pretty shim in this repo to pin emsdk to version 4.0.7 but only as a temporary workaround until apache/datasketches-bigquery#156 makes its way to the next release. Thanks for the help debugging and finding the version issue! |
/gcbrun |
Release notes here:
https://github.com/apache/datasketches-bigquery/releases/tag/1.1.0