Skip to content

Vs 1619 #9196

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 18 commits into from
Jun 10, 2025
Merged

Vs 1619 #9196

merged 18 commits into from
Jun 10, 2025

Conversation

koncheto-broad
Copy link
Collaborator

@koncheto-broad koncheto-broad commented Jun 2, 2025

Modifying the create filter set code to accept custom references AND a custom truth file. Completes successfully, but we don't have truth data to test it against because it's using custom training data.

Successful completion: https://app.terra.bio/#workspaces/gvs-dev/GVS%20Tiny%20Quickstart%20for%20Felis%20catus%20adoribus/submission_history/0b40a0ae-494f-4f79-871f-46a776a2376e

Successful integration run: https://app.terra.bio/#workspaces/gvs-dev/GVS%20Integration%20ahatcher/submission_history/c3567b95-8594-43fe-8507-98db01734a5e

@koncheto-broad koncheto-broad marked this pull request as ready for review June 10, 2025 14:07
@mcovarr mcovarr requested a review from Copilot June 10, 2025 14:52
Copy link

@Copilot Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR extends the filter‐set creation tools to support fully custom references by accepting a contig-mapping file and, in WDL workflows, a custom truth resources file.

  • Adds --contig-mapping-file argument and logic in Java tools to set a custom contig map when --ref-version CUSTOM is used
  • Updates WDL (GvsUtils.wdl and GvsCreateFilterSet.wdl) to pass --contig-mapping-file and allow a custom_training_resources override
  • Registers the new branch in .dockstore.yml

Reviewed Changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 3 comments.

Show a summary per file
File Description
CreateSiteFilteringFiles.java Added contigMappingFile argument and conditional custom contig map loading
CreateFilteringFiles.java Same additions for indel/SNP filter builder
scripts/variantstore/wdl/GvsUtils.wdl Introduced optional custom_contig_mapping and made --ref-version dynamic
scripts/variantstore/wdl/GvsCreateFilterSet.wdl Added custom_reference, custom_contig_mapping, custom_training_resources; wired flags
.dockstore.yml Added new branch VS-1619 to workflows
Comments suppressed due to low confidence (1)

scripts/variantstore/wdl/GvsCreateFilterSet.wdl:352

  • You added custom_reference as an input but the GATK call still uses the original reference variable. Replace -R "~{reference}" with a reference that falls back to custom_reference (e.g., use effective_reference) so the task actually uses the custom fasta when supplied.
-R "~{reference}"

Copy link
Collaborator

@mcovarr mcovarr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

One nit and one concern with an issue flagged by Copilot.

@@ -16,6 +16,11 @@ workflow GvsCreateFilterSet {
String reference_name = "hg38"
String? interval_list

# for supporting non-hg38 references
File? custom_reference
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In practice, you could put the new reference information into the Reference struct and then use the corresponding reference_name (e.g. kitty) to look it up in GvsUtils.py - But might be too early for that yet?

@koncheto-broad koncheto-broad merged commit 988f467 into gvs_expanded_references Jun 10, 2025
16 checks passed
@koncheto-broad koncheto-broad deleted the VS-1619 branch June 10, 2025 19:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants