Two small bugfixes to 02-EvidenceQC #666
Merged
When running `EvidenceQC.wdl` on ~30k samples from the NIH AllOfUs cohort, I encountered two unrelated issues with the `MakeQcTable` task in `EvidenceQC.wdl`:

`EvidenceQC.wdl` supports optionally disabling VCF QC, but the `read_all_outlier()` function in `make_evidence_qc_table.py` exits with an error when there are zero outlier samples. I solved this by adding a conditional that checks for zero outliers, in which case the function returns an empty dataframe with the expected headers. This allows the rest of the script to run successfully.
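For readers unfamiliar with the pattern, the zero-outlier guard can be sketched roughly as follows. This is an illustration only, not the code in `make_evidence_qc_table.py`: the function name `count_outliers`, the column name `outlier_count`, and the value of `ID_COL` are all assumptions made up for the example.

```python
import pandas as pd

ID_COL = "#ID"  # hypothetical ID column name, chosen for this sketch


def count_outliers(outlier_ids, count_col="outlier_count"):
    """Count how often each sample ID appears in a list of outlier calls.

    Hypothetical helper: when there are no outliers at all, return an
    empty DataFrame that still carries the expected headers, so that
    downstream merges on ID_COL keep working.
    """
    if len(outlier_ids) == 0:
        # Guard for the zero-outlier case: no rows, expected columns.
        return pd.DataFrame(columns=[ID_COL, count_col])

    counts = pd.Series(list(outlier_ids)).value_counts()
    # Move the sample IDs out of the index into a named column.
    return counts.rename_axis(ID_COL).reset_index(name=count_col)


empty = count_outliers([])
some = count_outliers(["s1", "s2", "s1"])
```

The point of the guard is that callers always receive a frame with the same columns, whether or not any outliers exist.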
Separately, `merge_evidence_qc_table()` fails for cohorts where every sample has an integer ID. This seems to be due to pandas coercing some of the ID columns to dtype `object` whereas others are dtype `int64`, which leads to an error when merging. I fixed this by forcing all ID columns to dtype `object` prior to merging, which resolves the error.

(Both of these were encountered when using Docker image `us.gcr.io/broad-dsde-methods/gatk-sv/sv-pipeline:2024-03-04-v0.28.4-beta-f0ad3f0f`, but based on the edit history of `make_evidence_qc_table.py`, my impression is that both issues are also present on the current `main` branch.)

Thanks!
Ryan
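
P.S. A minimal sketch of the dtype mismatch behind the `merge_evidence_qc_table()` issue, for anyone who wants to see the idea in isolation. This is not the code from the PR: the helper `merge_on_string_ids`, the column names, and the toy tables are invented for illustration. The sketch also casts IDs with `astype(str)` rather than to `object` directly, so that integer and string IDs compare equal after the cast. The general situation it illustrates is that pandas typically refuses to merge an `int64` key column against a key column holding strings.

```python
import pandas as pd

ID_COL = "#ID"  # hypothetical ID column name, chosen for this sketch


def merge_on_string_ids(frames, id_col=ID_COL):
    """Outer-merge DataFrames after casting each ID column to str.

    Hypothetical helper: normalising the key dtype first means tables
    whose IDs were parsed as integers can be merged with tables whose
    IDs were parsed as strings. Returns None for an empty input list.
    """
    merged = None
    for df in frames:
        df = df.copy()  # avoid mutating the caller's frame
        df[id_col] = df[id_col].astype(str)
        merged = df if merged is None else merged.merge(df, on=id_col, how="outer")
    return merged


# Toy tables: the same integer-like sample IDs, parsed two different ways.
ploidy = pd.DataFrame({ID_COL: [1001, 1002], "ploidy": [2, 2]})
coverage = pd.DataFrame({ID_COL: ["1001", "1002"], "coverage": [30.1, 28.7]})

merged = merge_on_string_ids([ploidy, coverage])
```

Normalising the key dtype before every merge keeps the behaviour the same regardless of whether a cohort's sample IDs happen to look like integers.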