Merging CNVs with truvari bench
vs truvari collapse
#247
Replies: 1 comment
-
Hello, In the paper, we first used For your project, your final output will be a single, merged VCF and therefore you'll be focused on
However, since you don't have typical GT fields you may not find that functionality very helpful and may want to build your own script to consolidate the SAMPLE columns for intra-sample merging (e.g. special handling of the copy-number (CN) field since Once you've created the final set of CNVs per-sample, you could use repeat the bcftools merge, truvari collapse procedure to perform inter-sample merging. An extra benefit of a two step intra/inter merge is you can have more control over the matching thresholds at each step. One could imagine using looser thresholds for intra-sample than what's used for inter-sample if we assume that calls from the same sample that overlap are more likely to be from the same event whereas that same amount of overlap between samples could be from different events. Have a great day, |
Beta Was this translation helpful? Give feedback.
-
Hi Adam, I've been trying to find out if it is possible to merge CNVs using Truvari if I do not have a truth set. In the Truvari paper, I saw that you first used
truvari bench
to merge CNVs by more liberal parameters, and then you usedtruvari collapse
to refine the merging to make it more specific.I am a PhD student trying to create a Nextflow workflow that integrates three exome sequencing-based CNV callers (CANOES, CLAMMS and XHMM) using a SV merging tool, then annotating the output with features used to train a random forest classifier.
Each of these three CNV callers has a different output file format with different fields, but I have written scripts to convert their outputs to VCF format, such that I can use them as inputs for merging tools, like Truvari.
Unfortunately, the CNV callers used do not provide genotype or strand information and as the tools all use the read depth approach to call CNVs from exome sequencing data, the boundaries are fuzzy.
I would like to know if I should first merge the CNVs intra-sample using
truvari bench
, and then merge between samples, while retaining the association of specific samples to particular CNVs, usingtruvari collapse
.Kind regards
Yoni Wolberg
Beta Was this translation helpful? Give feedback.
All reactions