In the PureCN.R output file *_variants.csv, am I correct to think that I can get a quick count of the number of somatic variants that PureCN.R saw in the input VCF file by adding the number of entries > 0.9 in column "prior.somatic"? (A log message said that variants marked SOMATIC were found and their prior probability was set to 0.999).
PureCN.R is not seeing most of my somatic variants, or at least is not putting the prior value to 0.999. For one sample, which should have about 120 somatic variants, only 1 variant has prior probability > 0.9.
What filtering might PureCN be applying to my somatic variants? I didn't see log file messages indicating that a bunch of them had been filtered out.
So far I haven't found anything wrong with the VCF. However, I noted when trying to run the vcf object through filterVcfBasic() that in one place it attempts to get the depths from the DP genotype field, which does not exist in my VCF. When I run it by hand, it gets a fatal error. Yet that isn't happening when I run PureCN.
I'm wondering if I should add a FORMAT/DP field?
Later -- I went ahead and added a DP field, and then when I ran it, PureCN crashed with the following output:
Did you load the VCF with PureCN:::.readAndCheckVcf(vcf.file, "hg38")? This should take care of necessary standardizations like adding DP.
No, ok, thx.
However, the crash occurred when I was running PureCN normally.