-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to set up sex of PON #324
Comments
It's not working automatically? Internally it will create sex-specific PoNs. Not sure I ever tested this thoroughly, but I expect it to work. |
it's strange i have no result on X and Y, PureCN detect anomalies on autosomal chromosomes but not on X and Y, |
Ah, got it. Yes, the likelihood model only supports diploid genomes, so it ignores X/Y for males. Keeping the X/Y segmentation is a fair feature request though. Not sure I find time for it soon. |
You might be able to force the use of X/Y by providing sex="diploid". Not sure there are any side effects though. But might be enough to get the segmentation and merge it then back to the regular run. |
is NormalDB.R script accept the parameter sex="diploid" ? |
PureCN.R. |
oh yes of course, sorry, |
and i have another question because i obtain too much noisy results: |
my command was: |
Can you post the log file? It will tell you exactly what it used for filtering low quality bins. The --stats-file is only for Mutect1. Also have a look at #320 if you have many MBQ 20 variants. |
PSX-AD-001.log |
Looks fine, but you indeed suffer the same issue with MBQ 20. Install the version from the issue_320 branch: And then re-run PureCN.R with --min-base-quality 20 |
ok i have installed the version from issue_320 Rscript $PURECN/PureCN.R -v then: and same output with the same number of CNVs. |
You should have now a much higher number of SNPs in the log file. INFO [2023-09-19 16:42:29] 2.0% of targets contain variants. That should be now > 10%. That number should have dropped significantly: INFO [2023-09-19 16:42:28] Removing 14110 low quality variants with non-offset BQ < 25. |
ok, yes i understand. |
You can use the GATK segmentation with PureCN via --fun-segmentation GATK. That should get you close to GATK, but with off-target read support. PSCBS has a few unique features to combine off-target and on-target reads a bit smarter in the segmentation though. For low purity, I recommend PSCBS, for higher purity (like usually > 30%), GATK is probably cleaner. It's a balance of sensitivity and specificity. CNVkit is tweaked for sensitivity and expected to give more segments. |
yes. |
Describe the issue
build PON from multiples samples, all samples are males
To Reproduce
Rscript $PURECN/Coverage.R --out-dir $OUTPUTS/normals --bam normals.list --intervals $OUT_REF/hg38_intervals.txt --cores 12 --force
ls -a $OUTPUTS/normals/*_loess.txt.gz | cat > PON_normal_coverages.list
Rscript $PURECN/NormalDB.R --out-dir $OUT_REF --coverage-files PON_normal_coverages.list --genome hg38 --assay PON --force
How to set up sex male if all my normal samples are males ?
I know that in cnvkit tool i can set --male-reference for male when i build reference (PON).
thank you--
The text was updated successfully, but these errors were encountered: