-
Notifications
You must be signed in to change notification settings - Fork 100
Closed
Labels
Description
Hello,
With sniffles 2.5 I am trying to perform a fairly simple genotype/merge with PacBio input CRAM files (from pbmm2). Despite providing "--reference" to sniffles, the resulting VCF has 'N' for all the REF alleles. To verify this is not CRAM-speciifc, I also converted CRAMs to BAMs and repeated, with the same result.
here is the first command (once per input):
$SNIFFLES \
--input $BAM \
--snf $SNF \
--allow-overwrite \
--threads 12 \
--no-progress \
--reference $MMUL10
and the merge:
$SNIFFLES \
--no-progress \
--input fileList.tsv \
--allow-overwrite \
-t 12 \
--no-sort \
--combine-pctseq 0 \
--reference $MMUL10 \
--vcf $VCF
The resulting VCF lists 'N' as the REF for every row. It seems like the bcftools fill-from-fasta plugin can salvage the REF field:
bcftools sort $VCF | bcftools +fill-from-fasta $VCF -- -f $MMUL10 -c REF | bgzip > ${VCF}.gz
but thought it was worth reporting this.