Skip to content

Ref base always 'N', despite providing --reference FASTA #531

@bbimber

Description

@bbimber

Hello,

With sniffles 2.5 I am trying to perform a fairly simple genotype/merge with PacBio input CRAM files (from pbmm2). Despite providing "--reference" to sniffles, the resulting VCF has 'N' for all the REF alleles. To verify this is not CRAM-speciifc, I also converted CRAMs to BAMs and repeated, with the same result.

here is the first command (once per input):

$SNIFFLES \
	--input $BAM \
	--snf $SNF \
	--allow-overwrite \
	--threads 12 \
	--no-progress \
	--reference $MMUL10

and the merge:

$SNIFFLES \
	--no-progress \
	--input fileList.tsv \
	--allow-overwrite \
	-t 12 \
	--no-sort \
	--combine-pctseq 0 \
	--reference $MMUL10 \
	--vcf $VCF

The resulting VCF lists 'N' as the REF for every row. It seems like the bcftools fill-from-fasta plugin can salvage the REF field:

bcftools sort $VCF | bcftools +fill-from-fasta $VCF -- -f $MMUL10 -c REF | bgzip > ${VCF}.gz

but thought it was worth reporting this.

Metadata

Metadata

Labels

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions