Data was generated using dwgsim version 0.1.11.
The reference genome used to generate the data was the GRCh38 assembly including alt contigs and decoy sequences provided by NCBI.
The commands used to generate the data were:
dwgsim -N 10000 -e 0.0010 -E 0.0010 -1 152 -2 152 -c 0 -S1 -z 1 -Q 5 \
reference.fasta test
gzip -c -9 test.bwa.read1.fastq > test.fastq.gz
concatenated.fastq.gz was created with:
gzip -cd tests/data/test.fastq.gz | head -n 1000 | gzip -c -1 >> test2.fastq.gz
gzip -cd tests/data/test.fastq.gz | head -n 1000 | gzip -c -1 >> test2.fastq.gz