author | Jason W. Bacon <jwb@FreeBSD.org> | 2021-03-21 15:17:27 +0000 |
---|---|---|
committer | Jason W. Bacon <jwb@FreeBSD.org> | 2021-03-21 15:17:27 +0000 |
commit | 5a899ba8ed1909cc03c8c66eee45bcc82e407ba0 (patch) | |
tree | b68312e23839f6cc6bd1b1d28243fd764915050b /java | |
parent | biology/biolibc: Low-level high-performance bioinformatics library (diff) |
biology/vcf-split: Split a multi-sample VCF into single-sample VCFs
Vcf-split splits a multi-sample VCF into single-sample VCFs, writing thousands
of output files simultaneously. Parsing the TOPMed human chromosome 1 BCF
with bcftools takes two days, so extracting the 137,977 samples one at a time
or using thousands of parallel readers of the same file is impractical.
Vcf-split solves this by generating thousands of single-sample outputs during
a single sweep through the multi-sample input.
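The single-sweep idea described above can be illustrated with a minimal Python sketch. This is not the vcf-split implementation; the function name `split_vcf`, the `.vcf` output naming, and the uncompressed tab-separated text input are assumptions made for illustration. It reads each input line once and appends that line's per-sample column to the matching output file, keeping all outputs open at the same time.

```python
import contextlib
import os

FIXED_COLUMNS = 9  # CHROM through FORMAT precede the per-sample columns


def split_vcf(lines, out_dir):
    """Hypothetical helper: write one single-sample VCF per sample column,
    reading the multi-sample input exactly once. Returns the sample count."""
    with contextlib.ExitStack() as stack:
        outputs = []  # one open file per sample, in column order
        meta = []     # '##' meta lines seen before the header line
        for line in lines:
            line = line.rstrip("\n")
            if line.startswith("##"):
                meta.append(line)
                continue
            fields = line.split("\t")
            if line.startswith("#CHROM"):
                # Open every output up front and copy the header into each.
                for sample in fields[FIXED_COLUMNS:]:
                    path = os.path.join(out_dir, sample + ".vcf")
                    out = stack.enter_context(open(path, "w"))
                    for meta_line in meta:
                        out.write(meta_line + "\n")
                    header = fields[:FIXED_COLUMNS] + [sample]
                    out.write("\t".join(header) + "\n")
                    outputs.append(out)
                continue
            # Data line: shared columns plus this sample's genotype field.
            shared = "\t".join(fields[:FIXED_COLUMNS])
            for out, genotype in zip(outputs, fields[FIXED_COLUMNS:]):
                out.write(shared + "\t" + genotype + "\n")
        return len(outputs)
```

In this sketch the number of simultaneously open outputs is bounded by the operating system's open-file limit, so an input with many thousands of samples would need that limit raised or the samples handled in groups.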
Notes:
svn path=/head/; revision=568922
Diffstat (limited to 'java')
0 files changed, 0 insertions, 0 deletions