summaryrefslogtreecommitdiff
path: root/java
diff options
context:
space:
mode:
authorJason W. Bacon <jwb@FreeBSD.org>2021-03-21 15:17:27 +0000
committerJason W. Bacon <jwb@FreeBSD.org>2021-03-21 15:17:27 +0000
commit5a899ba8ed1909cc03c8c66eee45bcc82e407ba0 (patch)
treeb68312e23839f6cc6bd1b1d28243fd764915050b /java
parentbiology/biolibc: Low-level high-performance bioinformatics library (diff)
biology/vcf-split: Split a multi-sample VCF into single-sample VCFs
Vcf-split splits a multi-sample VCF into single-sample VCFs, writing thousands of output files simultaneously. Parsing the TOPMed human chromosome 1 BCF with bcftools takes two days, so extracting the 137,977 samples one at a time or using thousands of parallel readers of the same file is impractical. Vcf-split solves this by generating thousands of single-sample outputs during a single sweep through the multi-sample input.
Notes
Notes: svn path=/head/; revision=568922
Diffstat (limited to 'java')
0 files changed, 0 insertions, 0 deletions