cgpCaVEManWrapper: Simple execution of caveman in order to detect somatic single nucleotide variants in NGS data
Jones D., Raine KM., Davies H., Tarpey PS., Butler AP., Teague JW., Nik-Zainal S., Campbell PJ.
© 2016 John Wiley & Sons, Inc. CaVEMan is an expectation maximization-based somatic substitutiondetection algorithm that is written in C. The algorithm analyzes sequence data from a test sample, such as a tumor relative to a reference normal sample from the same patient and the reference genome. It performs a comparative analysis of the tumor and normal sample to derive a probabilistic estimate for putative somatic substitutions.When combined with a set of validated post-hoc filters, CaVEMan generates a set of somatic substitution calls with high recall and positive predictive value. Here we provide instructions for using a wrapper script called cgpCaVEManWrapper, which runs the CaVEMan algorithm and additional downstream post-hoc filters. We describe both a simple one-shot run of cgpCaVEManWrapper and a more in-depth implementation suited to large-scale compute farms.