There is no way to by- pass or short-cut these steps. Many of the premature claims in a number of publications concerning the pathogenicity or disease-association of certain muta- tions could have been avoided if, from the very outset, efficient search strategies and some degree of phylo- ...
Your loop is not optimized at all because strict FP math is the default. XMM0 is a 128-bit register, YMM0 is the corresponding 256-bit. vaddsd is ADD Scalar Double, using the low element of XMM0. https://felixcloutier.com/x86/addsd With clang -O3 -ffast-math -march=native to...