This function is the most speed-critical in the library. In profiles, this optimization reduces its share from ~75% of the profile to ~55%. I tried several other approaches but did not manage to improve on this one (LLVM already unrolls the loop here), though I'm sure it is possible.
pull/7/head
parent
ea4b5e4df0
commit
dd96d3b9d4