This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| // Branchless, vectorized `atan2f`. Various functions of increasing | |
| // performance are presented. The fastest version is 50~ faster than libc | |
| // on batch workloads, outputing a result every ~2 clock cycles, compared to | |
| // ~110 for libc. The functions all use the same `atan` approximation, and their | |
| // max error is around ~1/10000 of a degree. | |
| // | |
| // They also do not handle inf / -inf | |
| // and the origin as an input as they should -- in our case these are a sign | |
| // that something is wrong anyway. Moreover, manual_2 does not handle NaN | |
| // correctly (it drops them silently), and all the auto_ functions do not |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| # Compiled source # | |
| ################### | |
| *.com | |
| *.class | |
| *.dll | |
| *.exe | |
| *.o | |
| *.so | |
| # Packages # |