Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

According to the Intel optimization manual, there's a 5 cycle latency for doubles (4 for floats), but both fp multiply/add and alu ops have a one operation per cycle throughput.

I guess it comes down to how effectively you can schedule stuff.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: