Veedrac "points out" his likes and dislikes; that doesn't mean his likes and dislikes are "The Truth".
When Veedrac dismissively tells you "pidigits tests whether you have bindings to GMP", you should ask why he hasn't told you that the measurements can differ even when all the programs use GMP. You should also ask why he hasn't told you that, for the same language implementation, the measurements show the difference between programs that do and don't use GMP.
Have you looked at the benchmarks game website?
Please show where the benchmarks game website claims that those tasks simulate "real workloads" (whatever that means).
You will see "Your application is the ultimate benchmark" and "These are just 10 tiny examples" and …
http://benchmarksgame.alioth.debian.org/dont-jump-to-conclus...