Wednesday, November 18, 2009

Some benchmarking

Hello.

Recently, thanks to the surprisingly helpful Unhelpful, also known as Andrew Mahone, we have a decent, if slightly arbitrary, set of performance graphs. It contains a couple of benchmarks already seen on this blog as well as some taken from The Great Computer Language Benchmarks Game. These benchmarks don't even try to represent "real applications": they're mostly small algorithmic benchmarks. Interpreters used:

  1. PyPy trunk, revision 69331 with --translation-backendopt-storesink, which is now on by default
  2. Unladen Swallow trunk, r900
  3. CPython 2.6.2 release

Here are the graphs; the benchmarks and the runner script are also available.

And here they are zoomed in, for all benchmarks except binary-trees and fannkuch.
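
The runner script is not reproduced in this post, but in spirit it is just a timing harness; here is a minimal sketch (the fibo workload below is made up for illustration and is not one of the actual benchmarks):

    import time

    def fibo(n):
        # toy workload standing in for a real benchmark function
        return n if n < 2 else fibo(n - 1) + fibo(n - 2)

    def time_one(func, arg):
        t0 = time.time()
        func(arg)
        return time.time() - t0

    def bench(name, func, arg, runs=5):
        # repeat each measurement and keep the best wall-clock time,
        # which is less sensitive to system noise than a single run
        best = min(time_one(func, arg) for _ in range(runs))
        print("%s: %.3f s" % (name, best))

    bench("fibo", fibo, 25)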

As we can see, PyPy is generally somewhere between the same speed as CPython and 50x faster (f1int). The places where we're merely the same speed as CPython are places where we know we have problems - for example, generators are not sped up by the JIT and require some work (although by far not as much as generators required for Psyco :-). The glaring inefficiency is in the regex-dna benchmark. This one clearly demonstrates that our regular expression engine is really, really bad and urgently requires attention.
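
To make the generator point concrete, here is a toy sketch (not one of the benchmarks above) of the pattern the current JIT does not speed up: a loop that pulls its values through a generator instead of computing them inline.

    def counter(n):
        # generator version: every yield suspends the frame, and the
        # JIT of this era does not trace across those suspensions
        i = 0
        while i < n:
            yield i
            i += 1

    def sum_via_generator(n):
        total = 0
        for value in counter(n):
            total += value
        return total

    print(sum_via_generator(1000000))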

The cool thing here is that, although these benchmarks might not represent typical Python applications, they're not uninteresting. They show that algorithmic code does not need to be far slower in Python than in C, so with PyPy one need not worry about algorithmic code being dramatically slow. As many readers would agree, that kills yet another usage of C in our lives :-)
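
For contrast, the biggest win comes from tight arithmetic loops. f1int's actual source lives with the linked benchmarks; the toy loop below is only meant to show the flavor of code where the JIT approaches C-like speed.

    def f1int_like(n):
        # tight integer arithmetic loop: the kind of code the JIT can
        # compile down to a few machine instructions per iteration
        x = 1
        i = 0
        while i < n:
            x = (x * 3 + 1) % 1000003
            i += 1
        return x

    print(f1int_like(10000000))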

Cheers,
fijal

31 comments:

  1. Wow! This is getting really interesting. Congratulations!
    By the way, it would be great if you include psyco in future graphs, so speed junkies can have a clearer picture of pypy's progress.

  2. Very interesting, congratulations on all the recent progress! It would be very interesting to see how PyPy stacks up against Unladen Swallow on Unladen Swallow's own performance benchmarks, which include some more real-world scenarios.

  3. @Eric: yes, definitely, we're approaching that set of benchmarks

    @Luis: yes, definitely, will try to update tomorrow, sorry.

  4. It's good, but...

    We are still in the realm of micro-benchmarks. It would be good to compare their performance when working on something larger. Django or Zope, maybe?

  5. These last months, you seem to have had almost exponential progress. I guess all those years of research are finally paying off. Congratulations!

    Also, another graph for memory pressure would be nice to have. Unladen Swallow is (was?) not very good in that area, and I wonder how PyPy compares.

    [nitpick warning]
    As a general rule, when mentioning trunk revisions, it's nice to also mention a date, so that people can tell the test was fair. People will assume the revisions are from the day you ran the tests, and confirming that would be nice.
    [/nitpick warning]

  6. How about benchmarking against CPython trunk as well?

    cheers

    Antoine.

  7. What about memory consumption? That is almost as important to me as speed.

  8. Congratulations !

    Please, could you remind us how to build and test pypy-jit?

  9. I'm curious why mandelbrot is much less accelerated than, say, nbody. Does PyPy not JIT complex numbers properly yet?

  10. @wilk ./translate.py -Ojit targetpypystandalone.py

  11. @Anon Our array module is in pure Python and much less optimized than CPython's.

  12. How long until I can do

    pypy-c-jit translate.py -Ojit targetpypystandalone.py

    ?

    So far, when I try, I get

    NameError: global name 'W_NoneObject' is not defined
    http://paste.pocoo.org/show/151829/

  13. AFAIU it's not PyPy's regex engine being "bad" but rather the fact that the JIT generator cannot consider and optimize the loop in the regex engine, as it is a nested loop (the outer one being the bytecode interpretation loop).

  14. @holger: yes, that explains why regexps are not faster in PyPy, but not why they are 5x or 10x slower. Of course our regexp engine is terribly bad. We should have at least a performance similar to CPython.

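    For anyone who wants to measure this on their own interpreter: a rough stand-in for the regex-dna workload (the input text below is synthetic, not the benchmark's real DNA data) could be timed like this:

        import re
        import time

        # "agggtaaa|tttaccct" is one of the regex-dna patterns; the
        # input here is only a synthetic stand-in for real sequences
        pattern = re.compile("agggtaaa|tttaccct")
        text = "acggtgtaacgt" * 200000

        t0 = time.time()
        count = len(pattern.findall(text))
        print("%d matches in %.3f s" % (count, time.time() - t0))
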
  15. Benjamin, is it really an issue with array? The inner loop just does complex arithmetic. --Anon

  16. @Anon I'm only guessing. Our math is awfully fast.

  17. @Anon, @Benjamin
    I've just noticed that W_ComplexObject in objspace/std/complexobject.py is not marked as _immutable_=True (as e.g. W_IntObject is), so it is totally possible that the JIT is not able to optimize math with complexes the way it does with ints and floats. We should look into it; it is probably easy to discover.

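    For readers unfamiliar with the hint discussed above: RPython lets an interpreter-level class declare _immutable_ = True, telling the JIT that instances never change after construction, so attribute reads can be constant-folded. Schematically (a simplified sketch, not the actual code in objspace/std/complexobject.py):

        class W_ComplexObject(object):  # real base class omitted here
            # RPython JIT hint: instances never change after
            # construction, so reads of realval and imagval
            # may be constant-folded by the JIT
            _immutable_ = True

            def __init__(self, realval, imagval):
                self.realval = realval
                self.imagval = imagval
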
  18. guys, sorry, who cares about *seconds*??

    why didn't you normalize to the test winners? :)

  19. This comment has been removed by the author.

  20. So, um, has anyone managed to get JIT-ed pypy to compile itself?

    When I tried to do this today, I got this:

    http://paste.pocoo.org/show/151829/

  21. @Leo:

    yes, we know about that bug. Armin is fixing it right now on the faster-raise branch.

  22. antonio: good point. On second thought, though, it's not a *really* good point, because we don't have _immutable_=True on floats either...

  23. @Maciej Great! It'll be awesome to have a (hopefully much faster??) JITted build ... it currently takes my computer more than an hour ...

  24. @Leo it's likely to take tons of memory, though.

  25. It would perhaps also be nice to compare the performance with one of the current JavaScript engines (V8, SquirrelFish, etc.).

  26. Nice comparisons - and micro-performance looking good. Congratulations.

    HOWEVER - there is no value in having three columns for each benchmark. The overall time is arbitrary; all that matters is the relative time, so you might as well normalise all graphs to CPython = 1.0, for example. The relevant information is then easier to see!

  27. it's called "The Computer Language Benchmarks Game" these days...

  28. Tom is right, normalizing the graphs to CPython = 1.0 would make them much more readable.
    Anyway, this is a very good job by Unhelpful.
    Thanks!

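    Normalizing is straightforward once the raw times are collected; a sketch, with made-up numbers:

        # made-up raw times in seconds for a single benchmark
        raw = {"cpython": 10.0, "unladen-swallow": 7.5, "pypy-jit": 0.9}

        baseline = raw["cpython"]
        for name in sorted(raw):
            # relative time: below 1.0 means faster than CPython
            print("%-16s %.2fx" % (name, raw[name] / baseline))
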
  29. Do any of those benchmarks work with shedskin?

  30. glad to see someone did something with my language shootout benchmark comment ;)

  31. I checked http://www.looking-glass.us/~chshrcat/python-benchmarks/results.txt but it doesn't have the data for Unladen Swallow. Where are the numbers?

