PyPy Status Blog: PyPy 1.4: Ouroboros in practice

Friday, November 26, 2010

PyPy 1.4: Ouroboros in practice

We're pleased to announce the 1.4 release of PyPy. This is a major breakthrough in our long journey, as PyPy 1.4 is the first PyPy release that can translate itself faster than CPython. Starting today, we are using PyPy more for our every-day development. So may you :) You can download it here:

http://pypy.org/download.html

What is PyPy

PyPy is a very compliant Python interpreter, almost a drop-in replacement for CPython. It is fast (pypy 1.4 and cpython 2.6 comparison).

New Features

Among its new features, this release includes numerous performance improvements (which made fast self-hosting possible), a 64-bit JIT backend, as well as serious stabilization. As of now, we can consider the 32-bit and 64-bit linux versions of PyPy stable enough to run in production.

Numerous speed achievements are described on our blog. Normalized speed charts comparing pypy 1.4 and pypy 1.3 as well as pypy 1.4 and cpython 2.6 are available on the benchmark website. For the impatient: yes, we got a lot faster!

More highlights

PyPy's built-in Just-in-Time compiler is fully transparent and automatically generated; it now also has very reasonable memory requirements. The total memory used by a very complex and long-running process (translating PyPy itself) is within 1.5x to at most 2x the memory needed by CPython, for a speed-up of 2x.
More compact instances. All instances are as compact as if they had __slots__. This can give programs a big gain in memory. (In the example of translation above, we already have carefully placed __slots__, so there is no extra win.)
Virtualenv support: now PyPy is fully compatible with virtualenv: note that to use it, you need a recent version of virtualenv (>= 1.5).
Faster (and JITted) regular expressions - huge boost in speeding up the re module.
Other speed improvements, like JITted calls to functions like map().

Cheers,
Carl Friedrich Bolz, Antonio Cuni, Maciej Fijalkowski, Amaury Forgeot d'Arc, Armin Rigo and the PyPy team

Posted by Maciej Fijalkowski at 19:18

27 comments:

ipc said...: congratulations!; November 26, 2010 at 7:42 PM
why said...: This is unacceptable. Christmas is not until next month!!!; November 26, 2010 at 7:47 PM
Tim Parkin said...: Massive congratulations - exciting!; November 26, 2010 at 8:09 PM
Unknown said...: Sweet! Keep up the great work !; November 26, 2010 at 8:18 PM
Anonymous said...: Woohoo!!; November 26, 2010 at 8:41 PM
Anonymous said...: Hip hip hooooraaaay!!!!; November 26, 2010 at 9:59 PM
ipc said...: all I want for Christmas is stackless support in a 64-bit pypy-c-jit :) 'two greenlets switching and a partridge in a pear tree!'; November 26, 2010 at 11:51 PM
Unknown said...: Congratulations. I hope the PPA is going to be updated soon. Too lazy to build it myself, right now. (:; November 27, 2010 at 12:14 AM
Paul Boddie said...: Is there a -j <number-of-cores> option for the translation process? It's a bit unfortunate that 15 cores on the nice machine I'm using can't be put to use making it translate faster. (Or unfortunate that I didn't read the documentation, maybe.); November 27, 2010 at 12:29 AM
ipc said...: --make-jobs=N only some parts of the translation process is parallel.; November 27, 2010 at 12:54 AM
Anonymous said...: Eta until numpy scipy?; November 27, 2010 at 1:10 AM
Paul Boddie said...: The report of 2.4GB usage on x86-64 is accurate, but it took about 7800s on a 2.33GHz Xeon. Next time I'll try and exercise some of the other cores, though.; November 27, 2010 at 2:00 AM
Anonymous said...: so pypy on average is now about 2x faster than cpython?

and unladen swallows goal was beeing 5x faster? was that totally unrealistic?; November 27, 2010 at 5:54 AM
Leonard Ritter said...: You are my heroes!; November 27, 2010 at 11:59 AM
Symbol said...: Just Awesome!!!

KUTGW!; November 27, 2010 at 12:37 PM
Daivd said...: Does this release include the -free branch that was mentioned in the previous post? The 2x memory requirements lead me to believe so.; November 27, 2010 at 1:02 PM
Maciej Fijalkowski said...: @Daivd
yes, it does

@Anonymous
5x improvement is not a well defined goal, however it's a good marketing thing. PyPy is 2x faster on translation, 60x faster on some benchmarks while slower on other. What does it mean to be 5x faster?; November 27, 2010 at 2:45 PM
Christian S. Perone said...: Sounds great, great work, great thanks !; November 27, 2010 at 3:23 PM
scientist said...: Do you know why the purely numerical benchmarks nbody and spectral-norm are still so much slower in PyPy compared to e.g. LuaJIT?; November 27, 2010 at 3:34 PM
tobami said...: This is awesome. PyPy 1.4 addresses the 2 slowest benchmarks, slowspitfire and spambayes. There is no benchmark anymore where PyPy is much slower than CPython.

To me, this marks the first time you can say that PyPy is ready for general "consumption". Congratulations!

PS: The best comparison to appreciate how much of an improvement 1.4 has been is:
http://speed.pypy.org/comparison/?exe=2%2B35,1%2B41,1%2B172&ben=1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20&env=1&hor=false&bas=2%2B35&chart=normal+bars; November 27, 2010 at 3:44 PM
Maciej Fijalkowski said...: @scientist

Sure, because LuaJIT is crazy when it comes to optimizations :-) We'll get there eventually, but purely numerical stuff is not as high on our list as other things.; November 27, 2010 at 6:37 PM
Luis said...: @maciej: in an old thread (have tracing compilers won?) you replied to Mike Pall saying that pypy was in a way middle ground, that it didn't offer as much opportunities for micro optimizations as luajit.

You were discussing about keeping high level constructions from the user program to perform more tricks.

Has the situation changed?
Do you really think now that you'll get there?

Anyway, let me tell you that you are all already my super heroes :-); November 27, 2010 at 7:37 PM
Maciej Fijalkowski said...: Heh, I don't remember that :-)

Anyway, LuaJIT has more options for microoptimziations simply because Lua is a simpler language. That doesn't actually make it impossible for PyPy, it simply make it harder and taking more time (but it's still possible). I still think we can get (but predicting future is hard) where LuaJIT is right now, but racing Mike would be a challenge that we might loose ;-)

That said, even in simple loops there are obvious optimizations to be performed, so we're far from being done. We're going there, but it's taking time ;-); November 27, 2010 at 7:46 PM
Victor said...: Congrats to all PyPy developers for making huge contributions to Python performance, JIT and implementation research and delivering an end product that will help many developers to get more done.

IIUC, we still have ARM, jit-unroll-loops, more memory improvements, Python 2.7 (Fast Forward branch) and a bunch of other cool improvements in the works, besides some known interesting targets that will eventually be tackled (e.g. JITted stackless).

I wish more big Python apps and developers would play with PyPy and report the results.

Cheers!

P.S.: Fijal: see http://lambda-the-ultimate.org/node/3851#comment-57715; November 27, 2010 at 8:33 PM
Michal M. said...: Congratulations.
However, you suggest people used it in production environment - please, give us version compatible at least with CPython 2.6.
I hope that you plan it but at first you wanted to have stable and fast base. :); November 29, 2010 at 7:55 PM
Amaury Forgeot d'Arc said...: @Michal:
There is already an ongoing effort to port PyPy to Python 2.7.

But we need some help! It's a good way to become a PyPy developer.
And no, you don't have to be a JIT expert to implement itertools.combinations or asian codecs.; December 1, 2010 at 11:21 PM
Anonymous said...: kudos to whip-smart guys for this wonderful piece of software.; February 9, 2011 at 1:18 AM