Pyston

Pyston is a new, under-development Python implementation built using LLVM and modern JIT techniques with the goal of achieving good performance.

We have a small website, pyston.org, which for now just hosts the mailing lists and the blog. We have two mailing lists: pyston-dev@ for development-related discussions, and pyston-announce@ for wider announcements (new releases, major project changes).

Current state

Pyston should be considered in early alpha: it "works" in that it can successfully run Python code, but it is still quite far from being useful for end-users.

Currently, Pyston targets Python 2.7, only runs on x86_64 platforms, and has only been tested on Ubuntu. Support for more platforms -- along with Python 3 compatibility -- is planned for the future, but this is the initial target due to prioritization constraints.

Note: Pyston does not currently work on Mac OS X, but support is being worked on.

Contributing

Pyston welcomes any kind of contribution; please see CONTRIBUTING.md for details.

tl;dr: You will need to sign the Dropbox CLA and run the tests.

We have a small list of starter projects, but it is often not up to date. If you are interested in contributing but would like to chat about where to start, please feel free to email pyston-dev.

Roadmap

v0.1: released 4/2/2014

Focus was on building and validating the core Python-to-LLVM JIT infrastructure.
Many core parts of the language were missing.

v0.2: released 9/11/2014

Focus was on improving language compatibility to the point that we can start running "real code" in the form of existing benchmarks.
Many new features:
Exceptions
Class inheritance, metaclasses
Basic native C API support
Closures, generators, lambdas, generator expressions
Default arguments, keywords, *args, **kwargs
Longs, and integer promotion
Multithreading support
We have allowed performance to regress, sometimes considerably, but (hopefully) in places that allow for more efficient implementations as we have time.
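
As a rough illustration of what the v0.2 feature list covers, the script below touches each item in a small way. It is a sketch written for this README, not one of the project's tests: every name in it (Registry, Shape, Square, make_counter, describe, squares, safe_div) is made up for the example, and it says nothing about how fast Pyston runs any of these paths.

```python
# Hypothetical smoke test; every name here is invented for illustration.
import threading


class Registry(type):
    """Metaclass that records the name of every class it creates."""
    created = []

    def __init__(cls, name, bases, namespace):
        super(Registry, cls).__init__(name, bases, namespace)
        Registry.created.append(name)


# Calling the metaclass directly avoids version-specific metaclass syntax.
Shape = Registry("Shape", (object,), {"area": lambda self: 0})


class Square(Shape):  # class inheritance; the metaclass is inherited too
    def __init__(self, side=1):  # default argument
        self.side = side

    def area(self):
        return self.side * self.side


def make_counter():
    """Closure over a mutable cell."""
    count = [0]

    def bump():
        count[0] += 1
        return count[0]
    return bump


def describe(*args, **kwargs):
    """Report how many positional arguments and which keywords were passed."""
    return len(args), sorted(kwargs)


def squares(n):
    """Generator yielding the first n square numbers."""
    for i in range(n):
        yield i * i


def safe_div(a, b):
    """Exception handling: return None instead of raising on division by zero."""
    try:
        return a // b
    except ZeroDivisionError:
        return None


# Multithreading: compute one area on a worker thread.
results = []
worker = threading.Thread(target=lambda: results.append(Square(3).area()))
worker.start()
worker.join()

bump = make_counter()
bump()

# Integer promotion: this product does not fit in a 64-bit machine integer.
big = 2 ** 62 * 4
```

Running a script like this under both CPython 2.7 and Pyston and comparing the results is one simple way to spot a compatibility gap.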

v0.3: current series

Goal is to improve performance, informed by our behavior on real benchmarks.

Getting started

To get a full development environment for Pyston, you will need pretty recent versions of various tools. The docs/INSTALLING.md file contains information about what the tools are, how to get them, and how to install them; currently it can take up to an hour to get them all built on a quad-core machine.

To simply build and run Pyston, a smaller set of dependencies is required; see docs/INSTALLING.md, but skip the "OPTIONAL DEPENDENCIES" section. Once all the dependencies are installed, you should be able to do

$ make check -j4

and, hopefully, see that all of the tests pass.

If you see that the tests do not pass, please email pyston-dev.

Running Pyston

Pyston builds in a few different configurations. Right now there is pyston_dbg, the debug configuration, which contains assertions and debug symbols, and pyston_release, the release configuration, which has no assertions or debug symbols and is built with full optimizations. You can build them by running make pyston_dbg or make pyston_release, respectively. If you are interested in seeing how fast Pyston can go, you should try the release configuration, but there is a good chance that it will crash, in which case you can run the debug configuration to see what is happening.

There are a number of other configurations useful for development: "pyston_debug" contains full LLVM debug information, but will weigh in at a few hundred MB. "pyston_prof" contains gprof-style profiling instrumentation; gprof can't profile JIT'd code, reducing its usefulness in this case, but the configuration has stuck around since it gets compiled with gcc, and can expose issues with the normal clang-based build.

You can get a simple REPL by typing make run; it is not very robust right now, and only supports single-line statements, but can give you an interactive view into how Pyston works. To get more functionality, you can run ./pyston_dbg -i [your_source_file.py], which will go into the REPL after executing the given file, letting you access all the variables you had defined.
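
The idea behind -i (run a file, then keep its variables available at a prompt) can be sketched with CPython's standard code module. This is an analogy only and does not show how Pyston implements its REPL; run_script and SCRIPT are names invented for the example.

```python
# CPython-side analogy only; this is not how Pyston implements -i.
import code


def run_script(source, filename="<script>"):
    """Execute source text and return the namespace it populated."""
    namespace = {"__name__": "__main__"}
    exec(compile(source, filename, "exec"), namespace)
    return namespace


# Stand-in for the contents of your_source_file.py.
SCRIPT = "limit = 10\nevens = [n for n in range(limit) if n % 2 == 0]\n"

namespace = run_script(SCRIPT)

# Seed a console with the script's variables. push() feeds it one line,
# much like typing a single-line statement at the prompt.
console = code.InteractiveConsole(locals=namespace)
console.push("total = sum(evens)")
```

Calling console.interact() at this point would open an interactive prompt in which limit, evens and total are all defined.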

Makefile targets

make check: run the tests
make run: run the REPL
make format: run clang-format over the codebase
make run_TESTNAME: searches for a test or benchmark called TESTNAME.py and runs it under pyston
make dbg_TESTNAME: same as above, but runs pyston under gdb
make watch_cmd: uses inotifywait to run make cmd every time a source file changes.
For example, make watch is an alias for make watch_pyston_dbg, and will recompile every time you save a source file.
make wdbg_TESTNAME is mostly an alias for make watch_dbg_TESTNAME, but will automatically quit GDB for you.

There are a number of common flags you can pass to your make invocations:

V=1 or VERBOSE=1: display the full commands being executed
ARGS=-q: pass the given args (in this example, -q) to the executable.
Note: these will usually end up before the script name, and so apply to the pyston runtime as opposed to appearing in sys.argv. For example, make run_test ARGS=-q will execute ./pyston_dbg -q test.py.
BR=breakpoint: when running under gdb, automatically set a breakpoint at the given location.

For a full list, please check out the Makefile: https://github.com/dropbox/pyston/blob/master/src/Makefile
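
To make the note about ARGS concrete, here is a small model of how the final command line is described as being put together. It is a sketch based only on the example above, not code taken from the Makefile; build_command and script_argv are hypothetical helpers, the real target searches for the test file and so may pass a longer path, and the sys.argv behaviour assumes the usual CPython convention that the script sees everything from its own name onward.

```python
# Sketch of the behaviour described above; not taken from the Makefile.

def build_command(test_name, args="", executable="./pyston_dbg"):
    """Return the argv list that `make run_<test_name> ARGS=<args>` is described as running."""
    runtime_flags = args.split()
    # Runtime flags are placed before the script name, so the Pyston
    # runtime consumes them.
    return [executable] + runtime_flags + [test_name + ".py"]


def script_argv(command):
    """Return what the script would see in sys.argv (assumed CPython convention)."""
    script_index = next(i for i, part in enumerate(command) if part.endswith(".py"))
    return command[script_index:]


command = build_command("test", args="-q")
```

Here command is ["./pyston_dbg", "-q", "test.py"], and script_argv(command) is ["test.py"]: the -q flag reaches the runtime but not the script.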

Pyston command-line options:

-q: Set verbosity to 0
-v: Increase verbosity by 1; Pyston by default runs at verbosity 1, which contains a good amount of debugging information. Verbosity 0 contains no debugging information, and should produce the same results as other runtimes.
-n: Disable the Pyston interpreter. This is mostly used for debugging, to force the use of higher compilation tiers in situations where they wouldn't typically be used.
-O: Force Pyston to always run at the highest compilation tier. This doesn't always produce the fastest running time due to the lack of type recording from lower compilation tiers, but, like -n, it can help test the code generator.
-d: In addition to showing the generated LLVM IR, show the generated assembly code.
-i: Go into the REPL after executing the given script.
-b: Benchmark mode: do whatever would have been done, but do it 1000 times.
-p: Emit profiling information: at exit, Pyston will emit a dump of the code it generated for consumption by other tools.
-r: Use a stripped stdlib. When running pyston_dbg, the default is to use a stdlib with full debugging symbols enabled. Passing -r changes this behavior to load a slimmer, stripped stdlib.
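
The interaction between -q and -v can be modelled in a few lines. This is only a reading of the descriptions above, not Pyston's option parser: verbosity_from_flags is a hypothetical helper, and processing the flags left to right is an assumption.

```python
# Model of the -q / -v descriptions above; not Pyston's actual option parser.

def verbosity_from_flags(flags, default=1):
    """Return the verbosity level implied by a list of command-line flags."""
    level = default  # Pyston is described as running at verbosity 1 by default
    for flag in flags:  # assumption: flags are applied in order
        if flag == "-q":
            level = 0
        elif flag == "-v":
            level += 1
    return level
```

Under this model, no flags gives verbosity 1, a single -q gives 0, and two -v flags give 3.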
