f8a8243bd3
Without this flag, the configure script prints a warning at the end, like this (reformatted): If you want a release build with all stable optimizations active (PGO, etc), please run ./configure --enable-optimizations We're doing a build to distribute to people for day-to-day use, doing things other than developing the Python interpreter. So that's certainly a release build -- we're the target audience for this recommendation. --- And, trying it out, upstream isn't kidding! I ran the standard benchmark suite that the CPython developers use for performance work, "pyperformance". Following its usage instructions: https://pyperformance.readthedocs.io/usage.html I ran the whole suite, like so: $ nix-shell -p ./result."$variant" --run ' cd $(mktemp -d); python -m venv venv; . venv/bin/activate pip install pyperformance pyperformance run -o ~/tmp/result.'"$variant"'.json ' and then examined the results with commands like: $ python -m pyperf compare_to --table -G \ ~/tmp/result.{$before,$after}.json Across all the benchmarks in the suite, the median speedup was 16%. (Meaning 1.16x faster; 14% less time). The middle half of them ranged from a 13% to a 22% speedup. Each of the 60 benchmarks in the suite got faster, by speedups ranging from 3% to 53%. --- One reason this isn't just the default to begin with is that, until recently, it made the build a lot slower. What it does is turn on profile-guided optimization, which means first build for profiling, then run some task to get a profile, then build again using the profile. And, short of further customization, the task it would use would be nearly the full test suite, which includes a lot of expensive and slow tests, and can easily take half an hour to run. Happily, in 2019 an upstream developer did the work to carefully select a more appropriate set of tests to use for the profile: https://github.com/python/cpython/commit/4e16a4a31 https://bugs.python.org/issue36044 This suite takes just 2 minutes to run. And the resulting final build is actually slightly faster than with the much longer suite, at least as measured by those standard "pyperformance" benchmarks. That work went into the 3.8 release, but the same list works great if used on older releases too. So, start passing that --enable-optimizations flag; and backport that good-for-PGO set of tests, so that we use it on all releases. |
||
---|---|---|
.. | ||
acl2 | ||
angelscript | ||
bats | ||
ceptre | ||
chibi | ||
clips | ||
clisp | ||
clojure | ||
clojurescript/lumo | ||
dart | ||
dhall | ||
duktape | ||
eff | ||
elixir | ||
erlang | ||
evcxr | ||
falcon | ||
gauche | ||
gnu-apl | ||
groovy | ||
gtk-server | ||
guile | ||
hugs | ||
hy | ||
icon-lang | ||
io | ||
j | ||
janet | ||
jimtcl | ||
joker | ||
jruby | ||
jython | ||
kona | ||
lfe | ||
lolcode | ||
love | ||
lua-5 | ||
luajit | ||
lush | ||
maude | ||
metamath | ||
micropython | ||
mujs | ||
nix-exec | ||
octave | ||
perl | ||
php | ||
picoc | ||
picolisp | ||
pixie | ||
proglodyte-wasm | ||
pure | ||
pyrex | ||
python | ||
qnial | ||
quickjs | ||
racket | ||
rakudo | ||
rascal | ||
rebol | ||
red | ||
regina | ||
renpy | ||
ruby | ||
scheme48 | ||
scsh | ||
self | ||
spidermonkey | ||
supercollider | ||
tcl | ||
tinyscheme | ||
unicon-lang | ||
wasmer | ||
wasmtime |