gh-100239: specialize long tail of binary operations #128722

iritkatriel · 2025-01-10T23:27:25Z

This implements part of #100239: the four arithmetic ops between int, float and float, int.

Microbenchmarks:

Old:

>>> timeit("for i in range(10000):\n\tb = a+i", number=100000, setup="a = 1.0")
37.0931663562078
>>> timeit("for i in range(10000):\n\tb = i+a", number=100000, setup="a = 1.0")
40.84421204589307

New:

>>> timeit("for i in range(10000):\n\tb = a+i", number=100000, setup="a = 1.0")
31.263000949984416
>>> timeit("for i in range(10000):\n\tb = i+a", number=100000, setup="a = 1.0")
31.243564788019285

So performance is 20-30% better, and also more uniform (old is 10% slower for int+float compared to float+int).

Full benchmarks don't show an overall speedup, but they do show better specialisation stats for BINARY_OP:
https://github.com/faster-cpython/benchmarking-public/tree/main/results/bm-20250110-3.14.0a3+-7264e37#readme

Issue: Specialize long tail of binary operations using a table. #100239

Fidget-Spinner · 2025-01-11T03:10:32Z

Nice! The arithmetic benchmarks show a good speedup in the provided link:

spectral_norm	107 ms	96.3 ms: 1.11x faster
pyflate	480 ms	437 ms: 1.10x faster
chaos	61.1 ms	57.9 ms: 1.06x faster

eendebakpt

This already looks quite good. To get a feeling for the interfaces I added a few more specializations. See iritkatriel/cpython@gh-100239...eendebakpt:cpython:gh-100239-list-tuple-add

Adding more specializations is quite easy, but if we end up adding more we will need some more macros or tooling (such as for example the TRY_BINARY_SPECIALIZATION in the branch above). Fine to leave that to a followup PR though.

Python/specialize.c

Co-authored-by: Pieter Eendebak <pieter.eendebak@gmail.com>

Include/internal/pycore_code.h

Python/specialize.c

markshannon

Looks promising. We might want to add some filtering based on the class before calling the guard function when specializing. Calling a long chain of guard functions could be expensive.

Looking at the stats, it seems that this doesn't make that much difference to the number of BINARY_OPs that are specialized.
Unfortunately the stats don't tell us which class pairs to add, but I think str % str and str % tuple would be worth a look.
Or we could enhance the stats to give us cls/cls/operator triples, at least for those classes with a small version number?

Include/internal/pycore_code.h

Python/bytecodes.c

Python/specialize.c

…naryOpSpecializationDescr

This reverts commit ede9e8c.

markshannon

A couple of suggestions, but nothing blocking.

OOI what was causing the earlier test failures?

Include/internal/pycore_code.h

Python/bytecodes.c

iritkatriel · 2025-01-16T11:56:58Z

OOI what was causing the earlier test failures?

This was missing: #128892

The test assumes some valid opcode is invalid.

Co-authored-by: Mark Shannon <mark@hotpy.org>

iritkatriel · 2025-01-17T10:37:10Z

I repeated the benchmarks with the multiply bug (that prevented specialisation) fixed:
https://github.com/faster-cpython/benchmarking-public/tree/main/results/bm-20250116-3.14.0a4+-3893a92

pythongh-100239: specialize long tail of binary operations

7264e37

iritkatriel requested review from ericsnowcurrently and markshannon as code owners January 10, 2025 23:27

bedevere-app bot mentioned this pull request Jan 10, 2025

Specialize long tail of binary operations using a table. #100239

Open

bedevere-app bot added the awaiting core review label Jan 10, 2025

iritkatriel requested review from mdboom and eendebakpt January 10, 2025 23:27

iritkatriel added 2 commits January 10, 2025 23:54

add news

89b4679

fix fomatting

6dba21a

eendebakpt reviewed Jan 11, 2025

View reviewed changes

Python/specialize.c Outdated Show resolved Hide resolved

Python/specialize.c Outdated Show resolved Hide resolved

Python/specialize.c Outdated Show resolved Hide resolved

eendebakpt reviewed Jan 11, 2025

View reviewed changes

Python/specialize.c Outdated Show resolved Hide resolved

iritkatriel and others added 3 commits January 11, 2025 23:26

assert rather than deopt if no descr/guard

7bf51d6

Apply suggestions from code review

69112e9

Co-authored-by: Pieter Eendebak <pieter.eendebak@gmail.com>

long --> compactlong

fd88c8d

Fidget-Spinner mentioned this pull request Jan 12, 2025

Need more non-escaping specializations faster-cpython/ideas#704

Closed

mdboom approved these changes Jan 13, 2025

View reviewed changes

Include/internal/pycore_code.h Show resolved Hide resolved

Python/specialize.c Outdated Show resolved Hide resolved

bedevere-app bot added awaiting merge and removed awaiting core review labels Jan 13, 2025

iritkatriel added 4 commits January 13, 2025 19:59

static specs

f29d8be

add test and fix multiply

efe8e0c

Merge remote-tracking branch 'upstream/main' into pythongh-100239

c74a196

Merge remote-tracking branch 'upstream/main' into pythongh-100239

68c4f34

markshannon reviewed Jan 14, 2025

View reviewed changes

iritkatriel added 6 commits January 14, 2025 15:39

void _Py_Specialize_BinaryOp. PyBinaryOpSpecializationDescr --> _PyBi…

0432b9a

…naryOpSpecializationDescr

let generator deal with fetching descr from the cache

d87946a

Merge remote-tracking branch 'upstream/main' into pythongh-100239

cffb3d6

make descr arrays const to please the globals check

ede9e8c

Revert "make descr arrays const to please the globals check"

a144cdc

This reverts commit ede9e8c.

add globals to ignored.tsv

e526f4e

remove unused function

3d6b60f

iritkatriel requested a review from markshannon January 16, 2025 00:04

markshannon approved these changes Jan 16, 2025

View reviewed changes

Include/internal/pycore_code.h Outdated Show resolved Hide resolved

Python/bytecodes.c Outdated Show resolved Hide resolved

markshannon mentioned this pull request Jan 16, 2025

GH-117581: Specialize binary operators by refcount as well as type. #117627

Closed

iritkatriel and others added 3 commits January 16, 2025 11:57

Update Python/bytecodes.c

988369d

Co-authored-by: Mark Shannon <mark@hotpy.org>

write_ptr

16c8ed6

Merge branch 'main' into pythongh-100239

47395aa

markshannon mentioned this pull request Jan 16, 2025

GH-128914: Remove conditional stack effects from bytecodes.c and the code generators #128918

Merged

iritkatriel merged commit 3893a92 into python:main Jan 16, 2025
65 checks passed

bedevere-app bot removed the awaiting merge label Jan 16, 2025

encukou mentioned this pull request Jan 23, 2025

Specialize BINARY_OP by refcount and by type of operands #117581

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

gh-100239: specialize long tail of binary operations #128722

gh-100239: specialize long tail of binary operations #128722

Uh oh!

iritkatriel commented Jan 10, 2025 •

edited by bedevere-app bot

Loading

Uh oh!

Fidget-Spinner commented Jan 11, 2025

Uh oh!

eendebakpt left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

markshannon left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

markshannon left a comment

Uh oh!

Uh oh!

Uh oh!

iritkatriel commented Jan 16, 2025

Uh oh!

Uh oh!

iritkatriel commented Jan 17, 2025

Uh oh!

Uh oh!

Uh oh!

gh-100239: specialize long tail of binary operations #128722

gh-100239: specialize long tail of binary operations #128722

Uh oh!

Conversation

iritkatriel commented Jan 10, 2025 • edited by bedevere-app bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Fidget-Spinner commented Jan 11, 2025

Uh oh!

eendebakpt left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

markshannon left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

markshannon left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

iritkatriel commented Jan 16, 2025

Uh oh!

Uh oh!

iritkatriel commented Jan 17, 2025

Uh oh!

Uh oh!

iritkatriel commented Jan 10, 2025 •

edited by bedevere-app bot

Loading