Optimize `diagonalize_measurements` Transform for Enhanced Performance #6742

JakeKitchen · 2024-12-29T05:05:35Z

Context:

The original implementation had some inefficiencies, especially with large circuits and multiple measurements.

Change:

Use frozenset:
- Converted supported_base_obs and related observables to frozenset for faster lookups and immutability.
Optimize Membership Checks:
- Replaced multiple set operations with direct frozenset operations to improve performance.
Precompute Measurements:
- Extracted Pauli measurements in a single step to avoid redundant iterations.
Streamline Functions:
- Simplified _check_if_diagonalizing and observable handling functions to reduce computational overhead.
Minimize Copy Operations:
- Limited the use of the copy operations to necessary instances only.

Benefits:

Faster Execution: Improved lookup times and reduced redundant operations enhance overall performance.
Better Readability: Cleaner code structure makes it easier to understand and maintain.
Lower Memory Usage: Reduced unnecessary list creations and copies decrease memory consumption.
Enhanced Scalability: Optimizations support larger and more complex quantum circuits efficiently.

Possible Drawbacks:

None identified.

Related GitHub Issues:

N/A

codecov · 2024-12-29T06:55:08Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 99.60%. Comparing base (2a766d4) to head (04a6db0).

Additional details and impacted files

@@           Coverage Diff           @@
##           master    #6742   +/-   ##
=======================================
  Coverage   99.60%   99.60%           
=======================================
  Files         476      476           
  Lines       45232    45231    -1     
=======================================
- Hits        45055    45054    -1     
  Misses        177      177

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

albi3ro · 2024-12-30T14:18:08Z

Hi @JakeKitchen . Thanks for this PR.

What types of workflows were this performance bottleneck showing up on? Do you have any profiling and timing data?

JakeKitchen · 2024-12-30T16:44:34Z

Hi @JakeKitchen . Thanks for this PR.

What types of workflows were this performance bottleneck showing up on? Do you have any profiling and timing data?

It wasn't bottlenecks necessarily its just general improvements for about 10% speed improvements for when diagonalize measurements gets called by precomputing as much as possible

JakeKitchen · 2024-12-30T16:56:03Z

n_qubits | Original (ms) | New Implementation (ms) | Speedup
------------------------------------------------------------
       2 |        0.821 |               0.811 |    1.01x
       4 |        1.282 |               1.245 |    1.03x
       8 |        2.300 |               2.210 |    1.04x
      16 |       28.791 |              27.980 |    1.03x

n_qubits | Original (ms) | New Implementation (ms) | Speedup
------------------------------------------------------------
       2 |        0.818 |               0.810 |    1.01x
       4 |        1.490 |               1.460 |    1.02x
       8 |        2.477 |               2.360 |    1.05x
      16 |       29.421 |              28.900 |    1.02x

Here is some benchmarks the Simple Circuit was preforming Hadamard's followed by Pauli-X expectation value measurements on all the qubits and the Complex circuit was mixed measurements that are qubitwise commuting (interleaved Pauli-X and Pauli-Z measurements) each ran through 100 iterations

albi3ro · 2024-12-30T18:03:21Z

n_qubits | Original (ms) | New Implementation (ms) | Speedup
------------------------------------------------------------
       2 |        0.821 |               0.811 |    1.01x
       4 |        1.282 |               1.245 |    1.03x
       8 |        2.300 |               2.210 |    1.04x
      16 |       28.791 |              27.980 |    1.03x

n_qubits | Original (ms) | New Implementation (ms) | Speedup
------------------------------------------------------------
      2 |        0.818 |               0.810 |    1.01x
      4 |        1.490 |               1.460 |    1.02x
      8 |        2.477 |               2.360 |    1.05x
     16 |       29.421 |              28.900 |    1.02x

Here is some benchmarks the Simple Circuit was preforming Hadamard's followed by Pauli-X expectation value measurements on all the qubits and the Complex circuit was mixed measurements that are qubitwise commuting (interleaved Pauli-X and Pauli-Z measurements) each ran through 100 iterations

Thanks for these. Mind including the code you benchmarked as well?

lillian542

Thanks for looking into tidying this up! The docstrings and set conversions look great, I just left a couple of small suggestions.

I have one concern with the implementation, specifically the change to the check that decides whether to attempt the pauli_rep based diagonalization method, or to proceed directly to the less efficient (but broader) backup implementation. I added more details at the relevant line.

pennylane/transforms/diagonalize_measurements.py

Co-authored-by: lillian542 <38584660+lillian542@users.noreply.github.com>

JakeKitchen · 2024-12-30T20:05:21Z

n_qubits | Original (ms) | New Implementation (ms) | Speedup
------------------------------------------------------------
       2 |        0.821 |               0.811 |    1.01x
       4 |        1.282 |               1.245 |    1.03x
       8 |        2.300 |               2.210 |    1.04x
      16 |       28.791 |              27.980 |    1.03x
n_qubits | Original (ms) | New Implementation (ms) | Speedup
------------------------------------------------------------
      2 |        0.818 |               0.810 |    1.01x
      4 |        1.490 |               1.460 |    1.02x
      8 |        2.477 |               2.360 |    1.05x
     16 |       29.421 |              28.900 |    1.02x
Here is some benchmarks the Simple Circuit was preforming Hadamard's followed by Pauli-X expectation value measurements on all the qubits and the Complex circuit was mixed measurements that are qubitwise commuting (interleaved Pauli-X and Pauli-Z measurements) each ran through 100 iterations
Thanks for these. Mind including the code you benchmarked as well?

import pennylane as qml
import timeit
import numpy as np
from updated_diagonalize import diagonalize_measurements as new_diagonalize
from pennylane.transforms import diagonalize_measurements as original_diagonalize

def create_simple_circuit(n_qubits):
    dev = qml.device("default.qubit", wires=n_qubits)
    
    @qml.qnode(dev)
    def circuit():
        for i in range(n_qubits):
            qml.Hadamard(wires=i)
        return [qml.expval(qml.X(i)) for i in range(n_qubits)]
    
    return circuit

def create_complex_circuit(n_qubits):
    dev = qml.device("default.qubit", wires=n_qubits)
    
    @qml.qnode(dev)
    def circuit():
        for i in range(n_qubits):
            qml.Hadamard(wires=i)
            qml.RX(0.5, wires=i)
        
        measurements = []
        for i in range(0, n_qubits-1, 2):
            measurements.append(qml.expval(qml.X(i)))
            measurements.append(qml.expval(qml.Z(i+1)))
        
        if n_qubits % 2:
            measurements.append(qml.expval(qml.X(n_qubits-1)))
            
        return measurements
    
    return circuit

def benchmark_implementation(circuit, transform_fn, n_runs=100):
    transformed_circuit = transform_fn(circuit)
    
    start_time = timeit.default_timer()
    for _ in range(n_runs):
        transformed_circuit()
    end_time = timeit.default_timer()
    
    return (end_time - start_time) / n_runs

def run_benchmarks():
    print("Running benchmarks...")
    print("\nSimple Circuit Benchmarks:")
    print("n_qubits | Original (ms) | New Implementation (ms) | Speedup")
    print("-" * 60)
    
    for n_qubits in [2, 4, 8, 16]:
        circuit = create_simple_circuit(n_qubits)

        original_time = benchmark_implementation(circuit, original_diagonalize) * 1000
        
        new_time = benchmark_implementation(circuit, new_diagonalize) * 1000
        
        speedup = original_time / new_time if new_time > 0 else float('inf')
        
        print(f"{n_qubits:8d} | {original_time:12.3f} | {new_time:19.3f} | {speedup:7.2f}x")
    
    print("\nComplex Circuit Benchmarks:")
    print("n_qubits | Original (ms) | New Implementation (ms) | Speedup")
    print("-" * 60)
    
    for n_qubits in [2, 4, 8, 16]:
        circuit = create_complex_circuit(n_qubits)
        
        original_time = benchmark_implementation(circuit, original_diagonalize) * 1000

        new_time = benchmark_implementation(circuit, new_diagonalize) * 1000
        
        speedup = original_time / new_time if new_time > 0 else float('inf')
        
        print(f"{n_qubits:8d} | {original_time:12.3f} | {new_time:19.3f} | {speedup:7.2f}x")

if __name__ == "__main__":
    run_benchmarks()

pretty rudimentary benchmark but gets the job done

looks a little funny because I ran through black linter

lillian542

Thanks for the updates @JakeKitchen! I've left a couple of comments, and requested a second reviewer from the team (all PRs need two approvals).

lillian542 · 2025-01-02T21:56:27Z

tests/transforms/test_diagonalize_measurements.py

+        """Test that _diagonalize_all_pauli_obs is only used when ALL observables have pauli_rep,
+        not just when ANY observables have pauli_rep. This test would fail if we used the condition
+        (pauli_measurements and diagonalize_all) which only checks if ANY observables have pauli_rep.
+        """


I'm wondering what additional insight this test provides compared to the one above? The earlier one already covers the case where some, but not all, of the observables have a pauli_rep.

Additionally it seems like, for all but the first parametrization, whether or not the observables have a pauli_rep might not matter in this test, since the diagonalize_all condition will be False. So, in that case, not incompatible_measurements and diagonalize_all would end up being False regardless of the value for incompatible_measurements. I'm curious if I'm missing something here!

It's handling a edge case that verifies that the diagonalization behavior is determined by the supported bases, NOT by the pauli_rep property. Even if all observables had pauli_rep, the diagonalization would still follow the supported_base_obs parameter.

Can you update the docstring (and maybe the test name) if that is the intention? Right now it describes testing the behaviour determined by the presence/absence of a pauli_rep.

If that's what you want to test, you will also need to update the circuit so all the observables have a pauli_rep in order to isolate the execution of logic supported bases. Right now the first parametrization uses the fallback method because of the Hadamard, and everything else uses it both because of the Hadamard and the supported gates, so this test doesn't really isolate either part of the 'decision tree'.

tests/transforms/test_diagonalize_measurements.py

pennylane/transforms/diagonalize_measurements.py

albi3ro · 2025-01-02T22:33:30Z

pennylane/transforms/diagonalize_measurements.py

+    diagonalizing_gates, diagonal_measurements = rotations_and_diagonal_measurements(tape)
    new_measurements = []

-    diagonalizing_gates, diagonal_measurements = rotations_and_diagonal_measurements(tape)
    for m in diagonal_measurements:


What is this change doing? It looks like we are just switching the order of lines. Am I missing something?

just switched this because it looks more clear imo

Co-authored-by: Christina Lee <chrissie.c.l@gmail.com>

astralcai · 2025-01-06T21:45:58Z

pretty rudimentary benchmark but gets the job done

Based on the benchmarking script, we see that the thing being benchmarked is the execution time of the transformed circuit. The runtime of the transform itself isn't actually timed.

JakeKitchen added 8 commits December 29, 2024 00:04

Update diagonalize_measurements.py

7cc71e0

Update changelog-dev.md

0849418

Update diagonalize_measurements.py

1a9e23f

Update diagonalize_measurements.py

3a1e2e2

Update diagonalize_measurements.py

a9cc36f

Update diagonalize_measurements.py

fbb4c58

Update diagonalize_measurements.py

6d0f293

Update diagonalize_measurements.py

aa83e17

JakeKitchen added 3 commits December 29, 2024 01:56

Update diagonalize_measurements.py

103584d

Update diagonalize_measurements.py

f7f5c2b

Update changelog-dev.md

350eaa9

JakeKitchen changed the title ~~Update diagonalize_measurements.py~~ Optimize diagonalize_measurements Transform for Enhanced Performance Dec 29, 2024

[no ci] bump nightly version

a0c24f5

Merge branch 'master' into master

01cfba5

lillian542 self-requested a review December 30, 2024 18:08

lillian542 reviewed Dec 30, 2024

View reviewed changes

pennylane/transforms/diagonalize_measurements.py Outdated Show resolved Hide resolved

pennylane/transforms/diagonalize_measurements.py Outdated Show resolved Hide resolved

pennylane/transforms/diagonalize_measurements.py Outdated Show resolved Hide resolved

JakeKitchen and others added 4 commits December 30, 2024 15:00

Update pennylane/transforms/diagonalize_measurements.py

ab9a08b

Co-authored-by: lillian542 <38584660+lillian542@users.noreply.github.com>

Update pennylane/transforms/diagonalize_measurements.py

485d9ba

Co-authored-by: lillian542 <38584660+lillian542@users.noreply.github.com>

Update pennylane/transforms/diagonalize_measurements.py

46f8adf

Co-authored-by: lillian542 <38584660+lillian542@users.noreply.github.com>

Update diagonalize_measurements.py

31e0121

JakeKitchen added 4 commits December 30, 2024 15:11

Merge branch 'master' into master

415574f

Update test_diagonalize_measurements.py

a0b4afd

looks a little funny because I ran through black linter

Update test_diagonalize_measurements.py

ac7774a

Update test_diagonalize_measurements.py

a3ceabc

ringo-but-quantum and others added 9 commits December 31, 2024 09:55

[no ci] bump nightly version

8c9b84b

Merge branch 'master' into master

32b42a7

Merge branch 'master' into master

a9ba891

[no ci] bump nightly version

52630c3

Merge branch 'master' into master

12f3779

Update non_parametric_ops.py

d1c64dc

Update non_parametric_ops.py

31c25be

[no ci] bump nightly version

569ce96

Merge branch 'master' into master

1480a6e

lillian542 requested a review from albi3ro January 2, 2025 22:01

lillian542 reviewed Jan 2, 2025

View reviewed changes

albi3ro reviewed Jan 2, 2025

View reviewed changes

pennylane/transforms/diagonalize_measurements.py Outdated Show resolved Hide resolved

albi3ro reviewed Jan 2, 2025

View reviewed changes

ringo-but-quantum and others added 6 commits January 3, 2025 09:55

[no ci] bump nightly version

c5d5fb8

Update pennylane/transforms/diagonalize_measurements.py

4a836aa

Co-authored-by: Christina Lee <chrissie.c.l@gmail.com>

Update test_diagonalize_measurements.py

82fd5e1

Update changelog-dev.md

0e91fc1

Merge branch 'master' into master

04a6db0

[no ci] bump nightly version

64d2d59

ringo-but-quantum added 4 commits January 7, 2025 09:55

[no ci] bump nightly version

0845947

[no ci] bump nightly version

6000f75

[no ci] bump nightly version

8ec6c9c

[no ci] bump nightly version

f5ee259

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize `diagonalize_measurements` Transform for Enhanced Performance #6742

Optimize `diagonalize_measurements` Transform for Enhanced Performance #6742

JakeKitchen commented Dec 29, 2024 •

edited

Loading

codecov bot commented Dec 29, 2024 •

edited

Loading

albi3ro commented Dec 30, 2024

JakeKitchen commented Dec 30, 2024

JakeKitchen commented Dec 30, 2024 •

edited

Loading

albi3ro commented Dec 30, 2024

lillian542 left a comment

JakeKitchen commented Dec 30, 2024 •

edited

Loading

lillian542 left a comment

lillian542 Jan 2, 2025

JakeKitchen Jan 4, 2025 •

edited

Loading

lillian542 Jan 6, 2025 •

edited

Loading

albi3ro Jan 2, 2025

JakeKitchen Jan 3, 2025 •

edited

Loading

astralcai commented Jan 6, 2025

Optimize diagonalize_measurements Transform for Enhanced Performance #6742

Are you sure you want to change the base?

Optimize diagonalize_measurements Transform for Enhanced Performance #6742

Conversation

JakeKitchen commented Dec 29, 2024 • edited Loading

Context:

Change:

Benefits:

Possible Drawbacks:

Related GitHub Issues:

codecov bot commented Dec 29, 2024 • edited Loading

Codecov Report

albi3ro commented Dec 30, 2024

JakeKitchen commented Dec 30, 2024

JakeKitchen commented Dec 30, 2024 • edited Loading

albi3ro commented Dec 30, 2024

lillian542 left a comment

Choose a reason for hiding this comment

JakeKitchen commented Dec 30, 2024 • edited Loading

lillian542 left a comment

Choose a reason for hiding this comment

lillian542 Jan 2, 2025

Choose a reason for hiding this comment

JakeKitchen Jan 4, 2025 • edited Loading

Choose a reason for hiding this comment

lillian542 Jan 6, 2025 • edited Loading

Choose a reason for hiding this comment

albi3ro Jan 2, 2025

Choose a reason for hiding this comment

JakeKitchen Jan 3, 2025 • edited Loading

Choose a reason for hiding this comment

astralcai commented Jan 6, 2025

Optimize `diagonalize_measurements` Transform for Enhanced Performance #6742

Optimize `diagonalize_measurements` Transform for Enhanced Performance #6742

JakeKitchen commented Dec 29, 2024 •

edited

Loading

codecov bot commented Dec 29, 2024 •

edited

Loading

JakeKitchen commented Dec 30, 2024 •

edited

Loading

JakeKitchen commented Dec 30, 2024 •

edited

Loading

JakeKitchen Jan 4, 2025 •

edited

Loading

lillian542 Jan 6, 2025 •

edited

Loading

JakeKitchen Jan 3, 2025 •

edited

Loading