해밀토니안 시뮬레이션 Circuit을 위한 컴파일 방법

예상 사용량: IBM Heron 프로세서에서 1분 미만 (참고: 이는 추정치이며, 실제 런타임은 다를 수 있습니다.)

학습 목표

이 튜토리얼을 완료하면 다음을 이해하게 됩니다:

레이아웃 및 라우팅 최적화를 위해 SABRE와 함께 Qiskit Transpiler를 사용하는 방법
고급 Circuit 최적화를 위해 AI 기반 Transpiler를 활용하는 방법
해밀토니안 시뮬레이션 Circuit에서 PauliEvolutionGate 연산을 합성하기 위해 Rustiq 플러그인을 사용하는 방법
2-Qubit 깊이, 전체 Gate 수, 런타임을 사용하여 컴파일 방법을 벤치마크하고 비교하는 방법

사전 요구사항

이 튜토리얼을 진행하기 전에 다음 주제에 익숙한 것을 권장합니다:

배경

양자 Circuit 컴파일은 고수준의 양자 알고리즘을 대상 하드웨어의 제약 조건을 준수하는 물리적 Circuit으로 변환합니다. 효과적인 컴파일은 Circuit 깊이와 Gate 수를 크게 줄일 수 있으며, 이 두 가지는 모두 근거리 양자 디바이스에서의 결과 품질에 직접적인 영향을 미칩니다.

이 튜토리얼은 PauliEvolutionGate로 구성된 해밀토니안 시뮬레이션 Circuit에서 세 가지 컴파일 방법을 벤치마크합니다. 이러한 Circuit은 쌍별 Qubit 상호작용(예: $ZZ$ , $XX$ , $YY$ 항)을 모델링하며, 양자 화학, 응집물질 물리학, 재료 과학에서 일반적으로 사용됩니다.

벤치마크 Circuit은 Benchpress 저장소를 통해 접근한 Hamlib 컬렉션에서 가져왔습니다. Hamlib는 표준화된 대표 해밀토니안 집합을 제공하여 현실적인 시뮬레이션 워크로드에서 컴파일 전략을 비교할 수 있게 합니다.

컴파일 방법 개요

SABRE를 사용한 Qiskit Transpiler

Qiskit Transpiler는 SABRE(SWAP 기반 양방향 휴리스틱 탐색) 알고리즘을 사용하여 Circuit 레이아웃 및 라우팅을 최적화합니다. SABRE는 하드웨어 연결 제약을 준수하면서 SWAP Gate와 이로 인한 Circuit 깊이 영향을 최소화하는 데 초점을 맞춥니다. 이는 성능과 컴파일 시간 사이의 좋은 균형을 제공하는 범용적인 방법입니다. 자세한 내용은 [1]을 참조하세요. SABRE의 장점과 파라미터 탐색은 별도의 튜토리얼에서 심층적으로 다룹니다.

AI 기반 Transpiler

AI 기반 Transpiler는 머신 러닝을 사용하여 Circuit 구조와 하드웨어 제약의 패턴을 분석해 최적의 트랜스파일 전략을 예측합니다. 또한 강화 학습 기반 합성 방식을 사용하여 Pauli 네트워크 Circuit을 대상으로 하는 AIPauliNetworkSynthesis 패스를 적용할 수 있습니다. 자세한 내용은 [2] 및 [3]을 참조하세요.

Rustiq 플러그인

Rustiq 플러그인은 트로터화된 동역학에서 일반적으로 사용되는 Pauli 회전을 나타내는 PauliEvolutionGate 연산을 위한 고급 합성 기술을 제공합니다. 해밀토니안 시뮬레이션 워크로드를 위해 낮은 깊이의 Circuit 분해를 생성하도록 설계되었습니다. 자세한 내용은 [4]를 참조하세요.

주요 지표

세 가지 방법을 다음 지표로 비교합니다:

2-Qubit 깊이: 2-Qubit Gate만 계산하는 Circuit의 깊이. 이는 실제 하드웨어에서 충실도의 병목이 되는 경우가 많습니다.
Circuit 크기 (전체 Gate 수): 트랜스파일된 Circuit의 전체 Gate 수.
런타임: 트랜스파일에 소요되는 벽시계 시간.

요구 사항

이 튜토리얼을 시작하기 전에 다음이 설치되어 있는지 확인하세요:

Qiskit SDK v2.0 이상 (시각화 지원 포함)
Qiskit Runtime v0.22 이상 (pip install qiskit-ibm-runtime)
Qiskit Aer (pip install qiskit-aer)
Qiskit IBM Transpiler (pip install qiskit-ibm-transpiler)
Qiskit AI Transpiler 로컬 모드 (pip install qiskit_ibm_ai_local_transpiler)
Networkx (pip install networkx)

설정

# Added by doQumentation — required packages for this notebook
!pip install -q matplotlib numpy qiskit qiskit-aer qiskit-ibm-runtime qiskit-ibm-transpiler requests scipy

from qiskit.circuit import QuantumCircuit
from qiskit_ibm_runtime import QiskitRuntimeService, SamplerV2
from qiskit.circuit.library import PauliEvolutionGate
from qiskit_ibm_transpiler import generate_ai_pass_manager
from qiskit.quantum_info import SparsePauliOp
from qiskit.transpiler.preset_passmanagers import generate_preset_pass_manager
from qiskit.transpiler.passes.synthesis.high_level_synthesis import HLSConfig
from qiskit_aer import AerSimulator
from qiskit_aer.noise import NoiseModel, depolarizing_error
from collections import Counter
from statistics import mean, stdev
from scipy.sparse import SparseEfficiencyWarning
import time
import warnings
import matplotlib.pyplot as plt
import matplotlib.ticker as ticker
import numpy as np
import json
import requests
import logging

# Suppress noisy loggers and warnings
logging.getLogger(
    "qiskit_ibm_transpiler.wrappers.ai_local_synthesis"
).setLevel(logging.ERROR)
warnings.filterwarnings("ignore", category=FutureWarning)
warnings.filterwarnings("ignore", category=SparseEfficiencyWarning)

seed = 42  # Seed for reproducibility

Backend에 연결하기

소규모 및 대규모 예제 모두에 사용될 Backend를 선택합니다. Backend는 Transpiler가 목표로 하는 커플링 맵과 기저 게이트를 결정합니다.

# QiskitRuntimeService.save_account(channel="ibm_quantum_platform",
# token="<YOUR-API-KEY>", overwrite=True, set_as_default=True)
service = QiskitRuntimeService(channel="ibm_quantum_platform")
backend = service.least_busy(operational=True, simulator=False)
print(f"Using backend: {backend.name}")

Using backend: ibm_pittsburgh

Pass Manager 정의

세 가지 컴파일 방법을 설정합니다.

# SABRE pass manager (Qiskit default at optimization level 3)
pm_sabre = generate_preset_pass_manager(
    optimization_level=3, backend=backend, seed_transpiler=seed
)

# AI transpiler pass manager (local mode)
pm_ai = generate_ai_pass_manager(
    backend=backend, optimization_level=3, ai_optimization_level=3
)

Fetching 127 files:   0%|          | 0/127 [00:00<?, ?it/s]

# Rustiq pass manager for PauliEvolutionGate synthesis
hls_config = HLSConfig(
    PauliEvolution=[
        (
            "rustiq",
            {
                "nshuffles": 400,
                "upto_phase": True,
                "fix_clifford": True,
                "preserve_order": False,
                "metric": "depth",
            },
        )
    ]
)
pm_rustiq = generate_preset_pass_manager(
    optimization_level=3,
    backend=backend,
    hls_config=hls_config,
    seed_transpiler=seed,
)

헬퍼 함수 정의

다음 함수는 주어진 Pass Manager를 사용하여 Circuit 목록을 트랜스파일하고, 각 Circuit에 대한 주요 지표(2-Qubit 깊이, Circuit 크기, 런타임)를 기록합니다.

def capture_transpilation_metrics(
    results, pass_manager, circuits, method_name
):
    """
    Transpile circuits and append one metrics record per circuit to
    ``results``.

    Args:
        results (list): List of dicts to append the metrics records to.
        pass_manager: Pass manager used for transpilation.
        circuits (list): List of quantum circuits to transpile.
        method_name (str): Name of the transpilation method.

    Returns:
        list: List of transpiled circuits.
    """
    transpiled_circuits = []

    for i, qc in enumerate(circuits):
        start_time = time.time()
        transpiled_qc = pass_manager.run(qc)
        end_time = time.time()

        # Decompose swaps for consistency across methods
        transpiled_qc = transpiled_qc.decompose(gates_to_decompose=["swap"])

        transpilation_time = end_time - start_time
        two_qubit_depth = transpiled_qc.depth(
            lambda x: x.operation.num_qubits == 2
        )
        circuit_size = transpiled_qc.size()

        results.append(
            {
                "method": method_name,
                "qc_name": qc.name,
                "qc_index": i,
                "num_qubits": qc.num_qubits,
                "two_qubit_depth": two_qubit_depth,
                "size": circuit_size,
                "runtime": transpilation_time,
            }
        )
        transpiled_circuits.append(transpiled_qc)
        print(
            f"[{method_name}] Circuit {i} ({qc.name}): "
            f"2Q depth={two_qubit_depth}, size={circuit_size}, "
            f"time={transpilation_time:.2f}s"
        )

    return transpiled_circuits

def _method_order(results):
    """Return the distinct method names in their first-seen order."""
    order = []
    for r in results:
        if r["method"] not in order:
            order.append(r["method"])
    return order

def print_summary_table(results):
    """
    Print the mean and standard deviation of each metric per compilation
    method, followed by the mean percent improvement relative to SABRE.
    """
    metrics = [
        ("two_qubit_depth", "2Q Depth"),
        ("size", "Gate Count"),
        ("runtime", "Runtime (s)"),
    ]
    methods = _method_order(results)
    by_method = {m: [r for r in results if r["method"] == m] for m in methods}
    sabre_by_index = {r["qc_index"]: r for r in by_method.get("SABRE", [])}

    col_w = 22
    name_w = max(len(m) for m in methods)
    header = f"{'Method':<{name_w}}" + "".join(
        f"  {label:>{col_w}}" for _, label in metrics
    )

    print("Mean +/- std per compilation method")
    print(header)
    print("-" * len(header))
    for method in methods:
        cells = []
        for key, _ in metrics:
            values = [r[key] for r in by_method[method]]
            std = stdev(values) if len(values) > 1 else 0.0
            cells.append(f"{mean(values):,.1f} +/- {std:,.1f}")
        print(
            f"{method:<{name_w}}" + "".join(f"  {c:>{col_w}}" for c in cells)
        )

    others = [m for m in methods if m != "SABRE"]
    if others and sabre_by_index:
        print()
        print("Mean % improvement vs SABRE (positive = better than SABRE)")
        print(header)
        print("-" * len(header))
        for method in others:
            cells = []
            for key, _ in metrics:
                pct = [
                    (sabre_by_index[r["qc_index"]][key] - r[key])
                    / sabre_by_index[r["qc_index"]][key]
                    * 100
                    for r in by_method[method]
                    if sabre_by_index.get(r["qc_index"])
                    and sabre_by_index[r["qc_index"]][key]
                ]
                if pct:
                    std = stdev(pct) if len(pct) > 1 else 0.0
                    cells.append(f"{mean(pct):+.1f}% +/- {std:.1f}%")
                else:
                    cells.append("n/a")
            print(
                f"{method:<{name_w}}"
                + "".join(f"  {c:>{col_w}}" for c in cells)
            )

def print_per_circuit_comparison(results, num_rows=5):
    """
    Print a per-metric comparison of the compilation methods for the
    first ``num_rows`` circuits (sorted by qubit count). The best
    (lowest) value for each metric is marked with an asterisk.
    """
    metrics = [
        ("two_qubit_depth", "2Q Depth"),
        ("size", "Gate Count"),
        ("runtime", "Runtime (s)"),
    ]
    methods = _method_order(results)

    by_index = {}
    for r in results:
        by_index.setdefault(r["qc_index"], {})[r["method"]] = r
    ordered = sorted(
        by_index.items(),
        key=lambda kv: (next(iter(kv[1].values()))["num_qubits"], kv[0]),
    )[:num_rows]

    for key, label in metrics:
        print(f"{label} (first {num_rows} circuits by qubit count); * = best")
        header = f"{'Idx':>3}  {'Circuit':<16} {'Q':>3}" + "".join(
            f"{m:>9}" for m in methods
        )
        print(header)
        print("-" * len(header))
        for idx, method_map in ordered:
            any_record = next(iter(method_map.values()))
            present = {
                m: method_map[m][key] for m in methods if m in method_map
            }
            best = min(present.values())
            line = (
                f"{idx:>3}  {any_record['qc_name'][:16]:<16} "
                f"{any_record['num_qubits']:>3}"
            )
            for m in methods:
                value = method_map[m][key]
                text = f"{value:.2f}" if key == "runtime" else f"{int(value)}"
                if value == best:
                    text += "*"
                line += f"{text:>9}"
            print(line)
        print()

Hamlib에서 해밀토니안 Circuit 불러오기

Benchpress 저장소에서 대표적인 해밀토니안 집합을 불러와 PauliEvolutionGate Circuit을 구성합니다. Backend의 Qubit 수를 초과하는 Circuit과 분해된 크기가 1,500개 Gate를 초과하는 Circuit(트랜스파일 시간을 적절히 유지하기 위해)은 제거됩니다.

# Obtain the Hamiltonian JSON from the benchpress repository
url = "https://raw.githubusercontent.com/Qiskit/benchpress/e7b29ef7be4cc0d70237b8fdc03edbd698908eff/benchpress/hamiltonian/hamlib/100_representative.json"
response = requests.get(url)
response.raise_for_status()
ham_records = json.loads(response.text)

# Remove circuits that are too large for the backend
ham_records = [
    h for h in ham_records if h["ham_qubits"] <= backend.num_qubits
]

# Build PauliEvolutionGate circuits
qc_ham_list = []
for h in ham_records:
    terms = h["ham_hamlib_hamiltonian_terms"]
    coeff = h["ham_hamlib_hamiltonian_coefficients"]
    num_qubits = h["ham_qubits"]
    name = h["ham_problem"]

    evo_gate = PauliEvolutionGate(SparsePauliOp(terms, coeff))
    qc = QuantumCircuit(num_qubits)
    qc.name = name
    qc.append(evo_gate, range(num_qubits))
    qc_ham_list.append(qc)

# Remove circuits whose decomposed size exceeds 1500 gates so that transpilation completes in a reasonable time frame
qc_ham_list = [qc for qc in qc_ham_list if qc.decompose().size() <= 1500]

print(f"Total Hamiltonian circuits loaded: {len(qc_ham_list)}")
print(
    f"Qubit range: {min(qc.num_qubits for qc in qc_ham_list)} to {max(qc.num_qubits for qc in qc_ham_list)}"
)

Total Hamiltonian circuits loaded: 42
Qubit range: 2 to 112

Circuit을 소규모(20 Qubit 미만)와 대규모(20 Qubit 이상) 그룹으로 분류합니다.

qc_small = [qc for qc in qc_ham_list if qc.num_qubits < 20]
qc_large = [qc for qc in qc_ham_list if qc.num_qubits >= 20]

print(f"Small-scale circuits (<20 qubits): {len(qc_small)}")
print(f"Large-scale circuits (>=20 qubits): {len(qc_large)}")

Small-scale circuits (<20 qubits): 20
Large-scale circuits (>=20 qubits): 22

트랜스파일 전 소규모 해밀토니안 Circuit 중 하나를 미리 봅니다.

# We decompose the circuit here, otherwise it would just be a PauliEvolutionGate box,
# which isn't very informative to look at!
qc_small[0].decompose().draw("mpl", fold=-1)

Output of the previous code cell

소규모 예제

이 섹션에서는 20 Qubit 미만의 해밀토니안 Circuit에서 세 가지 컴파일 방법을 벤치마크합니다. 이러한 Circuit은 빠르게 트랜스파일되며, 각 방법이 중간 복잡도의 Circuit을 어떻게 처리하는지 명확하게 보여줍니다.

1단계: 고전적 입력을 양자 문제로 매핑하기

각 해밀토니안은 PauliEvolutionGate Circuit으로 인코딩됩니다. Circuit은 설정 섹션에서 Hamlib 벤치마크 데이터로부터 이미 구성되었습니다.

2단계: 양자 하드웨어 실행을 위한 문제 최적화

세 가지 Pass Manager 각각으로 모든 소규모 Circuit을 트랜스파일하고 지표를 수집합니다.

results_small = []

tqc_sabre_small = capture_transpilation_metrics(
    results_small, pm_sabre, qc_small, "SABRE"
)
tqc_ai_small = capture_transpilation_metrics(
    results_small, pm_ai, qc_small, "AI"
)
tqc_rustiq_small = capture_transpilation_metrics(
    results_small, pm_rustiq, qc_small, "Rustiq"
)

[SABRE] Circuit 0 (all-vib-bh): 2Q depth=3, size=30, time=2.09s
[SABRE] Circuit 1 (all-vib-c2h): 2Q depth=18, size=111, time=0.01s
[SABRE] Circuit 2 (all-vib-o3): 2Q depth=6, size=58, time=0.00s
[SABRE] Circuit 3 (all-vib-c2h): 2Q depth=2, size=37, time=0.01s
[SABRE] Circuit 4 (graph-gnp_k-2): 2Q depth=24, size=126, time=0.01s
[SABRE] Circuit 5 (LiH): 2Q depth=66, size=285, time=0.01s
[SABRE] Circuit 6 (all-vib-fccf): 2Q depth=66, size=339, time=0.01s
[SABRE] Circuit 7 (all-vib-ch2): 2Q depth=88, size=413, time=0.01s
[SABRE] Circuit 8 (all-vib-f2): 2Q depth=180, size=1000, time=0.02s
[SABRE] Circuit 9 (all-vib-bhf2): 2Q depth=18, size=223, time=0.03s
[SABRE] Circuit 10 (graph-gnp_k-4): 2Q depth=122, size=675, time=0.02s
[SABRE] Circuit 11 (Be2): 2Q depth=343, size=1628, time=0.03s
[SABRE] Circuit 12 (all-vib-fccf): 2Q depth=14, size=134, time=0.00s
[SABRE] Circuit 13 (uf20-ham): 2Q depth=50, size=341, time=0.01s
[SABRE] Circuit 14 (TSP_Ncity-4): 2Q depth=118, size=615, time=0.01s
[SABRE] Circuit 15 (graph-complete_bipart): 2Q depth=232, size=1420, time=0.03s
[SABRE] Circuit 16 (all-vib-cyclo_propene): 2Q depth=18, size=354, time=0.93s
[SABRE] Circuit 17 (all-vib-hno): 2Q depth=6, size=174, time=0.14s
[SABRE] Circuit 18 (all-vib-fccf): 2Q depth=30, size=286, time=0.01s
[SABRE] Circuit 19 (tfim): 2Q depth=31, size=232, time=0.03s
[AI] Circuit 0 (all-vib-bh): 2Q depth=3, size=30, time=0.01s

Fetching 4 files:   0%|          | 0/4 [00:00<?, ?it/s]

[AI] Circuit 1 (all-vib-c2h): 2Q depth=18, size=101, time=0.18s
[AI] Circuit 2 (all-vib-o3): 2Q depth=6, size=58, time=0.01s
[AI] Circuit 3 (all-vib-c2h): 2Q depth=2, size=37, time=0.01s
[AI] Circuit 4 (graph-gnp_k-2): 2Q depth=24, size=133, time=0.07s
[AI] Circuit 5 (LiH): 2Q depth=62, size=267, time=8.00s
[AI] Circuit 6 (all-vib-fccf): 2Q depth=65, size=300, time=0.18s
[AI] Circuit 7 (all-vib-ch2): 2Q depth=79, size=353, time=0.16s
[AI] Circuit 8 (all-vib-f2): 2Q depth=176, size=998, time=0.43s
[AI] Circuit 9 (all-vib-bhf2): 2Q depth=18, size=194, time=0.11s
[AI] Circuit 10 (graph-gnp_k-4): 2Q depth=114, size=668, time=0.18s
[AI] Circuit 11 (Be2): 2Q depth=292, size=1382, time=0.88s
[AI] Circuit 12 (all-vib-fccf): 2Q depth=14, size=134, time=0.01s
[AI] Circuit 13 (uf20-ham): 2Q depth=40, size=330, time=0.16s
[AI] Circuit 14 (TSP_Ncity-4): 2Q depth=96, size=600, time=0.29s
[AI] Circuit 15 (graph-complete_bipart): 2Q depth=231, size=1531, time=0.46s
[AI] Circuit 16 (all-vib-cyclo_propene): 2Q depth=18, size=309, time=0.25s
[AI] Circuit 17 (all-vib-hno): 2Q depth=10, size=198, time=0.15s
[AI] Circuit 18 (all-vib-fccf): 2Q depth=34, size=402, time=0.02s
[AI] Circuit 19 (tfim): 2Q depth=44, size=311, time=0.15s
[Rustiq] Circuit 0 (all-vib-bh): 2Q depth=3, size=30, time=0.01s
[Rustiq] Circuit 1 (all-vib-c2h): 2Q depth=13, size=69, time=0.00s
[Rustiq] Circuit 2 (all-vib-o3): 2Q depth=13, size=82, time=0.01s
[Rustiq] Circuit 3 (all-vib-c2h): 2Q depth=2, size=40, time=0.01s
[Rustiq] Circuit 4 (graph-gnp_k-2): 2Q depth=31, size=132, time=0.01s
[Rustiq] Circuit 5 (LiH): 2Q depth=59, size=285, time=0.01s
[Rustiq] Circuit 6 (all-vib-fccf): 2Q depth=34, size=193, time=0.00s
[Rustiq] Circuit 7 (all-vib-ch2): 2Q depth=49, size=302, time=0.01s
[Rustiq] Circuit 8 (all-vib-f2): 2Q depth=141, size=807, time=0.02s
[Rustiq] Circuit 9 (all-vib-bhf2): 2Q depth=13, size=146, time=0.02s
[Rustiq] Circuit 10 (graph-gnp_k-4): 2Q depth=129, size=683, time=0.02s
[Rustiq] Circuit 11 (Be2): 2Q depth=220, size=1101, time=0.02s
[Rustiq] Circuit 12 (all-vib-fccf): 2Q depth=53, size=333, time=0.01s
[Rustiq] Circuit 13 (uf20-ham): 2Q depth=63, size=425, time=0.01s
[Rustiq] Circuit 14 (TSP_Ncity-4): 2Q depth=123, size=767, time=0.02s
[Rustiq] Circuit 15 (graph-complete_bipart): 2Q depth=309, size=2107, time=0.05s
[Rustiq] Circuit 16 (all-vib-cyclo_propene): 2Q depth=16, size=283, time=0.32s
[Rustiq] Circuit 17 (all-vib-hno): 2Q depth=19, size=291, time=0.32s
[Rustiq] Circuit 18 (all-vib-fccf): 2Q depth=44, size=546, time=0.02s
[Rustiq] Circuit 19 (tfim): 2Q depth=24, size=416, time=0.01s

아래 표는 모든 소규모 Circuit에 걸친 각 지표의 평균 및 표준 편차와 SABRE 대비 개선율을 요약합니다. Circuit 크기가 크게 다르기 때문에 표준 편차는 평균을 해석하는 중요한 맥락을 제공합니다.

print_summary_table(results_small)

Mean +/- std per compilation method
Method                2Q Depth              Gate Count             Runtime (s)
------------------------------------------------------------------------------
SABRE            71.8 +/- 89.6         424.1 +/- 446.0             0.2 +/- 0.5
AI               67.3 +/- 80.2         416.8 +/- 426.7             0.6 +/- 1.8
Rustiq           67.9 +/- 80.0         451.9 +/- 484.7             0.0 +/- 0.1

Mean % improvement vs SABRE (positive = better than SABRE)
Method                2Q Depth              Gate Count             Runtime (s)
------------------------------------------------------------------------------
AI             -2.1% +/- 19.8%         -0.6% +/- 14.7%   -5635.1% +/- 20725.2%
Rustiq        -25.3% +/- 85.4%        -16.3% +/- 50.4%         -7.0% +/- 60.6%

회로별 표는 각 방법이 개별 Circuit에서 어떻게 비교되는지 보여줍니다. 각 지표에서 가장 좋은 값은 별표(*)로 표시됩니다. 가장 단순한 Circuit에서는 세 방법 모두 종종 동일한 결과에 수렴하는 것을 알 수 있습니다.

print_per_circuit_comparison(results_small, num_rows=8)

2Q Depth (first 8 circuits by qubit count); * = best
Idx  Circuit            Q    SABRE       AI   Rustiq
----------------------------------------------------
all-vib-bh         2       3*       3*       3*
all-vib-c2h        3       18       18      13*
all-vib-o3         4       6*       6*       13
all-vib-c2h        4       2*       2*       2*
graph-gnp_k-2      4      24*      24*       31
LiH                4       66       62      59*
all-vib-fccf       4       66       65      34*
all-vib-ch2        4       88       79      49*

Gate Count (first 8 circuits by qubit count); * = best
Idx  Circuit            Q    SABRE       AI   Rustiq
----------------------------------------------------
all-vib-bh         2      30*      30*      30*
all-vib-c2h        3      111      101      69*
all-vib-o3         4      58*      58*       82
all-vib-c2h        4      37*      37*       40
graph-gnp_k-2      4     126*      133      132
LiH                4      285     267*      285
all-vib-fccf       4      339      300     193*
all-vib-ch2        4      413      353     302*

Runtime (s) (first 8 circuits by qubit count); * = best
Idx  Circuit            Q    SABRE       AI   Rustiq
----------------------------------------------------
all-vib-bh         2     2.09     0.01    0.01*
all-vib-c2h        3     0.01     0.18    0.00*
all-vib-o3         4    0.00*     0.01     0.01
all-vib-c2h        4     0.01     0.01    0.01*
graph-gnp_k-2      4    0.01*     0.07     0.01
LiH                4    0.01*     8.00     0.01
all-vib-fccf       4     0.01     0.18    0.00*
all-vib-ch2        4     0.01     0.16    0.01*

결과 시각화

아래 플롯은 Circuit별로 각 지표에 대해 세 가지 방법을 비교합니다. Circuit은 Qubit 수에 따라 정렬되며, 동일한 Qubit 수를 가진 여러 Circuit이 있을 수 있으므로 x축에는 인덱스로 레이블이 표시됩니다.

def plot_transpilation_comparison(results, title_prefix):
    """
    Create a three-panel figure comparing compilation methods on
    two-qubit depth, circuit size, and runtime.

    Circuits are sorted by qubit count and plotted by circuit index.
    """
    methods = _method_order(results)
    palette = {"SABRE": "#1f77b4", "AI": "#ff7f0e", "Rustiq": "#2ca02c"}
    markers = {"SABRE": "o", "AI": "^", "Rustiq": "s"}

    # Order circuits by qubit count (then index) and map to plot positions
    ref = sorted(
        [r for r in results if r["method"] == methods[0]],
        key=lambda r: (r["num_qubits"], r["qc_index"]),
    )
    pos_map = {r["qc_index"]: pos for pos, r in enumerate(ref)}
    tick_positions = [pos_map[r["qc_index"]] for r in ref]
    tick_labels = [
        f"{pos_map[r['qc_index']]} ({r['num_qubits']}q)" for r in ref
    ]

    metrics = [
        ("two_qubit_depth", "Two-Qubit Depth"),
        ("size", "Total Gate Count (Circuit Size)"),
        ("runtime", "Transpilation Runtime (s)"),
    ]

    fig, axes = plt.subplots(1, 3, figsize=(20, 5.5))
    fig.suptitle(title_prefix, fontsize=15, fontweight="bold", y=1.02)

    for ax, (metric, ylabel) in zip(axes, metrics):
        for method in methods:
            subset = sorted(
                [r for r in results if r["method"] == method],
                key=lambda r: pos_map[r["qc_index"]],
            )
            ax.plot(
                [pos_map[r["qc_index"]] for r in subset],
                [r[metric] for r in subset],
                marker=markers.get(method, "o"),
                label=method,
                color=palette.get(method, None),
                linewidth=1.5,
                markersize=6,
                alpha=0.85,
            )
        ax.set_xlabel("Circuit Index (num qubits)", fontsize=11)
        ax.set_ylabel(ylabel, fontsize=11)
        ax.legend(frameon=True, fontsize=9)
        ax.grid(True, linestyle="--", alpha=0.4)
        step = max(1, len(tick_positions) // 15)
        ax.set_xticks(tick_positions[::step])
        ax.set_xticklabels(
            [tick_labels[i] for i in range(0, len(tick_labels), step)],
            fontsize=7,
            rotation=45,
            ha="right",
        )

    plt.tight_layout()
    plt.show()

def plot_pct_improvement_vs_sabre(results, title_prefix):
    """
    Plot the per-circuit percent improvement of each non-SABRE method
    relative to SABRE, for each metric. A positive value means the
    method improved on SABRE; negative means SABRE was better.
    """
    metrics = [
        ("two_qubit_depth", "2Q Depth"),
        ("size", "Gate Count"),
        ("runtime", "Runtime"),
    ]
    palette = {"AI": "#ff7f0e", "Rustiq": "#2ca02c"}
    markers = {"AI": "^", "Rustiq": "s"}

    methods = _method_order(results)
    sabre = sorted(
        [r for r in results if r["method"] == "SABRE"],
        key=lambda r: (r["num_qubits"], r["qc_index"]),
    )
    other_methods = [m for m in methods if m != "SABRE"]

    tick_positions = list(range(len(sabre)))
    tick_labels = [
        f"{i} ({sabre[i]['num_qubits']}q)" for i in range(len(sabre))
    ]

    fig, axes = plt.subplots(1, 3, figsize=(20, 5.5))
    fig.suptitle(
        f"{title_prefix}: % Improvement over SABRE",
        fontsize=15,
        fontweight="bold",
        y=1.02,
    )

    for ax, (metric, label) in zip(axes, metrics):
        ax.axhline(0, color="#1f77b4", linewidth=2, label="SABRE (baseline)")
        for method in other_methods:
            data = sorted(
                [r for r in results if r["method"] == method],
                key=lambda r: (r["num_qubits"], r["qc_index"]),
            )
            pct = [
                (sabre[i][metric] - data[i][metric]) / sabre[i][metric] * 100
                for i in range(len(sabre))
            ]
            ax.plot(
                tick_positions,
                pct,
                marker=markers.get(method, "o"),
                label=method,
                color=palette.get(method, None),
                linewidth=1.5,
                markersize=6,
                alpha=0.85,
            )
        ax.set_xlabel("Circuit Index (num qubits)", fontsize=11)
        ax.set_ylabel(f"% Improvement ({label})", fontsize=11)
        ax.legend(frameon=True, fontsize=9)
        ax.grid(True, linestyle="--", alpha=0.4)
        step = max(1, len(tick_positions) // 15)
        ax.set_xticks(tick_positions[::step])
        ax.set_xticklabels(
            [tick_labels[i] for i in range(0, len(tick_labels), step)],
            fontsize=7,
            rotation=45,
            ha="right",
        )
        ylims = ax.get_ylim()
        ax.axhspan(0, max(ylims[1], 1), alpha=0.04, color="green")
        ax.axhspan(min(ylims[0], -1), 0, alpha=0.04, color="red")

    plt.tight_layout()
    plt.show()

plot_transpilation_comparison(
    results_small,
    "Small-Scale Hamiltonian Circuits: Compilation Comparison",
)

Output of the previous code cell

plot_pct_improvement_vs_sabre(
    results_small,
    "Small-Scale Hamiltonian Circuits",
)

Output of the previous code cell

이 규모에서는 세 가지 Pass Manager 모두 잘 작동하며 평균 결과가 서로 비슷합니다. 이는 주로 소규모 Circuit이 추가 최적화의 여지가 적기 때문에 방법들이 유사한 해법으로 수렴하는 경향이 있기 때문입니다.

이 예제에서 Rustiq는 2-Qubit 깊이와 Gate 수 모두에서 가장 큰 이상치가 있어 가장 가변적인 결과를 보입니다. 이러한 가변성은 때로는 뒤처질 수 있지만, Rustiq가 다른 두 방법보다 더 좋은 해법을 찾을 때도 있음을 의미합니다. AI Transpiler는 SABRE와 Rustiq에 비해 더 안정적인 결과를 보이며, 대부분의 Circuit에서 큰 이상치 없이 근접하게 추적합니다.

런타임 측면에서 SABRE와 Rustiq는 모두 빠르지만, AI 기반 Transpiler는 특정 Circuit에서 눈에 띄게 느립니다.

지표별 최고 성능 방법

아래 차트는 각 방법이 각 지표에서 가장 좋은(낮은) 값을 달성한 Circuit의 비율을 보여줍니다. 동점이 가능합니다: 더 단순한 Circuit에서는 여러 방법이 동일한 최적 2-Qubit 깊이나 Gate 수에 도달할 수 있습니다. 동점이 발생하면 모든 동점 방법이 점수를 받으므로 특정 지표의 백분율 합계가 100%를 초과할 수 있습니다.

def plot_best_method_bars(results, metrics_list=None):
    """
    Plot a grouped bar chart showing the percentage of circuits
    where each method achieved the best (lowest) value for each metric.

    Ties are counted for all tied methods, so percentages per metric
    can sum to more than 100%.
    """
    if metrics_list is None:
        metrics_list = ["two_qubit_depth", "size", "runtime"]

    labels = {
        "two_qubit_depth": "2Q Depth",
        "size": "Gate Count",
        "runtime": "Runtime",
    }
    methods = _method_order(results)
    palette = {"SABRE": "#1f77b4", "AI": "#ff7f0e", "Rustiq": "#2ca02c"}

    by_index = {}
    for r in results:
        by_index.setdefault(r["qc_index"], []).append(r)
    n_circuits = len(by_index)

    win_data = {m: [] for m in methods}
    tie_counts = []
    metric_labels = []

    for metric in metrics_list:
        metric_labels.append(
            labels.get(metric, metric.replace("_", " ").title())
        )
        counts = Counter()
        ties = 0
        for group in by_index.values():
            min_val = min(r[metric] for r in group)
            best = [r["method"] for r in group if r[metric] == min_val]
            if len(best) > 1:
                ties += 1
            counts.update(best)
        tie_counts.append(ties)
        for m in methods:
            win_data[m].append(counts.get(m, 0) / n_circuits * 100)

    x = np.arange(len(metric_labels))
    width = 0.22
    fig, ax = plt.subplots(figsize=(8, 5))

    for i, method in enumerate(methods):
        bars = ax.bar(
            x + i * width,
            win_data[method],
            width,
            label=method,
            color=palette.get(method, None),
            edgecolor="black",
            linewidth=0.5,
        )
        for bar in bars:
            height = bar.get_height()
            if height > 0:
                ax.text(
                    bar.get_x() + bar.get_width() / 2,
                    height + 1.5,
                    f"{height:.0f}%",
                    ha="center",
                    va="bottom",
                    fontsize=9,
                )

    # Annotate tie counts below each metric label
    for j, ties in enumerate(tie_counts):
        if ties > 0:
            ax.text(
                x[j] + width,
                -8,
                f"({ties} tie{'s' if ties != 1 else ''})",
                ha="center",
                va="top",
                fontsize=8,
                color="gray",
            )

    ax.set_xticks(x + width)
    ax.set_xticklabels(metric_labels, fontsize=11)
    ax.set_ylabel("Circuits with best value (%)", fontsize=11)
    ax.set_title(
        "Best-Performing Method by Metric (ties counted for all tied methods)",
        fontsize=12,
        fontweight="bold",
    )
    ax.legend(frameon=True, fontsize=10)
    ax.set_ylim(-12, 120)
    ax.yaxis.set_major_formatter(ticker.PercentFormatter())
    ax.grid(axis="y", linestyle="--", alpha=0.4)

    plt.tight_layout()
    plt.show()

plot_best_method_bars(results_small)

Output of the previous code cell

이 예제에서 세 가지 방법은 소규모 Circuit에서 매우 유사하게 수행됩니다. 2-Qubit 깊이와 Gate 수에서 각 방법이 최고인 Circuit의 비율은 비슷하며(약 35~55%), 가장 단순한 Circuit은 여러 방법이 찾는 단일한 최적 해법을 가지는 경우가 많아 동점으로 끝나는 Circuit이 많습니다. 가장 명확한 차이는 런타임입니다: SABRE와 Rustiq는 각각 약 절반의 Circuit에서 가장 빠르지만, AI 기반 Transpiler는 거의 빠르지 않습니다. 세 가지 지표를 모두 고려하면 Rustiq가 약간의 전반적인 우위를 가집니다 2-Qubit 깊이에서 가장 자주 승리하고 Gate 수와 런타임에서도 경쟁력을 유지합니다.

3단계: Qiskit 프리미티브를 사용한 실행

트랜스파일 품질이 노이즈 하에서의 실행에 어떻게 영향을 미치는지 평가하기 위해 미러 Circuit 기법을 사용합니다. 각 트랜스파일된 Circuit $U$ 에 대해 역 회로 $U^\dagger$ 를 추가하여 결합된 Circuit $U^\dagger U$ 가 이론적으로 항등 연산이 되도록 합니다. $|0\rangle$ 상태에서 시작하면 완벽한(노이즈 없는) 실행은 확률 1로 전체 0 비트열을 반환해야 합니다.

실제로는 Gate 오류가 Circuit 전체에 걸쳐 누적되므로, $|0\rangle^{\otimes n}$ 을 복구할 확률이 떨어집니다. 더 얕은 Circuit과 더 적은 Gate를 생성하는 컴파일 방법은 더 적은 노이즈를 누적합니다.

미러 Circuit 접근법은 예상 출력이 항상 $|0\rangle^{\otimes n}$ 이고 이상 상태의 고전적 시뮬레이션이 필요 없으므로 단순하고 어떤 Circuit 크기로도 확장됩니다. 다만 다음 주의 사항에 유의하십시오: 미러 Circuit은 실제 Circuit의 대리체(Circuit 자체가 아님)이고, Gate 수를 두 배로 늘리며(노이즈 효과를 과장), 노이즈가 미러 경계를 넘어 대칭적으로 상쇄될 때 특정 오류를 과소평가할 수 있습니다.

소규모 집합에서 인덱스 6의 Circuit을 선택하고 간단한 탈분극 노이즈 모델을 사용하는 Aer 시뮬레이터에서 미러 Circuit을 실행합니다.

# Select circuit index 6 from the small-scale transpiled circuits
test_idx = 6
test_circuit = qc_small[test_idx]
print(f"Test circuit: {test_circuit.name}, {test_circuit.num_qubits} qubits")

# Get the transpiled versions
tqc_methods_small = {
    "SABRE": tqc_sabre_small[test_idx],
    "AI": tqc_ai_small[test_idx],
    "Rustiq": tqc_rustiq_small[test_idx],
}

# Show transpilation metrics for this circuit
print(f"\nTranspilation metrics for circuit index {test_idx}:")
for method, tqc in tqc_methods_small.items():
    depth_2q = tqc.depth(lambda x: x.operation.num_qubits == 2)
    size = tqc.size()
    print(f"  {method:8s}  2Q depth={depth_2q:5d}  size={size:6d}")

Test circuit: all-vib-fccf, 4 qubits

Transpilation metrics for circuit index 6:
  SABRE     2Q depth=   66  size=   339
  AI        2Q depth=   65  size=   300
  Rustiq    2Q depth=   34  size=   193

미러 Circuit을 구성하고( $U^\dagger$ 추가), 연속된 Qubit 인덱스로 재매핑하여 시뮬레이터가 활성 Qubit만 처리하도록 하고, 노이즈 있는 Aer 시뮬레이터에서 실행합니다.

def remap_to_contiguous(tqc):
    """Remap a transpiled circuit to use contiguous qubit indices.

    Transpiled circuits target specific physical qubits (e.g., qubit 45, 67)
    on a large backend. This remaps them to 0, 1, 2, ... so Aer only
    simulates the active qubits.
    """
    active = sorted(
        {tqc.find_bit(q).index for inst in tqc.data for q in inst.qubits}
    )
    qubit_map = {old: new for new, old in enumerate(active)}
    new_qc = QuantumCircuit(len(active))
    for inst in tqc.data:
        old_indices = [tqc.find_bit(q).index for q in inst.qubits]
        new_qc.append(inst.operation, [qubit_map[i] for i in old_indices])
    return new_qc

def build_mirror_circuit(tqc):
    """Build a mirror circuit: U followed by U-dagger, with measurements.

    The combined circuit U-dagger @ U should be the identity, so measuring
    all zeros indicates a noise-free execution.
    """
    tqc_compact = remap_to_contiguous(tqc)
    mirror = tqc_compact.compose(tqc_compact.inverse())
    mirror.measure_all()
    return mirror

# Build a simple depolarizing noise model
noise_model = NoiseModel()
noise_model.add_all_qubit_quantum_error(
    depolarizing_error(0.001, 1),
    ["sx", "x", "rz"],  # ~0.1% per 1Q gate
)
noise_model.add_all_qubit_quantum_error(
    depolarizing_error(0.01, 2),
    ["cx", "ecr"],  # ~1% per 2Q gate
)

aer_sim = AerSimulator(noise_model=noise_model)

shots = 10000
fidelities = {}

for method, tqc in tqc_methods_small.items():
    mirror = build_mirror_circuit(tqc)

    sampler = SamplerV2(mode=aer_sim)
    job = sampler.run([mirror], shots=shots)
    result = job.result()
    counts = result[0].data.meas.get_counts()

    # Fidelity = fraction of all-zeros (error-free) outcomes
    n_qubits = mirror.num_qubits - mirror.num_clbits  # active qubits
    all_zeros = "0" * mirror.num_qubits
    fidelity = counts.get(all_zeros, 0) / shots
    fidelities[method] = fidelity
    print(
        f"{method:8s}  P(|00...0>) = {fidelity:.4f}  ({counts.get(all_zeros, 0)}/{shots})"
    )

SABRE     P(|00...0>) = 0.7796  (7796/10000)
AI        P(|00...0>) = 0.8073  (8073/10000)
Rustiq    P(|00...0>) = 0.8923  (8923/10000)

def plot_mirror_results(tqc_methods, fidelities, circuit_name):
    """
    Plot a three-panel comparison: fidelity, 2Q depth,
    and gate count for each compilation method.
    """
    methods = list(tqc_methods.keys())
    palette = {"SABRE": "#1f77b4", "AI": "#ff7f0e", "Rustiq": "#2ca02c"}
    colors = [palette.get(m, "gray") for m in methods]

    fidelity_vals = [fidelities[m] for m in methods]
    depth_vals = [
        tqc_methods[m].depth(lambda x: x.operation.num_qubits == 2)
        for m in methods
    ]
    size_vals = [tqc_methods[m].size() for m in methods]

    fig, axes = plt.subplots(1, 3, figsize=(16, 5))
    fig.suptitle(
        f"Mirror Circuit Results: {circuit_name}",
        fontsize=14,
        fontweight="bold",
        y=1.02,
    )

    def _annotate_bars(ax, bars, values, fmt="{}"):
        ymax = ax.get_ylim()[1]
        for bar, val in zip(bars, values):
            label = fmt.format(val)
            y = val + ymax * 0.03
            ax.text(
                bar.get_x() + bar.get_width() / 2,
                y,
                label,
                ha="center",
                va="bottom",
                fontsize=10,
                fontweight="bold",
            )

    # Panel 1: Survival Probability
    bars = axes[0].bar(
        methods, fidelity_vals, color=colors, edgecolor="black", linewidth=0.5
    )
    axes[0].set_ylabel("Fidelity  P(|00...0>)", fontsize=11)
    axes[0].set_title("Fidelity (higher is better)", fontsize=12)
    axes[0].set_ylim(
        0, max(fidelity_vals) * 1.18 if max(fidelity_vals) > 0 else 1.0
    )
    axes[0].grid(axis="y", linestyle="--", alpha=0.4)
    _annotate_bars(axes[0], bars, fidelity_vals, fmt="{:.4f}")

    # Panel 2: Two-Qubit Depth
    bars = axes[1].bar(
        methods, depth_vals, color=colors, edgecolor="black", linewidth=0.5
    )
    axes[1].set_ylabel("Two-Qubit Depth", fontsize=11)
    axes[1].set_title("2Q Depth (lower is better)", fontsize=12)
    axes[1].set_ylim(0, max(depth_vals) * 1.18)
    axes[1].grid(axis="y", linestyle="--", alpha=0.4)
    _annotate_bars(axes[1], bars, depth_vals)

    # Panel 3: Gate Count
    bars = axes[2].bar(
        methods, size_vals, color=colors, edgecolor="black", linewidth=0.5
    )
    axes[2].set_ylabel("Total Gate Count", fontsize=11)
    axes[2].set_title("Gate Count (lower is better)", fontsize=12)
    axes[2].set_ylim(0, max(size_vals) * 1.18)
    axes[2].grid(axis="y", linestyle="--", alpha=0.4)
    _annotate_bars(axes[2], bars, size_vals)

    plt.tight_layout()
    plt.show()

plot_mirror_results(tqc_methods_small, fidelities, test_circuit.name)

Output of the previous code cell

관찰 결과

2-Qubit 깊이가 가장 낮고 Gate 수가 가장 적은 방법이 가장 높은 충실도를 달성합니다. 이는 더 짧은 Circuit이 노이즈를 덜 누적한다는 예상과 일치합니다. 깊이와 Gate 수의 작은 차이도 탈분극 노이즈 모델 하에서 충실도에서 측정 가능한 차이로 이어집니다.

이러한 결과는 단일 Circuit에 대한 것임을 유념하십시오. 방법들의 상대적 순위는 해밀토니안 구조에 따라 Circuit마다 달라질 수 있습니다.

대규모 하드웨어 예제

이 섹션에서는 20 Qubit 이상의 해밀토니안 Circuit에서 동일한 세 가지 컴파일 방법을 벤치마크합니다. 이러한 Circuit은 실용적인 해밀토니안 시뮬레이션 워크로드를 더 잘 대표하며, 각 방법이 Circuit 품질과 컴파일 시간 측면에서 어떻게 확장되는지 테스트합니다.

1~4단계 통합

워크플로는 소규모 예제와 동일한 구조를 따릅니다. 각 방법으로 모든 대규모 Circuit을 트랜스파일하고, 지표를 수집하고, 실제 양자 하드웨어에 미러 Circuit을 제출합니다.

results_large = []

tqc_sabre_large = capture_transpilation_metrics(
    results_large, pm_sabre, qc_large, "SABRE"
)
tqc_ai_large = capture_transpilation_metrics(
    results_large, pm_ai, qc_large, "AI"
)
tqc_rustiq_large = capture_transpilation_metrics(
    results_large, pm_rustiq, qc_large, "Rustiq"
)

[SABRE] Circuit 0 (all-vib-hc3h2cn): 2Q depth=2, size=258, time=0.16s
[SABRE] Circuit 1 (ham-graph-gnp_k-5): 2Q depth=345, size=4036, time=0.08s
[SABRE] Circuit 2 (TSP_Ncity-5): 2Q depth=187, size=2045, time=0.04s
[SABRE] Circuit 3 (tfim): 2Q depth=100, size=489, time=0.21s
[SABRE] Circuit 4 (all-vib-h2co): 2Q depth=30, size=570, time=0.18s
[SABRE] Circuit 5 (uuf100-ham): 2Q depth=414, size=4779, time=0.09s
[SABRE] Circuit 6 (uuf100-ham): 2Q depth=523, size=5667, time=0.11s
[SABRE] Circuit 7 (graph-gnp_k-4): 2Q depth=3028, size=24885, time=0.39s
[SABRE] Circuit 8 (uf100-ham): 2Q depth=700, size=8271, time=0.15s
[SABRE] Circuit 9 (uf100-ham): 2Q depth=698, size=8957, time=0.15s
[SABRE] Circuit 10 (TSP_Ncity-7): 2Q depth=432, size=6353, time=0.12s
[SABRE] Circuit 11 (all-vib-cyclo_propene): 2Q depth=30, size=1144, time=0.20s
[SABRE] Circuit 12 (TSP_Ncity-8): 2Q depth=704, size=10287, time=0.18s
[SABRE] Circuit 13 (uf100-ham): 2Q depth=2454, size=30195, time=0.46s
[SABRE] Circuit 14 (tfim): 2Q depth=245, size=3670, time=0.08s
[SABRE] Circuit 15 (flat100-ham): 2Q depth=154, size=3836, time=0.12s
[SABRE] Circuit 16 (graph-regular_reg-4): 2Q depth=863, size=14063, time=0.22s
[SABRE] Circuit 17 (tfim): 2Q depth=581, size=8810, time=0.15s
[SABRE] Circuit 18 (FH_D-1): 2Q depth=1704, size=9528, time=0.35s
[SABRE] Circuit 19 (TSP_Ncity-10): 2Q depth=1091, size=22041, time=0.38s
[SABRE] Circuit 20 (TSP_Ncity-10): 2Q depth=1091, size=22005, time=0.38s
[SABRE] Circuit 21 (ham-unary-color02-queen13_13_k-4): 2Q depth=224, size=8321, time=0.17s
[AI] Circuit 0 (all-vib-hc3h2cn): 2Q depth=2, size=258, time=0.17s
[AI] Circuit 1 (ham-graph-gnp_k-5): 2Q depth=323, size=4418, time=3.13s
[AI] Circuit 2 (TSP_Ncity-5): 2Q depth=161, size=2229, time=1.47s
[AI] Circuit 3 (tfim): 2Q depth=20, size=402, time=0.34s
[AI] Circuit 4 (all-vib-h2co): 2Q depth=38, size=661, time=0.19s
[AI] Circuit 5 (uuf100-ham): 2Q depth=391, size=5130, time=3.27s
[AI] Circuit 6 (uuf100-ham): 2Q depth=463, size=6095, time=4.23s
[AI] Circuit 7 (graph-gnp_k-4): 2Q depth=3207, size=25641, time=15.15s
[AI] Circuit 8 (uf100-ham): 2Q depth=637, size=8267, time=5.87s
[AI] Circuit 9 (uf100-ham): 2Q depth=632, size=9330, time=7.29s
[AI] Circuit 10 (TSP_Ncity-7): 2Q depth=452, size=7418, time=6.02s
[AI] Circuit 11 (all-vib-cyclo_propene): 2Q depth=38, size=1323, time=0.27s
[AI] Circuit 12 (TSP_Ncity-8): 2Q depth=609, size=11131, time=10.07s
[AI] Circuit 13 (uf100-ham): 2Q depth=2251, size=31128, time=38.77s
[AI] Circuit 14 (tfim): 2Q depth=165, size=3460, time=1.64s
[AI] Circuit 15 (flat100-ham): 2Q depth=91, size=3497, time=2.49s
[AI] Circuit 16 (graph-regular_reg-4): 2Q depth=664, size=15256, time=12.35s
[AI] Circuit 17 (tfim): 2Q depth=583, size=9157, time=6.28s
[AI] Circuit 18 (FH_D-1): 2Q depth=1193, size=7754, time=4.54s
[AI] Circuit 19 (TSP_Ncity-10): 2Q depth=1134, size=22577, time=25.64s
[AI] Circuit 20 (TSP_Ncity-10): 2Q depth=1172, size=23851, time=28.97s
[AI] Circuit 21 (ham-unary-color02-queen13_13_k-4): 2Q depth=219, size=8600, time=8.85s
[Rustiq] Circuit 0 (all-vib-hc3h2cn): 2Q depth=2, size=257, time=0.16s
[Rustiq] Circuit 1 (ham-graph-gnp_k-5): 2Q depth=640, size=5831, time=0.13s
[Rustiq] Circuit 2 (TSP_Ncity-5): 2Q depth=408, size=3985, time=0.08s
[Rustiq] Circuit 3 (tfim): 2Q depth=31, size=688, time=0.07s
[Rustiq] Circuit 4 (all-vib-h2co): 2Q depth=65, size=1058, time=2.91s
[Rustiq] Circuit 5 (uuf100-ham): 2Q depth=633, size=6757, time=0.14s
[Rustiq] Circuit 6 (uuf100-ham): 2Q depth=795, size=8495, time=0.17s
[Rustiq] Circuit 7 (graph-gnp_k-4): 2Q depth=13768, size=139793, time=2.92s
[Rustiq] Circuit 8 (uf100-ham): 2Q depth=1099, size=11878, time=0.25s
[Rustiq] Circuit 9 (uf100-ham): 2Q depth=911, size=11111, time=0.22s
[Rustiq] Circuit 10 (TSP_Ncity-7): 2Q depth=1183, size=13197, time=0.27s
[Rustiq] Circuit 11 (all-vib-cyclo_propene): 2Q depth=67, size=2491, time=13.56s
[Rustiq] Circuit 12 (TSP_Ncity-8): 2Q depth=1615, size=21358, time=0.48s
[Rustiq] Circuit 13 (uf100-ham): 2Q depth=2920, size=40465, time=0.91s
[Rustiq] Circuit 14 (tfim): 2Q depth=489, size=6552, time=0.15s
[Rustiq] Circuit 15 (flat100-ham): 2Q depth=378, size=5906, time=0.14s
[Rustiq] Circuit 16 (graph-regular_reg-4): 2Q depth=12163, size=168679, time=2.94s
[Rustiq] Circuit 17 (tfim): 2Q depth=1208, size=17042, time=0.36s
[Rustiq] Circuit 18 (FH_D-1): 2Q depth=1061, size=24000, time=0.47s
[Rustiq] Circuit 19 (TSP_Ncity-10): 2Q depth=2565, size=41340, time=1.38s
[Rustiq] Circuit 20 (TSP_Ncity-10): 2Q depth=2565, size=41275, time=1.38s
[Rustiq] Circuit 21 (ham-unary-color02-queen13_13_k-4): 2Q depth=808, size=17548, time=0.42s

print_summary_table(results_large)

Mean +/- std per compilation method
Method                2Q Depth              Gate Count             Runtime (s)
------------------------------------------------------------------------------
SABRE          709.1 +/- 783.8     9,100.5 +/- 8,493.1             0.2 +/- 0.1
AI             656.6 +/- 777.5     9,435.6 +/- 8,853.0            8.5 +/- 10.2
Rustiq     2,062.5 +/- 3,631.1   26,804.8 +/- 43,403.1             1.3 +/- 2.9

Mean % improvement vs SABRE (positive = better than SABRE)
Method                2Q Depth              Gate Count             Runtime (s)
------------------------------------------------------------------------------
AI             +9.6% +/- 22.8%          -3.4% +/- 9.4%    -3620.0% +/- 2405.5%
Rustiq      -154.5% +/- 273.9%      -137.1% +/- 233.2%     -527.0% +/- 1405.5%

print_per_circuit_comparison(results_large, num_rows=8)

2Q Depth (first 8 circuits by qubit count); * = best
Idx  Circuit            Q    SABRE       AI   Rustiq
----------------------------------------------------
all-vib-hc3h2cn   24       2*       2*       2*
ham-graph-gnp_k-  24      345     323*      640
TSP_Ncity-5       25      187     161*      408
tfim              26      100      20*       31
all-vib-h2co      32      30*       38       65
uuf100-ham        40      414     391*      633
uuf100-ham        40      523     463*      795
graph-gnp_k-4     40    3028*     3207    13768

Gate Count (first 8 circuits by qubit count); * = best
Idx  Circuit            Q    SABRE       AI   Rustiq
----------------------------------------------------
all-vib-hc3h2cn   24      258      258     257*
ham-graph-gnp_k-  24    4036*     4418     5831
TSP_Ncity-5       25    2045*     2229     3985
tfim              26      489     402*      688
all-vib-h2co      32     570*      661     1058
uuf100-ham        40    4779*     5130     6757
uuf100-ham        40    5667*     6095     8495
graph-gnp_k-4     40   24885*    25641   139793

Runtime (s) (first 8 circuits by qubit count); * = best
Idx  Circuit            Q    SABRE       AI   Rustiq
----------------------------------------------------
all-vib-hc3h2cn   24     0.16     0.17    0.16*
ham-graph-gnp_k-  24    0.08*     3.13     0.13
TSP_Ncity-5       25    0.04*     1.47     0.08
tfim              26     0.21     0.34    0.07*
all-vib-h2co      32    0.18*     0.19     2.91
uuf100-ham        40    0.09*     3.27     0.14
uuf100-ham        40    0.11*     4.23     0.17
graph-gnp_k-4     40    0.39*    15.15     2.92

plot_transpilation_comparison(
    results_large,
    "Large-Scale Hamiltonian Circuits: Compilation Comparison",
)

Output of the previous code cell

plot_pct_improvement_vs_sabre(
    results_large,
    "Large-Scale Hamiltonian Circuits",
)

Output of the previous code cell

plot_best_method_bars(results_large)

Output of the previous code cell

# Select circuit index 3 from the large-scale transpiled circuits
test_idx_large = 3
test_circuit_large = qc_large[test_idx_large]
print(
    f"Test circuit: {test_circuit_large.name}, {test_circuit_large.num_qubits} qubits"
)

tqc_methods_large = {
    "SABRE": tqc_sabre_large[test_idx_large],
    "AI": tqc_ai_large[test_idx_large],
    "Rustiq": tqc_rustiq_large[test_idx_large],
}

print(f"\nTranspilation metrics for circuit index {test_idx_large}:")
for method, tqc in tqc_methods_large.items():
    depth_2q = tqc.depth(lambda x: x.operation.num_qubits == 2)
    size = tqc.size()
    print(f"  {method:8s}  2Q depth={depth_2q:5d}  size={size:6d}")

Test circuit: tfim, 26 qubits

Transpilation metrics for circuit index 3:
  SABRE     2Q depth=  100  size=   489
  AI        2Q depth=   20  size=   402
  Rustiq    2Q depth=   31  size=   688

pm_mirror = generate_preset_pass_manager(
    optimization_level=0, backend=backend
)

for method, tqc in tqc_methods_large.items():
    # print the count ops for each circuit
    mirror = tqc.copy()
    mirror.compose(tqc.inverse(), inplace=True)
    mirror.measure_all()
    mirror = pm_mirror.run(mirror)
    print(f"\n{method} transpiled circuit:")
    print(tqc.count_ops())
    print(f"{method} mirror circuit count ops:")
    print(mirror.count_ops())

SABRE transpiled circuit:
OrderedDict({'sx': 211, 'rz': 163, 'cz': 104, 'x': 11})
SABRE mirror circuit count ops:
OrderedDict({'rz': 1170, 'sx': 422, 'cz': 208, 'measure': 156, 'x': 22, 'barrier': 1})

AI transpiled circuit:
OrderedDict({'sx': 165, 'rz': 162, 'cz': 68, 'x': 7})
AI mirror circuit count ops:
OrderedDict({'rz': 984, 'sx': 330, 'measure': 156, 'cz': 136, 'x': 14, 'barrier': 1})

Rustiq transpiled circuit:
OrderedDict({'sx': 316, 'rz': 225, 'cz': 140, 'x': 7})
Rustiq mirror circuit count ops:
OrderedDict({'rz': 1714, 'sx': 632, 'cz': 280, 'measure': 156, 'x': 14, 'barrier': 1})

# Build mirror circuits and submit to real hardware
# The inverse may introduce gates (e.g., sxdg) not in the backend's
# basis gate set, so we re-transpile the mirror circuit.
pm_mirror = generate_preset_pass_manager(
    optimization_level=0, backend=backend
)

shots_hw = 10000
hw_jobs = {}

for method, tqc in tqc_methods_large.items():
    mirror = tqc.copy()
    mirror.compose(tqc.inverse(), inplace=True)
    mirror.measure_all()

    # Re-transpile at opt level 0 to decompose into basis gates
    # without changing the layout or routing
    mirror = pm_mirror.run(mirror)

    sampler = SamplerV2(mode=backend)
    sampler.options.environment.job_tags = ["TUT_CMHSC"]
    job = sampler.run([mirror], shots=shots_hw)
    hw_jobs[method] = job
    print(f"{method}: submitted job {job.job_id()}")

SABRE: submitted job d8gvgq66983c73dqe5og
AI: submitted job d8gvgqe6983c73dqe5pg
Rustiq: submitted job d8gvgqm6983c73dqe5q0

# Retrieve results and compute fidelities
fidelities_large = {}

for method, job in hw_jobs.items():
    result = job.result()
    counts = result[0].data.meas.get_counts()

    n_qubits = backend.num_qubits
    all_zeros = "0" * n_qubits
    fidelity = counts.get(all_zeros, 0) / shots_hw
    fidelities_large[method] = fidelity
    print(
        f"{method:8s}  P(|00...0>) = {fidelity:.4f}  ({counts.get(all_zeros, 0)}/{shots_hw})"
    )

SABRE     P(|00...0>) = 0.0005  (5/10000)
AI        P(|00...0>) = 0.3267  (3267/10000)
Rustiq    P(|00...0>) = 0.1845  (1845/10000)

plot_mirror_results(
    tqc_methods_large, fidelities_large, test_circuit_large.name
)

Output of the previous code cell

컴파일 결과 분석

위의 벤치마크는 Hamlib 컬렉션의 해밀토니안 시뮬레이션 Circuit에서 SABRE, AI 기반 Transpiler, Rustiq를 소규모 및 대규모 모두에서 비교합니다.

2-Qubit 깊이 및 Gate 수

대규모에서 SABRE와 AI 기반 Transpiler가 가장 강력한 두 방법으로, 각각 서로 다른 지표에서 앞섭니다. 지표별 최고 성능 방법 차트에서 볼 수 있듯이, SABRE는 대다수의 Circuit에서 가장 낮은 Gate 수를 생성하고 거의 모든 Circuit에서 가장 빠른 방법입니다. 이는 삽입된 SWAP Gate를 최소화하도록 설계된 휴리스틱과 최근의 레이아웃 및 라우팅 최적화와 일치합니다. AI 기반 Transpiler는 대부분의 Circuit에서 가장 낮은 2-Qubit 깊이를 생성합니다. 이는 Circuit 깊이를 목표로 하는 강화 학습 목적 함수의 일부와 일치합니다. 요약 표도 동일한 분리를 반영합니다: SABRE는 평균 Gate 수가 낮고, AI Transpiler는 평균 2-Qubit 깊이가 낮습니다. 두 방법 모두 전체 Circuit 범위에 걸쳐 일관되고 신뢰할 수 있습니다.

PauliEvolutionGate 합성을 위해 특별히 제작된 Rustiq는 대규모 Circuit의 극히 일부에서만 단일 최고 결과를 생성합니다. 평균 지표는 컴파일 비교 플롯에서 큰 스파이크로 보이는 소수의 상당한 이상치에 의해 크게 왜곡됩니다. Rustiq가 다른 방법보다 훨씬 높은 깊이와 Gate 수를 생성하는 경우입니다. 이러한 이상치가 없다면 평균 성능은 SABRE와 AI 기반 Transpiler에 훨씬 더 가까울 것입니다.

핵심 관찰은 어떤 단일 방법도 모든 Circuit에서 지배적이지 않다는 것입니다. 각 방법은 특정 경우에 다른 방법보다 뛰어나며, 이는 사용 가능한 모든 도구를 시도하고 각 Circuit에 대해 최상의 결과를 선택하는 것이 가치 있음을 보여줍니다.

런타임

SABRE는 지속적으로 가장 빠른 방법입니다. Rustiq는 일반적으로 비슷한 속도로 실행되지만, 컴파일이 훨씬 오래 걸리는 이상치가 발생할 수 있습니다. 이는 대규모 결과에서 특히 두드러지며, 일부 Circuit으로 인해 Rustiq의 런타임이 급증합니다. 이러한 이상치는 평균 런타임에 큰 영향을 미치므로, Rustiq의 경우 중앙값이 더 대표적인 요약일 수 있습니다. AI 기반 Transpiler는 세 가지 중 가장 느리며, 더 크고 복잡한 Circuit에서 런타임이 눈에 띄게 증가합니다.

미러 Circuit 결과

미러 Circuit 실험은 예상되는 추세를 확인합니다: 2-Qubit 깊이가 낮고 Gate 수가 적은 방법이 노이즈 하에서 더 높은 충실도를 달성합니다. 이는 노이즈 있는 시뮬레이터(소규모)와 실제 하드웨어(대규모) 모두에서 성립합니다.

각 미러 Circuit 플롯은 전체 집합이 아닌 단일 Circuit을 반영한다는 점을 유념하십시오. 위의 하드웨어 예제는 26-Qubit tfim Circuit 하나를 사용하는데, 이 경우 SABRE가 AI 기반 Transpiler와 Rustiq보다 훨씬 높은 2-Qubit 깊이를 생성하므로 충실도가 그에 상응하게 훨씬 낮습니다. 이것은 전체 결과를 대표하지 않습니다: 전체 대규모 Circuit 집합에 걸쳐 SABRE의 2-Qubit 깊이는 보통 AI 기반 Transpiler의 것과 비슷하며, 두 방법은 각각 서로 다른 지표에서 앞섭니다(AI 기반 Transpiler는 2-Qubit 깊이, SABRE는 Gate 수와 런타임). 단일 미러 결과는 전체 워크로드가 아닌 하나의 Circuit의 두 배 버전을 테스트하므로, 전체 방법 품질에 대한 판단으로 읽어서는 안 됩니다.

권장 사항

모든 Circuit에 단일한 최고의 트랜스파일 전략은 없습니다. 최선의 선택은 Circuit 구조, 최적화 목표, 사용 가능한 컴파일 시간 예산에 따라 다릅니다:

SABRE는 권장되는 기본 옵션입니다. 빠르고 신뢰할 수 있으며, 다양한 Circuit에 걸쳐 강력한 결과를 생성합니다. 추가 조정을 위해 레이아웃 및 라우팅 시도 횟수를 늘릴 수 있습니다(SABRE 최적화 튜토리얼 참조).
AI 기반 Transpiler는 컴파일 시간이 제약이 아닐 때, 특히 2-Qubit 깊이 최소화가 우선순위일 때 시도해 볼 가치가 있습니다: 이 벤치마크의 대규모 Circuit 대부분에서 가장 낮은 2-Qubit 깊이를 생성했습니다.
Rustiq는 PauliEvolutionGate Circuit을 위해 특별히 제작되었으며, 특히 소규모 Circuit에서 매우 낮은 깊이와 낮은 Gate 수 해법을 찾을 수 있습니다. 대규모 Circuit에서는 훨씬 더 큰 결과를 생성할 수 있으므로, 기본값이 아닌 여러 방법 중 하나로 사용하는 것이 가장 좋습니다.

실용적으로는 사용 가능한 모든 방법을 실행하고 각 Circuit에 대해 최상의 결과를 선택하는 것이 최선입니다. 여러 방법을 시도하는 컴파일 오버헤드는 실제 하드웨어에서의 잠재적인 실행 품질 향상에 비해 작습니다.

다음 단계

이 튜토리얼이 유용했다면, 다음 내용에도 관심을 가질 수 있습니다:

권장 사항

참고 문헌

[1] "LightSABRE: A Lightweight and Enhanced SABRE Algorithm". H. Zou, M. Treinish, K. Hartman, A. Ivrii, J. Lishman et al. https://arxiv.org/abs/2409.08368

[2] "Practical and efficient quantum circuit synthesis and transpiling with Reinforcement Learning". D. Kremer, V. Villar, H. Paik, I. Duran, I. Faro, J. Cruz-Benito et al. https://arxiv.org/abs/2405.13196

[3] "Pauli Network Circuit Synthesis with Reinforcement Learning". A. Dubal, D. Kremer, S. Martiel, V. Villar, D. Wang, J. Cruz-Benito et al. https://arxiv.org/abs/2503.14448

[4] "Faster and shorter synthesis of Hamiltonian simulation circuits". T. Goubault de Brugiere, S. Martiel et al. https://arxiv.org/abs/2404.03280

학습 목표​

사전 요구사항​

배경​

컴파일 방법 개요​

SABRE를 사용한 Qiskit Transpiler​

AI 기반 Transpiler​

Rustiq 플러그인​

주요 지표​

요구 사항​

설정​

Backend에 연결하기​

Pass Manager 정의​

헬퍼 함수 정의​

Hamlib에서 해밀토니안 Circuit 불러오기​

소규모 예제​

1단계: 고전적 입력을 양자 문제로 매핑하기​

2단계: 양자 하드웨어 실행을 위한 문제 최적화​

결과 시각화​

지표별 최고 성능 방법​

3단계: Qiskit 프리미티브를 사용한 실행​

관찰 결과​

대규모 하드웨어 예제​

1~4단계 통합​

컴파일 결과 분석​

2-Qubit 깊이 및 Gate 수​

런타임​

미러 Circuit 결과​

권장 사항​

다음 단계​

참고 문헌​

학습 목표

사전 요구사항

배경

컴파일 방법 개요

SABRE를 사용한 Qiskit Transpiler

AI 기반 Transpiler

Rustiq 플러그인

주요 지표

요구 사항

설정

Backend에 연결하기

Pass Manager 정의

헬퍼 함수 정의

Hamlib에서 해밀토니안 Circuit 불러오기

소규모 예제

1단계: 고전적 입력을 양자 문제로 매핑하기

2단계: 양자 하드웨어 실행을 위한 문제 최적화

결과 시각화

지표별 최고 성능 방법

3단계: Qiskit 프리미티브를 사용한 실행

관찰 결과

대규모 하드웨어 예제

1~4단계 통합

컴파일 결과 분석

2-Qubit 깊이 및 Gate 수

런타임

미러 Circuit 결과

권장 사항

다음 단계

참고 문헌