Saga Pattern

Overview

The Saga pattern manages distributed transactions across multiple services by coordinating a sequence of local transactions, each with a compensating action that can undo its effects if subsequent steps fail.

Problem

In distributed systems, you need to maintain data consistency across multiple services or databases without using traditional ACID transactions. When a multi-step business process fails partway through, you must undo the effects of completed steps to maintain system consistency. Traditional two-phase commit does not scale well and creates tight coupling between services.

Solution

You implement each step as a local transaction with a corresponding compensation transaction. If any step fails, you execute compensation transactions in reverse order to undo the effects of all completed steps. You register compensations as each step completes, then automatically trigger them when errors occur to ensure cleanup happens reliably.

The following describes each step in the diagram:

The Saga begins by executing Step 1.
If Step 1 succeeds, the Workflow proceeds to Step 2. If it fails, the Saga ends immediately.
If Step 2 succeeds, the Workflow proceeds to Step 3. If it fails, the Workflow runs the compensation for Step 1.
If Step 3 succeeds, the Saga completes. If it fails, the Workflow runs compensations in reverse order: Step 2 first, then Step 1.

Implementation

The following examples show how each SDK implements the Saga pattern. Each language uses a different mechanism to register and execute compensations, but the core principle is the same: register a compensation before or after each step, and run all compensations in reverse order on failure.

PythonGoJavaTypeScript

python

# workflows.py
from temporalio import workflow

@workflow.defn
class TransferMoneyWorkflow:
    @workflow.run
    async def run(self, details: TransferDetails) -> None:
        compensations = []

        try:
            # Register compensation for Step 1 BEFORE execution
            compensations.append(
                lambda: workflow.execute_activity(
                    withdraw_compensation,
                    details,
                    start_to_close_timeout=timedelta(seconds=10),
                )
            )
            # Step 1: Withdraw from source account
            await workflow.execute_activity(
                withdraw,
                details,
                start_to_close_timeout=timedelta(seconds=10),
            )

            # Register compensation for Step 2 BEFORE execution
            compensations.append(
                lambda: workflow.execute_activity(
                    deposit_compensation,
                    details,
                    start_to_close_timeout=timedelta(seconds=10),
                )
            )
            # Step 2: Deposit to target account
            await workflow.execute_activity(
                deposit,
                details,
                start_to_close_timeout=timedelta(seconds=10),
            )

            # Step 3: Additional operation
            await workflow.execute_activity(
                step_with_error,
                details,
                start_to_close_timeout=timedelta(seconds=10),
            )
        except Exception as e:
            # On error, run compensations in reverse order
            for compensation in reversed(compensations):
                await compensation()
            raise

// saga_workflow.go
func TransferMoney(ctx workflow.Context, details TransferDetails) error {
    // Register compensation for Step 1 BEFORE execution
    defer func() {
        if err != nil {
            _ = workflow.ExecuteActivity(ctx, WithdrawCompensation, details).Get(ctx, nil)
        }
    }()

    // Step 1: Withdraw from source account
    err := workflow.ExecuteActivity(ctx, Withdraw, details).Get(ctx, nil)
    if err != nil {
        return err
    }

    // Register compensation for Step 2 BEFORE execution
    defer func() {
        if err != nil {
            _ = workflow.ExecuteActivity(ctx, DepositCompensation, details).Get(ctx, nil)
        }
    }()

    // Step 2: Deposit to target account
    err = workflow.ExecuteActivity(ctx, Deposit, details).Get(ctx, nil)
    if err != nil {
        return err // Triggers defer, which runs both compensations
    }

    // Step 3: Additional operation
    err = workflow.ExecuteActivity(ctx, StepWithError, details).Get(ctx, nil)
    return err // If error, both compensations run in reverse order
}

java

// HelloSaga.java
public class HelloSaga {
    @WorkflowInterface
    public interface GreetingWorkflow {
        @WorkflowMethod
        String getGreeting(String name);
    }

    public static class GreetingWorkflowImpl implements GreetingWorkflow {
        @Override
        public String getGreeting(String name) {
            // Create a Saga instance with compensation options
            Saga saga = new Saga(new Saga.Options.Builder()
                .setParallelCompensation(false) // Run compensations sequentially
                .build());

            try {
                // Register compensation for Step 1 BEFORE execution
                saga.addCompensation(activities::cleanupHello, name);
                // Step 1: Execute activity
                String hello = Workflow.executeActivity(
                    activities::hello,
                    String.class,
                    name
                ).get();

                // Register compensation for Step 2 BEFORE execution
                saga.addCompensation(activities::cleanupBye, name);
                // Step 2: Execute activity
                String bye = Workflow.executeActivity(
                    activities::bye,
                    String.class,
                    name
                ).get();

                // Register compensation for Step 3 BEFORE execution
                saga.addCompensation(activities::cleanupFile, name);
                // Step 3: This might fail
                Workflow.executeActivity(
                    activities::processFile,
                    Void.class,
                    name
                ).get();

                return hello + "; " + bye;

            } catch (Exception e) {
                // On any error, run all registered compensations in reverse order
                saga.compensate();
                throw e;
            }
        }
    }
}

typescript

// workflows.ts
export async function openAccount(params: OpenAccount): Promise<void> {
  const compensations: Compensation[] = [];

  try {
    // Step 1: Create account
    await createAccount({ accountId: params.accountId });

    // Register compensation for Step 2 BEFORE execution
    compensations.unshift({
      fn: () => clearPostalAddresses({ accountId: params.accountId }),
    });
    // Step 2: Add address
    await addAddress({
      accountId: params.accountId,
      address: params.address,
    });

    // Register compensation for Step 3 BEFORE execution
    compensations.unshift({
      fn: () => removeClient({ accountId: params.accountId }),
    });
    // Step 3: Add client
    await addClient({
      accountId: params.accountId,
      clientEmail: params.clientEmail,
    });

    // Register compensation for Step 4 BEFORE execution
    compensations.unshift({
      fn: () => disconnectBankAccounts({ accountId: params.accountId }),
    });
    // Step 4: Add bank account
    await addBankAccount({
      accountId: params.accountId,
      details: params.bankDetails,
    });
  } catch (err) {
    // On error, run all compensations in reverse order
    for (const comp of compensations) {
      await comp.fn();
    }
    throw err;
  }
}

The key differences between SDKs are:

Go: Uses defer statements that execute in LIFO order when the function returns.
Python: Uses a list with reversed() to iterate compensations in LIFO order on error.
TypeScript: Uses an array with unshift() to maintain LIFO order, and manually iterates on error.
Java: Uses an explicit Saga object to track and trigger compensations.

In all SDKs, compensations are registered before Activity execution and run in reverse order of registration. All compensations must be idempotent and able to handle cases where the forward Activity never executed.

When to register compensations

There are two approaches for when to register compensation Activities:

Register before Activity execution (recommended for safety): This ensures the compensation runs even if the Activity fails after partial completion. For example, a credit card may be charged but the Activity fails before returning success. The compensation must be idempotent and handle cases where the forward Activity never executed (no-op). This is the safer default when Activities have side effects that may occur before failure.
Register after Activity execution (appropriate when safe): This only compensates Activities that completed successfully. The compensation logic is simpler because you do not need to check whether the forward action occurred. This approach is appropriate when Activities are truly atomic (all-or-nothing). The risk is partial completion without compensation if the Activity fails mid-execution.

The choice depends on your Activity's failure characteristics and whether the compensation can safely handle cases where the forward Activity never executed. When in doubt, register compensations before execution and ensure they are idempotent.

When to use

The Saga pattern is a good fit when you need to maintain consistency across multiple services or databases, traditional distributed transactions (two-phase commit) are too slow or unavailable, you can define compensating actions for each step in your business process, eventual consistency is acceptable for your use case, and you need to handle long-running transactions that may span hours or days.

It is not a good fit for operations that require strong ACID consistency, single-service transactions that can use a local database transaction, processes where compensations cannot be defined, or operations that must appear atomic to external observers.

Benefits and trade-offs

The Saga pattern maintains eventual consistency without distributed locks, and each service can use its own database and transaction model. Temporal's durable execution guarantees that compensations will execute even after Worker failures. The pattern scales better than two-phase commit protocols.

The trade-offs to consider are that only eventual consistency is provided — intermediate states are visible to other processes. You must design idempotent compensation Activities, and compensation logic must be maintained alongside forward logic. Some operations may not have meaningful compensations.

Comparison with alternatives

Approach	Consistency	Rollback mechanism	Coupling	Scalability
Saga (orchestration)	Eventual	Compensating transactions	Loose	High
Two-phase commit	Strong (ACID)	Distributed lock/rollback	Tight	Low
Saga (choreography)	Eventual	Event-driven compensations	Very loose	High
Local transaction	Strong (ACID)	Database rollback	None	Single service

Best practices

Make all compensations idempotent. Compensations may run even when the forward Activity never executed (if registered before execution) or may run multiple times on retry. Use idempotency keys to ensure safe re-execution.
Register compensations before Activity execution. This ensures cleanup runs even if the Activity fails after partial completion. The compensation must handle the case where the forward action never occurred (no-op).
Use idempotency keys for forward Activities. Pass a unique identifier (such as a client ID or Workflow ID) to each Activity so retries do not create duplicate side effects.
Set StartToCloseTimeout on compensation Activities. Set a StartToCloseTimeout but avoid ScheduleToCloseTimeout on compensations. Do not set Workflow-level timeouts — let compensations retry until they succeed.
Use a disconnected context for cancellation compensation. In Go, use NewDisconnectedContext to run compensation Activities after Workflow cancellation, since the original context is already cancelled.
Keep compensation payloads small. Pass references (IDs, URLs) instead of full data objects to avoid exceeding the 2 MB payload limit.
Log compensation failures but continue. If a compensation fails, log the error and continue executing remaining compensations. In production, alert for manual intervention on persistent compensation failures.
Re-throw the original error after compensating. Always re-throw the original exception after running compensations so the Workflow reports the correct failure reason.

Common pitfalls

Non-idempotent compensations. Compensations may run even when the forward Activity never executed (if registered before execution) or may run multiple times on retry. All compensations must be idempotent.
Forgetting to register a compensation. If a step succeeds but its compensation was never registered, a later failure leaves that step's effects permanently in place.
Compensations that can fail permanently. If a compensation Activity fails with a non-retryable error, the Saga cannot fully roll back. Design compensations with generous retry policies.
Large payloads in compensation state. Passing large objects through the compensation chain can exceed the 2 MB payload limit. Use references (IDs, URLs) instead of full data.
Swallowing the ContinueAsNew exception in TypeScript. In TypeScript, continueAsNew works by throwing a special exception. A catch block that does not re-throw it, or a finally block that returns a value, silently prevents Continue-As-New.

Retry Policies: Often combined with the Saga pattern to handle transient failures before compensating.
Child Workflows: You can use Child Workflows to organize complex Sagas with multiple sub-Sagas.
Long-Running Activity: Heartbeats work well with long-running compensation Activities.
Early Return: You can combine Early Return with the Saga pattern to return initialization results before compensation runs.

Sample code

Go Sample — Saga with defer-based compensations.
Java Sample — Saga with the Saga API.
TypeScript Sample — Saga with array-based compensations.
Python Sample — Saga with list-based compensations.

Saga Pattern

Overview ​

Problem ​

Solution ​

Implementation ​

When to register compensations ​

When to use ​

Benefits and trade-offs ​

Comparison with alternatives ​

Best practices ​

Common pitfalls ​

Related patterns ​

Sample code ​