SafeMPI API Reference

The SafeMPI module provides distributed reference management for MPI-based parallel computing.

Core Types

SafePETSc.SafeMPI.DRef (Type)
DRef{T}

A distributed reference to an object of type T that is managed across MPI ranks.

When all ranks have released their references via garbage collection, the object is collectively destroyed on all ranks using the type's destroy_obj! method.

Constructor

DRef(obj::T; manager=default_manager[]) -> DRef{T}

Create a distributed reference to obj. The type T must opt in to distributed management by defining destroy_trait(::Type{T}) = CanDestroy() and implementing destroy_obj!(obj::T).

Finalizers automatically enqueue releases when the DRef is garbage collected. Call check_and_destroy!() to perform the actual collective destruction.

Example

# Define a type that can be managed
struct MyDistributedObject
    data::Vector{Float64}
end

SafeMPI.destroy_trait(::Type{MyDistributedObject}) = SafeMPI.CanDestroy()
SafeMPI.destroy_obj!(obj::MyDistributedObject) = println("Destroying object")

# Create a distributed reference
ref = DRef(MyDistributedObject([1.0, 2.0, 3.0]))
# ref.obj accesses the underlying object
# When ref is garbage collected and check_and_destroy!() is called, the object is destroyed

See also: DistributedRefManager, check_and_destroy!, destroy_trait

SafePETSc.SafeMPI.DistributedRefManager (Type)
DistributedRefManager

Manages reference counting and collective destruction of distributed objects across MPI ranks.

Every rank keeps an identical counter_pool/free_ids state and runs the same ID allocation algorithm simultaneously, so there is no special root role. Finalizers simply enqueue release IDs locally. At safe points (check_and_destroy!), ranks Allgather pending releases, update mirrored counters deterministically, and destroy ready objects together, pushing the released IDs back into free_ids on every rank for reuse.
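
Example

A minimal sketch of the safe-point pattern with an explicitly supplied manager. The zero-argument DistributedRefManager() constructor below is an assumption for illustration; typical code simply uses the default manager. MyDistributedObject is the example type defined under DRef above.

# Create a dedicated manager (assumed constructor) and register a DRef with it.
mgr = SafeMPI.DistributedRefManager()
ref = DRef(MyDistributedObject([1.0, 2.0]); manager=mgr)

# Dropping the reference lets the finalizer enqueue a release ID locally.
ref = nothing

# Safe point: ranks Allgather pending releases, update mirrored counters,
# and collectively destroy objects whose reference counts reached zero.
SafeMPI.check_and_destroy!(mgr)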

See also: DRef, check_and_destroy!, default_manager

Reference Management

SafePETSc.SafeMPI.check_and_destroy! (Function)
check_and_destroy!(manager=default_manager[]; max_check_count::Integer=1)

MPI Collective

Perform garbage collection and process pending object releases, destroying objects when all ranks have released their references.

This function must be called explicitly so that cleanup happens at well-defined points in the application. It performs a full garbage collection to trigger finalizers, then processes all pending release messages and collectively destroys objects that are ready.

The max_check_count parameter controls throttling: the function only performs cleanup every max_check_count calls. This reduces overhead in tight loops.

Example

SafeMPI.check_and_destroy!()  # Process releases immediately
SafeMPI.check_and_destroy!(max_check_count=10)  # Only cleanup every 10th call

See also: DRef, DistributedRefManager

SafePETSc.SafeMPI.destroy_obj! (Function)
destroy_obj!(obj)

Trait method called to collectively destroy an object when all ranks have released their references. Types that opt in to distributed reference management must implement this method.

Example

SafeMPI.destroy_obj!(obj::MyType) = begin
    # Perform collective cleanup (e.g., free MPI/PETSc resources)
    cleanup_resources(obj)
end

See also: DRef, destroy_trait

SafePETSc.SafeMPI.default_manager (Constant)
default_manager

The default DistributedRefManager instance used by all DRef objects unless explicitly overridden. Automatically initialized when the module loads.
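
Example

Given the DRef constructor signature above, the following two calls are equivalent; the second simply passes the default manager explicitly:

ref = DRef(MyDistributedObject([1.0]))
ref = DRef(MyDistributedObject([1.0]); manager=SafeMPI.default_manager[])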

SafePETSc.SafeMPI.default_check (Constant)
default_check

Reference to the default throttle count for check_and_destroy! calls. Set this to control how often automatic cleanup occurs during object creation. Default value is 10.

Example:

SafePETSc.default_check[] = 100  # Only cleanup every 100 object creations

Trait System

SafePETSc.SafeMPI.CanDestroy (Type)
CanDestroy <: DestroySupport

Trait indicating that a type can be managed by DRef and supports collective destruction. Types must opt in by defining destroy_trait(::Type{YourType}) = CanDestroy().

SafePETSc.SafeMPI.destroy_trait (Function)
destroy_trait(::Type) -> DestroySupport

Trait function determining whether a type can be managed by DRef.

Returns CanDestroy() for types that opt in to distributed reference management, or CannotDestroy() for types that do not support it (the default).

Example

# Opt in a custom type
SafeMPI.destroy_trait(::Type{MyType}) = SafeMPI.CanDestroy()

MPI Utilities

SafePETSc.SafeMPI.mpi_any (Function)
mpi_any(local_bool::Bool, comm=MPI.COMM_WORLD) -> Bool

MPI Collective

Collective logical OR reduction across all ranks in comm.

Returns true on all ranks if any rank has local_bool == true, otherwise returns false on all ranks. This is useful for checking whether any rank encountered an error or special condition.

Example

local_error = (x < 0)  # Some local condition
if SafeMPI.mpi_any(local_error)
    # At least one rank has an error, all ranks enter this branch
    error("Error detected on at least one rank")
end

See also: @mpiassert

SafePETSc.SafeMPI.mpi_uniform (Function)
mpi_uniform(A) -> Bool

MPI Collective

Checks whether the value A is identical across all MPI ranks.

Returns true on all ranks if all ranks have the same value for A, otherwise returns false on all ranks. This is useful for verifying that distributed data structures are properly synchronized or that configuration values are consistent across all ranks.

The comparison is done by computing a SHA-1 hash of the serialized object on each rank and broadcasting rank 0's hash to all other ranks for comparison.

Example

# Verify that a configuration matrix is the same on all ranks
config = [1.0 2.0; 3.0 4.0]
SafeMPI.@mpiassert mpi_uniform(config) "Configuration must be uniform across ranks"

# Safe to use as a uniform object
config_petsc = Mat_uniform(config)

See also: @mpiassert, mpi_any

SafePETSc.SafeMPI.mpierror (Function)
mpierror(msg::AbstractString, trace::Bool; comm=MPI.COMM_WORLD, code::Integer=1)

MPI Collective

Best-effort MPI-wide error terminator that avoids hangs:

  • Prints [rank N] ERROR: msg on each process that reaches it
  • If trace is true, prints a backtrace
  • If MPI is initialized, aborts the communicator to cleanly stop all ranks (avoids deadlocks if other ranks are not in the same code path)
  • Falls back to exit(code) if MPI is not initialized or already finalized
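
Example

A minimal sketch of coordinated termination. Here validate_inputs and data are hypothetical placeholders for some local check; mpi_any (documented above) ensures every rank reaches the call, so the abort is collective:

# true on all ranks if at least one rank fails the (hypothetical) local check
if SafeMPI.mpi_any(!validate_inputs(data))
    # Every rank reaches this call, so aborting the communicator cannot hang.
    SafeMPI.mpierror("invalid input data", false)
end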

SafePETSc.SafeMPI.@mpiassert (Macro)
@mpiassert cond [message]

MPI Collective

MPI-aware assertion that checks cond on all ranks and triggers collective error handling if any rank fails the assertion.

Each rank evaluates cond locally. If any rank has cond == false, all ranks are notified via mpi_any() and collectively enter error handling via mpierror(). Only ranks where the assertion failed will print a backtrace.

The assertion is skipped entirely if enable_assert[] == false (see set_assert).

Arguments

  • cond: Boolean expression to check (assertion passes when cond == true)
  • message: Optional custom error message (defaults to auto-generated message with file/line info)

Example

# Assert that all ranks have the same value
@mpiassert SafeMPI.mpi_uniform(A) "Matrix A must be uniform across ranks"

# Assert a local condition that must hold on all ranks
@mpiassert n > 0 "Array size must be positive"

See also: mpi_any, mpierror, set_assert

Configuration

SafePETSc.SafeMPI.set_assert (Function)
set_assert(x::Bool) -> nothing

MPI Non-Collective

Enable (true) or disable (false) MPI assertion checks via @mpiassert.

Example

SafeMPI.set_assert(false)  # Disable assertions
SafeMPI.set_assert(true)   # Re-enable assertions