Contributor onboarding

This page summarizes recurring lessons from DynamicPPL and AbstractPPL history for contributors who are new to Julia, Turing.jl, or DynamicPPL internals. It is a starting point, not a checklist. For day-to-day Julia style, see JULIA.md; for coding-agent instructions, see AGENTS.md.

The source pass covered GitHub history available on 2026-05-06. For DynamicPPL, that included 422 issues, 957 pull requests, 6,958 issue/PR comments, 3,726 PR reviews, and 5,176 inline review comments. For AbstractPPL, that included 46 issues, 101 pull requests, 654 issue/PR comments, 332 PR reviews, and 441 inline review comments. Linked issues and PRs are representative starting points, not current API documentation.

What DynamicPPL Does

DynamicPPL is the modelling and evaluation layer under Turing.jl. It provides @model, tilde (~) statement handling, conditioning, fixing, parameter transforms, accumulators, and log-density interfaces for samplers and automatic differentiation. It uses AbstractPPL for shared interfaces such as VarName, contexts, and evaluator protocols.

A useful mental model:

@model lowers user code into a model function.
Each ordinary ~ statement becomes an assume or observe statement.
Contexts and initialisation strategies decide where values come from.
Accumulators decide which quantities are collected.
LogDensityFunction maps named model parameters to flat vectors.
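
As a rough illustration of the steps above, the sketch below defines a small model and evaluates its log density through LogDensityFunction. The model name demo and the data value are made up for illustration. Constructor defaults for LogDensityFunction have changed between DynamicPPL releases, so treat the calls as assumptions to check against the installed version.

```julia
using DynamicPPL, Distributions, LogDensityProblems

# Hypothetical model: one latent variable `mu`, one observation `y`.
@model function demo(y)
    mu ~ Normal()        # not an argument, so it is a random variable (assume)
    y ~ Normal(mu, 1)    # passed as data, so it is an observation (observe)
end

model = demo(0.5)

# Wrap the model so samplers and AD can treat it as a function of a flat vector.
ldf = LogDensityFunction(model)

n = LogDensityProblems.dimension(ldf)            # number of flat parameters
lp = LogDensityProblems.logdensity(ldf, [0.0])   # log density at mu = 0.0
```

Because mu is unconstrained here, the flat vector and the model-space value coincide; constrained parameters would bring transforms into play.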

Start with these docs:

Core Lessons

Prefer explicit evaluation state

For new evaluation code, prefer explicit initialisation strategies and accumulators over adding more responsibilities to VarInfo. VarInfo remains important, but fast paths should carry only the state they need.

A common migration replaces a call such as:

evaluate!!(model, varinfo)

with an explicit call to init!!:

init!!(
    model,
    OnlyAccsVarInfo(accumulators...),
    InitFromParams(varinfo.values),
    varinfo.transform_strategy,
)

The exact strategy and accumulator set depend on the caller.

Use names and shapes carefully

Use @varname(x) and @varname(x[1]); avoid manual construction of indexed VarNames. Use subsumption for containment checks: @varname(x) can cover @varname(x[1]), but they are not equal.
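
The difference between subsumption and equality can be checked directly. This is a minimal sketch using @varname and subsumes from AbstractPPL; the variable names are illustrative.

```julia
using AbstractPPL

vn_whole = @varname(x)
vn_elem = @varname(x[1])

covers = subsumes(vn_whole, vn_elem)     # true: x covers x[1]
reverse = subsumes(vn_elem, vn_whole)    # false: x[1] does not cover all of x
same = vn_whole == vn_elem               # false: containment is not equality
```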

VarName display, sorting, prefixing, unprefixing, and serialization are downstream-facing interface behaviour. Test nested fields, indices, ranges, Colon, and non-standard indices when changing them. Avoid broad Base overloads such as a generic get(obj, vn) unless the package clearly owns the method; overloads on types it does not own risk type piracy and method ambiguities.

VarNamedTuple is the preferred internal container for named parameter values where supported. Convert user-facing NamedTuple or Dict{VarName} inputs at API boundaries. Preserve templates, shapes, and index structure so values can round-trip between named form and flat vectors. Avoid large mostly-empty shadow arrays and keep eltypes concrete in hot paths.

Keep `!!` return values

DynamicPPL uses BangBang-style !! functions. They may mutate in place or return a replacement object. Always use the returned value.

vi = accumulate_assume!!(vi, value, tval, logjac, vn, dist, template)

If your function calls a !! function, it usually needs to return the updated state as well.
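
The convention is easiest to see with BangBang itself rather than DynamicPPL internals. In this sketch, push!! mutates when it can and returns a replacement when it cannot; add_sample!! is a hypothetical helper showing how the result is passed back to the caller.

```julia
using BangBang

xs = [1, 2]

# Same element type: push!! can mutate in place and returns the same array.
ys = push!!(xs, 3)
ys === xs            # true

# Wider element type: a Vector{Int} cannot hold 0.5, so a new array is returned.
zs = push!!(xs, 0.5)
zs === xs            # false; zs is a Vector{Float64}

# Hypothetical helper: return whatever the !! call returned.
add_sample!!(samples, value) = push!!(samples, value)
```

Code that ignores the return value works in the first case and silently drops the update in the second, which is why the result should always be rebound.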

Treat `@model` as Julia code

@model lowering must preserve ordinary Julia behaviour as well as PPL semantics. For compiler changes, test positional and keyword arguments, defaults, splatting, closures, interpolation, return values, no-observation models, and data- or parameter-dependent control flow.

Macro hygiene matters. User variables, generated temporaries, and globals should not capture each other accidentally. Returned quantities are user-visible and are distinct from accumulated random variables.

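DynamicPPL tracks variables through tilde statements, so the left-hand side of a ~ statement is treated as a model variable even when it was assigned a derived value earlier in the model. In the following model, y ~ Normal() does not add a likelihood term for x + 1; unless y is supplied as data, it declares y as a new random variable and overwrites the derived value.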

@model function f()
    x ~ Normal()
    y = x + 1
    return y ~ Normal()
end

If the intent is to add a likelihood term for a derived value, prefer @addlogprob! or a clearer model structure. Do not copy old .~ examples; the dot-tilde pipeline was removed.
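
One possible rewrite of the model above uses @addlogprob! so that y stays a derived quantity. The name f_derived is hypothetical, and whether the added term counts as prior or likelihood depends on the DynamicPPL version.

```julia
using DynamicPPL, Distributions

# Hypothetical rewrite: `y` is derived from `x` and never appears in a ~ statement.
@model function f_derived()
    x ~ Normal()
    y = x + 1
    # Add a log-density term for the derived value without declaring a new variable.
    @addlogprob! logpdf(Normal(), y)
    return y
end

model = f_derived()
```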

Passing missing can affect whether a value is observed or latent. Add tests for the exact data shape you support, especially arrays with missing values, arrays of arrays, and mutable structs.

Test contexts with nested models

Contexts change model evaluation without rewriting the model body. condition, fix, decondition, unfix, to_submodel, and prefixes all interact.

Prefer condition, fix, and to_submodel over hardcoded special cases. Use the same VarName semantics as the tilde pipeline. Add nested-submodel tests when changing contexts, prefixes, conditioning, or fixing.
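
A nested-model test usually starts from something like the sketch below. The model names inner and outer are illustrative, and the way prefixed submodel variables are named has changed across releases, so check the current docs before asserting on specific names.

```julia
using DynamicPPL, Distributions

@model function inner()
    x ~ Normal()
    return x
end

@model function outer()
    # `a` receives the submodel's return value; its variables are prefixed by `a`.
    a ~ to_submodel(inner())
    y ~ Normal(a, 1)
end

model = outer()

# Condition `y` on data; `|` is shorthand for `condition`.
cond_model = model | (y = 0.3,)

# Fix `y` instead; unlike conditioning, a fixed value adds no log-density term.
fixed_model = fix(model, (y = 0.3,))

# Undo each operation.
plain_again = decondition(cond_model)
unfixed_again = unfix(fixed_model)
```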

Know which space values live in

DynamicPPL moves between constrained model space and unconstrained sampler space. Be explicit about which space each value lives in.

val: constrained model-space value used for distribution densities.
tval: TransformedValue, which may contain a linked value.
logjac: log absolute Jacobian contribution from the link transform.
getlogjoint: constrained-space log joint.
getlogjoint_internal: internal log density for sampler-facing paths.
vi[:]: internal stored vector; do not assume it is in distribution support.
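
The relationship between a constrained value, its linked value, and the log-Jacobian can be checked with Bijectors and Distributions alone. This sketch does not touch DynamicPPL internals; the variable names mirror the list above only for readability.

```julia
using Distributions, Bijectors

dist = Exponential()    # support is (0, Inf)
val = 2.0               # constrained, model-space value

# Link transform to unconstrained space, with its log absolute Jacobian.
to_unconstrained = bijector(dist)
linked, logjac = Bijectors.with_logabsdet_jacobian(to_unconstrained, val)

# The density of the linked value subtracts the link's log-Jacobian.
lp_constrained = logpdf(dist, val)
lp_linked = logpdf(transformed(dist), linked)
matches = lp_linked ≈ lp_constrained - logjac   # true
```

Passing linked where val is expected, or the reverse, still produces a number, so mistakes of this kind are easy to miss without a test.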

LogDensityFunction is the usual boundary for HMC/NUTS, optimisers, and AD. When changing log-density or transform code, test the relevant AD backends. Avoid compiled ReverseDiff tapes for models whose control flow depends on parameter values.
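
The compiled-tape pitfall can be reproduced with ReverseDiff on its own. In this sketch, f is a hypothetical stand-in for a log density whose branch depends on the parameter value; the tape is recorded on one branch and then reused on an input that should take the other.

```julia
using ReverseDiff

# Hypothetical function with parameter-dependent control flow.
f(x) = x[1] > 0 ? x[1] * x[1] + x[2] * x[2] : x[1] + x[2]

# Record and compile a tape at a point where the first branch is taken.
tape = ReverseDiff.compile(ReverseDiff.GradientTape(f, [1.0, 2.0]))

x_other = [-1.0, 2.0]    # this input should take the second branch

# The compiled tape replays the recorded branch, so the gradient is stale.
stale = ReverseDiff.gradient!(similar(x_other), tape, x_other)

# Re-tracing follows the branch that matches the input.
fresh = ReverseDiff.gradient(f, x_other)

stale == fresh   # false
```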

Evaluator APIs should separate structural preparation from AD-specific preparation. Evaluator and gradient APIs with a !! suffix may reuse internal buffers, so copy results before storing them long term.

Working in Julia

DynamicPPL code often sits on hot paths for inference and AD. Small edits can change inference, allocations, invalidation, or downstream package behaviour, so performance-sensitive changes need measurement rather than intuition.

The general rules live in JULIA.md. The ones most likely to matter here are generic numeric code, concrete storage types, deterministic doctests, extension-based backend integrations, and type-stability checks for compiler output, VarNamedTuples, accumulators, transforms, and log-density paths.
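
A type-stability check can be as small as the sketch below, which uses Test.@inferred from the standard library. The functions stable and unstable are hypothetical; the same pattern applies to accumulator, transform, and log-density code.

```julia
using Test

# Returns an Int for non-positive Float64 input, so the return type is a Union.
unstable(x) = x > 0 ? x : 0

# Generic numeric code: zero(x) keeps the return type equal to the input type.
stable(x) = x > 0 ? x : zero(x)

@inferred stable(1.0)                            # passes and returns 1.0
@test_throws ErrorException @inferred unstable(1.0)   # inference sees a Union
```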

Copying, Accumulators, and Threading

Be explicit about aliasing. Copy stored values when later mutation by model code would otherwise change accumulated results. Use the cheapest correct copy: copy or collect is often enough, while deepcopy can be much slower.
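
The aliasing problem is plain Julia behaviour and can be shown without DynamicPPL. In this sketch, buffer stands in for a value that model code mutates after it has been stored.

```julia
stored_alias = Vector{Vector{Float64}}()
stored_copy = Vector{Vector{Float64}}()

buffer = [1.0, 2.0]
push!(stored_alias, buffer)         # stores a reference to `buffer`
push!(stored_copy, copy(buffer))    # stores an independent shallow copy

buffer[1] = 99.0                    # later mutation, e.g. by model code

stored_alias[1]    # [99.0, 2.0]: the stored result changed
stored_copy[1]     # [1.0, 2.0]: the stored result is preserved
```

A shallow copy is enough here because the elements are immutable numbers; nested mutable containers need a copy at each level that can be mutated.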

Accumulators collect outputs from model execution, such as log probabilities, raw values, vector values, pointwise log densities, and returned values. Add only the accumulators you need. copy(acc) must not accidentally share mutable internal state.

Avoid designs that depend on Threads.threadid() indexing. When thread-safe evaluation runs under AD, promote accumulator storage to an element type that can hold the AD tracer types. Treat threaded assume support as subtle unless current docs and tests cover the exact case.
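
One alternative to threadid-indexed storage is to give each task its own local accumulator and combine the results afterwards. The sketch below is generic Julia rather than DynamicPPL's accumulator machinery, and chunked_sum is a hypothetical helper; a real accumulator would also choose its element type from the inputs instead of hardcoding Float64.

```julia
using Base.Threads: @spawn, nthreads

# Sum `f(x)` over `xs` with one task-local accumulator per chunk.
function chunked_sum(f, xs; ntasks = nthreads())
    isempty(xs) && return 0.0
    chunk_len = cld(length(xs), ntasks)
    tasks = map(Iterators.partition(xs, chunk_len)) do chunk
        @spawn begin
            acc = 0.0    # local to this task, never indexed by thread id
            for x in chunk
                acc += f(x)
            end
            acc
        end
    end
    return sum(fetch, tasks)
end

total = chunked_sum(abs2, 1:100)   # 338350.0
```

Because no state is keyed on the running thread, the result does not depend on task migration between threads.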

Getting a PR Ready

For a first contribution, scope the change by deciding whether it is user-facing, internal, or downstream-facing through Turing.jl. Add the smallest tests that exercise the behaviour, then widen coverage only where the change touches shared machinery: nested submodels for contexts and prefixes, AD backends for log-density or transform paths, round trips for flattening and unflattening, and type-stability or allocation checks for hot paths.

Run JuliaFormatter before submitting and treat docs, Aqua, JET, formatting, and extension-loading failures as part of the change. Put dependencies in the narrowest environment that owns them: runtime, extension, test, or docs.