Tracking Extra Quantities

Often, a model contains quantities whose values we would like to inspect, but which are not random variables explicitly drawn from a distribution.

As a motivating example, the most natural parameterization of a model is not always the most efficient one to sample from. Consider the following (efficiently reparameterized) implementation of Neal’s funnel (Neal, 2003):

using Turing
setprogress!(false)

@model function Neal()
    # Raw draws
    y_raw ~ Normal(0, 1)
    x_raw ~ arraydist([Normal(0, 1) for i in 1:9])

    # Transform:
    y = 3 * y_raw
    x = exp.(y ./ 2) .* x_raw
    return nothing
end

[ Info: [Turing]: progress logging is disabled globally

Neal (generic function with 2 methods)

In this case, the random variables exposed in the chain (x_raw, y_raw) are not in a helpful form; what we’re after are the deterministically transformed variables x and y.
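For comparison, the direct parameterization that the model above reparameterizes could be sketched as follows. This is an illustration rather than part of the original example: the name Neal_direct is hypothetical, and the distributions are simply those implied by the transform above (y = 3 * y_raw gives y a standard deviation of 3, and x = exp(y / 2) * x_raw gives each element of x a standard deviation of exp(y / 2)).

```julia
using Turing

# Hypothetical direct version of the funnel, shown for comparison only.
@model function Neal_direct()
    # y = 3 * y_raw with y_raw ~ Normal(0, 1) implies y ~ Normal(0, 3)
    y ~ Normal(0, 3)
    # x = exp(y / 2) * x_raw with x_raw[i] ~ Normal(0, 1)
    # implies x[i] ~ Normal(0, exp(y / 2))
    x ~ arraydist([Normal(0, exp(y / 2)) for i in 1:9])
    return nothing
end
```

In this version x and y are themselves random variables, so they would appear in the chain directly; the cost is the funnel-shaped geometry that the reparameterized model is designed to avoid.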

There are two ways to track these extra quantities in Turing.jl.

Using `:=` (during inference)

The first way is to use the := operator, which behaves exactly like = except that the values of the variables on its left-hand side are automatically added to the chain returned by the sampler. For example:

@model function Neal_coloneq()
    # Raw draws
    y_raw ~ Normal(0, 1)
    x_raw ~ arraydist([Normal(0, 1) for i in 1:9])

    # Transform:
    y := 3 * y_raw
    x := exp.(y ./ 2) .* x_raw
end

sample(Neal_coloneq(), NUTS(), 1000)

┌ Info: Found initial step size
└   ϵ = 1.6

Chains MCMC chain (1000×34×1 Array{Float64, 3}):

Iterations        = 501:1:1500
Number of chains  = 1
Samples per chain = 1000
Wall duration     = 7.66 seconds
Compute duration  = 7.66 seconds
parameters        = y_raw, x_raw[1], x_raw[2], x_raw[3], x_raw[4], x_raw[5], x_raw[6], x_raw[7], x_raw[8], x_raw[9], y, x[1], x[2], x[3], x[4], x[5], x[6], x[7], x[8], x[9]
internals         = n_steps, is_accept, acceptance_rate, log_density, hamiltonian_energy, hamiltonian_energy_error, max_hamiltonian_energy_error, tree_depth, numerical_error, step_size, nom_step_size, lp, logprior, loglikelihood

Use `describe(chains)` for summary statistics and quantiles.

Using `returned` (post-inference)

Alternatively, one can specify the extra quantities as part of the model function’s return statement:

@model function Neal_return()
    # Raw draws
    y_raw ~ Normal(0, 1)
    x_raw ~ arraydist([Normal(0, 1) for i in 1:9])

    # Transform and return as a NamedTuple
    y = 3 * y_raw
    x = exp.(y ./ 2) .* x_raw
    return (x=x, y=y)
end

chain = sample(Neal_return(), NUTS(), 1000)

┌ Info: Found initial step size
└   ϵ = 1.6

Chains MCMC chain (1000×24×1 Array{Float64, 3}):

Iterations        = 501:1:1500
Number of chains  = 1
Samples per chain = 1000
Wall duration     = 1.74 seconds
Compute duration  = 1.74 seconds
parameters        = y_raw, x_raw[1], x_raw[2], x_raw[3], x_raw[4], x_raw[5], x_raw[6], x_raw[7], x_raw[8], x_raw[9]
internals         = n_steps, is_accept, acceptance_rate, log_density, hamiltonian_energy, hamiltonian_energy_error, max_hamiltonian_energy_error, tree_depth, numerical_error, step_size, nom_step_size, lp, logprior, loglikelihood

Use `describe(chains)` for summary statistics and quantiles.

The sampled chain does not contain x and y, but we can extract the values using the returned function. Calling this function outputs an array:

nts = returned(Neal_return(), chain)

1000×1 Matrix{@NamedTuple{x::Vector{Float64}, y::Float64}}:
 (x = [-0.8641236278171113, 22.99909099488052, -11.268261534594945, 0.3078824542415632, -7.287651434775003, -17.5478435095055, 4.3864784935353125, -4.948890094937868, -3.4078565849057747], y = 5.108668965341639)
 (x = [-1.2648070183404627, 0.21483483342434223, 0.5718516045328935, 0.2218920092177965, -0.23450244842471538, -0.46656130092688997, 0.6831653515781843, -0.47285171450199287, -1.2083722243511426], y = -0.8202600741100154)
 (x = [3.109255892956394, -1.3131444532693186, -1.203603381193927, -0.2006781379103033, 0.2770351743542932, 0.35346870178307904, -2.224373590814932, 0.8532423542685047, 3.3991168848015936], y = 1.1814290376473893)
 (x = [-1.474832516944132, 0.7798746531016534, 0.15354354182127203, 0.3139179167856834, 0.603099275992411, -0.115466699960286, -0.313557026189624, 0.26012097682656316, -1.0533683581129452], y = -1.2481794088758411)
 (x = [13.651195310506749, -7.797783358768689, -2.9833328270019037, -0.8030142795836266, -7.436849428824008, -7.3475521151706085, 3.575147720253345, 0.9498623164500145, 1.033357724385729], y = 3.8519877717343207)
 (x = [13.651195310506749, -7.797783358768689, -2.9833328270019037, -0.8030142795836266, -7.436849428824008, -7.3475521151706085, 3.575147720253345, 0.9498623164500145, 1.033357724385729], y = 3.8519877717343207)
 (x = [-0.10539471841332175, 0.03848721483518811, 0.007408111569414828, 0.008345662439028198, 0.06370095942275003, 0.013608048240503611, -0.06997697571228112, -0.0321674354500134, -0.03254590032344237], y = -5.841649880043171)
 (x = [0.7795363761826529, 1.0889300622127314, 0.10997978002365522, 0.009218898337814134, -0.46719627288092275, 0.5013567115310841, 0.649259326426951, -0.03349634266346819, -0.48549007253747545], y = -0.8080982252150646)
 (x = [7.46149304863186, 7.988896098975018, 0.39938866141353646, -0.15078916803943948, 4.991770228154776, -4.76741909590924, 5.973445877978943, 8.413536136947588, -1.9676974087871608], y = 3.9313526308123676)
 (x = [0.2609854793445009, 1.1251050776764857, -0.1647228450519901, -0.7161454781156448, -0.9511282931458953, -1.7540319360235557, -4.04998235630089, -0.8804606883592633, 0.39622383169820946], y = 1.8329342083415294)
 ⋮
 (x = [1.1237308000609825, 0.9759811476826366, 1.1457757109522912, 0.021044305461488316, 0.12474708150953753, -1.4688318403439975, 1.0533683889607794, 1.781822481793503, -1.9638819236156768], y = 0.6321678749064233)
 (x = [-0.3686990398829534, -0.8587104137328412, -0.5221235644375244, 0.14154038935647412, -0.21045693426196205, 1.0168261355687676, -0.4738986101983364, -0.8365462677399865, 1.073637409303231], y = -0.7703621683437787)
 (x = [0.8126211276326744, 2.458019392502882, 2.368383001501557, -0.7649109876087827, -0.9876728668900251, -1.7416299386582856, 0.9614352560172265, 0.8571831080504331, -2.174548752956416], y = 1.1965583750183026)
 (x = [-0.17572010945002006, -0.8333334439043664, -0.7989484235584051, 0.5787449549494469, -0.004333148544533069, 0.9926898756273019, -0.6352594633111405, -0.12328823718400943, 0.49024575920083385], y = -0.8679748428438504)
 (x = [0.1931899486130794, 0.762675666897428, 2.5014770537156816, -3.201391492455146, -0.8238059131455178, -4.958025576095447, 2.6041659521890788, 2.470066805263938, -1.8851541765865087], y = 2.085616074080251)
 (x = [0.05345195303830636, -0.048857045308917366, -0.25646846054456185, 0.27471170466966915, 0.03646728540424241, 0.3136603947369883, -0.112754702936123, -0.04942395139361393, 0.06315054358671965], y = -3.541541244723876)
 (x = [3.6692472444173636, -0.9596741118743667, -0.5525094582084112, -4.582972833395992, 6.290705903963241, -1.7175586450212965, 0.07850601169991342, 3.065080858746138, -2.86479947137223], y = 2.728125296240982)
 (x = [-0.1075400617075585, 0.17174656654311996, 0.1156901031465544, 0.09937226576612415, -0.23273032059232743, 0.07150347224481507, 0.014937145683643316, 0.015405505724557426, 0.07988802488453423], y = -4.080664155116795)
 (x = [5.307186340028556, -13.241470813355603, -6.3207918944656605, -5.058071161327767, 10.768180839749299, -4.242234969052577, -2.4128883587167502, 0.05929015327386222, -2.0317548483917487], y = 3.9137424294870975)

Each element of this array is a NamedTuple, as specified in the return statement of the model:

nts[1]

(x = [-0.8641236278171113, 22.99909099488052, -11.268261534594945, 0.3078824542415632, -7.287651434775003, -17.5478435095055, 4.3864784935353125, -4.948890094937868, -3.4078565849057747], y = 5.108668965341639)
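Because the result is an ordinary Julia array of NamedTuples, the individual quantities can be collected with standard array operations. The sketch below uses randomly generated stand-in data of the same shape and element type, so it runs without sampling; the names fake_nts, ys and xs are illustrative only.

```julia
# Stand-in with the same shape and element type as the output of `returned`
# above: a 1000×1 matrix (iterations × chains) of NamedTuples.
fake_nts = reshape([(x = randn(9), y = randn()) for _ in 1:1000], 1000, 1)

# Collect y for every sample; the comprehension keeps the 1000×1 shape.
ys = [nt.y for nt in fake_nts]

# Stack the x vectors column-wise into a 9×1000 matrix (one column per sample).
xs = reduce(hcat, [nt.x for nt in vec(fake_nts)])

size(ys), size(xs)
```

The same two lines should apply unchanged to the real nts, since only field access and concatenation are involved.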

Which to use?

There are some pros and cons of using returned, as opposed to :=.

Firstly, returned is more flexible, as it allows you to track any type of object; := only works with variables that can be inserted into an MCMCChains.Chains object. (Notice that x is a vector: in the first case, where we used :=, the chain stores each element of x separately, so reconstructing the vector value of x can be rather annoying.)
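One possible way to reassemble x from a chain produced with := is sketched below. It assumes that the chain is an MCMCChains.Chains object (as in the output shown earlier) and that MCMCChains is available in the environment, and it relies on the group and Array functions from MCMCChains. The names chain_coloneq, x_matrix and x_first are illustrative.

```julia
using Turing
using MCMCChains  # assumed to be installed; provides `group` and `Array(chain)`

@model function Neal_coloneq()
    # Raw draws
    y_raw ~ Normal(0, 1)
    x_raw ~ arraydist([Normal(0, 1) for i in 1:9])

    # Transform and track
    y := 3 * y_raw
    x := exp.(y ./ 2) .* x_raw
end

chain_coloneq = sample(Neal_coloneq(), NUTS(), 1000)

# Select only x[1], ..., x[9] from the chain, then convert to a plain matrix
# with one row per sample and one column per element of x.
x_matrix = Array(group(chain_coloneq, :x))

# The vector value of x at the first iteration.
x_first = x_matrix[1, :]
```

This works for a flat vector such as x, but more deeply structured objects would need extra bookkeeping, which is where returned tends to be more convenient.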

A drawback is that naively using returned can lead to unnecessary computation during inference. This is because during the sampling process, the return values are also calculated (since they are part of the model function), but then thrown away. So, if the extra quantities are expensive to compute, this can be a problem.

To avoid this, you will essentially have to create two different models, one for inference and one for post-inference. The simplest way of doing this is to add an argument to the model function:

@model function Neal_coloneq_optional(track::Bool)
    # Raw draws
    y_raw ~ Normal(0, 1)
    x_raw ~ arraydist([Normal(0, 1) for i in 1:9])

    if track
        y = 3 * y_raw
        x = exp.(y ./ 2) .* x_raw
        return (x=x, y=y)
    else
        return nothing
    end
end

chain = sample(Neal_coloneq_optional(false), NUTS(), 1000)

┌ Info: Found initial step size
└   ϵ = 1.6

Chains MCMC chain (1000×24×1 Array{Float64, 3}):

Iterations        = 501:1:1500
Number of chains  = 1
Samples per chain = 1000
Wall duration     = 1.72 seconds
Compute duration  = 1.72 seconds
parameters        = y_raw, x_raw[1], x_raw[2], x_raw[3], x_raw[4], x_raw[5], x_raw[6], x_raw[7], x_raw[8], x_raw[9]
internals         = n_steps, is_accept, acceptance_rate, log_density, hamiltonian_energy, hamiltonian_energy_error, max_hamiltonian_energy_error, tree_depth, numerical_error, step_size, nom_step_size, lp, logprior, loglikelihood

Use `describe(chains)` for summary statistics and quantiles.

The above ensures that x and y are not calculated during inference, but allows us to still use returned to extract them:

returned(Neal_coloneq_optional(true), chain)

1000×1 Matrix{@NamedTuple{x::Vector{Float64}, y::Float64}}:
 (x = [1.3595401228813244, 7.851391127213838, 0.47158120223225714, -1.1476446324129799, -0.1902279945059434, -0.9435951660250315, 0.8711497158133918, -6.034234874327416, 0.07352278579184685], y = 2.639370546468135)
 (x = [-0.48758707531652207, -1.0639226552371097, -0.6923740202069759, -0.7121040421663513, 0.3394910433699232, -0.44444764428609007, -0.7363162053237882, 0.9650519793218303, -0.9336438011146087], y = -0.1927662253807001)
 (x = [-0.4128604102704071, -1.0121759540930229, -0.35472346377612013, -0.07705818151034696, 0.6986972633357298, -0.575754274636176, -0.19003976030836114, 0.5052754570070758, 0.9733462008263606], y = -0.41725585517127933)
 (x = [0.8029460790519405, 1.479813329822615, 0.5426138105526562, -0.4122263588526179, -0.9719315383413906, 0.3053913063552876, -0.006808413074111122, -0.7339244078142997, -1.2925929404093548], y = 0.5502397270973736)
 (x = [1.7484148712720575, 2.653169397489418, 0.85668845390037, -0.004813986312806089, 3.371238593501245, 1.7854791068563862, -0.30741003394360084, 1.2664199274604748, -2.7003517854110872], y = 1.972779303717049)
 (x = [-1.191184015240832, -0.19822991459095726, 0.9661791583437067, 1.333400129041543, -0.08926363248441543, 0.04615601364219173, 2.124886330007535, 0.7695283384049063, 1.0923182081638394], y = 0.2314129727078298)
 (x = [0.2051986295970803, 2.3676914723311393, -0.04893964241712603, -0.8990869274383013, 0.9070854689366602, -0.6831112652484912, -1.1357818064783871, 0.029121073461576403, 0.6928141117057219], y = 0.31338884542353407)
 (x = [-2.995482557262743, 3.025556046532654, 1.0360009986695002, 1.2085894125818866, 3.5663494229281762, 2.2781974872198423, 0.13101437533385216, -1.6303147465740158, 0.007654849374079447], y = 1.2197839298167756)
 (x = [-4.883896629914776, 2.308588744633808, -1.7403395648281443, -8.54593453813256, 1.4792908225234684, 6.351582256377142, 0.2741977809506046, 5.728694427003611, 3.4804883456292344], y = 2.6505790127363644)
 (x = [2.5929223992295976, -5.19601439072321, 15.11943962252941, 5.814755128630673, -4.17452441890249, -9.131735610249187, 1.0253128428212506, -2.4817604248279017, -5.320514836221847], y = 3.1556295166336623)
 ⋮
 (x = [0.7932190329746095, 2.144835367486241, 2.2357694183149373, -1.6706891611326715, -0.2482793139127233, -0.7485602302621661, -1.3528260791428361, -1.4406532559827847, -0.8272713195056488], y = 0.8955468658587591)
 (x = [-0.3546539026866118, -0.5305266758952752, -0.620307824526431, 0.5847720087571198, 0.26430581213608806, 0.19442163480118496, 0.7172070106975639, 0.3034091565513354, 0.4612880040539674], y = -1.2757338363861566)
 (x = [1.9515102819491952, 2.8971636928608495, 3.840270333064749, -3.7634456922357056, -3.3568901279793626, -2.373980867055495, -2.961842789421038, -2.42323646238976, -0.9186378318196173], y = 2.4251198604961894)
 (x = [1.335257007020945, 6.043832313174072, 12.953420647772303, -3.7368767167669508, -12.51804116893741, 4.620334337748626, 12.549656891121263, 6.192929492206478, -16.28364743187985], y = 4.632593793330061)
 (x = [-0.04018164522593142, -0.029618452754786802, -0.10261832455287229, 0.045259188140478676, 0.11409383995379492, -0.03311969383469284, -0.12543500385275838, -0.04775520148241284, 0.14717518159601722], y = -4.851101998791171)
 (x = [-2.1185078697724813, -1.8710180439127446, 6.152966043126883, 5.5327041834346575, 8.996332273964137, 7.76207153108245, 3.192980722395198, -1.6068106372107822, 6.957277053370236], y = 3.0720169397431993)
 (x = [-0.8935153157114741, -1.3159256006938775, 1.0535139080230322, -0.2713464032619352, 0.5809808487706298, 2.52881489739396, -0.19805320548863287, 0.8636627986418163, 2.109174996212228], y = 0.8162787100017537)
 (x = [0.15951354214666882, -0.012123816970122589, -0.11133276703731719, -0.006848273604525833, -0.0924835407648116, 0.08142293976072391, 0.15196734775450965, -0.13161283344587013, -0.23919133752601382], y = -3.8945669445035667)
 (x = [-0.0027953840117340314, -0.10313857550835949, -1.3613558012693898, 0.006892896132204996, 0.08874410662851612, -0.3065907724882792, -1.250252618600804, 0.18554226385207825, 0.5358132984367844], y = -0.14488485628277847)

Another equivalent option is to use a submodel:

@model function Neal()
    y_raw ~ Normal(0, 1)
    x_raw ~ arraydist([Normal(0, 1) for i in 1:9])
    return (x_raw=x_raw, y_raw=y_raw)
end

chain = sample(Neal(), NUTS(), 1000)

@model function Neal_with_extras()
    neal ~ to_submodel(Neal(), false)
    y = 3 * neal.y_raw
    x = exp.(y ./ 2) .* neal.x_raw
    return (x=x, y=y)
end

returned(Neal_with_extras(), chain)

┌ Info: Found initial step size
└   ϵ = 1.6

1000×1 Matrix{@NamedTuple{x::Vector{Float64}, y::Float64}}:
 (x = [0.13327423496376992, 1.2252685608296463, 0.648325599773187, 0.08368953473503468, 0.38413053455226737, 0.34597900044056584, -0.37451958891409715, 0.007920653037092263, -1.0414487920224864], y = -0.7980240712886499)
 (x = [4.000926499074251, -3.2554489586305353, -2.238365564898108, -1.294521067642834, -4.465934074228177, -7.31971327934968, 2.4461187952239554, -2.5586877151487606, 7.481381501258308], y = 2.8884144514656436)
 (x = [0.3099883405952595, -0.9386760903578267, -0.2973638139688041, 0.388357030500136, -0.850298718445284, -0.3285277097432644, -0.48578986591565904, 0.6094087888376033, 0.20923326093766512], y = -1.9333885609419468)
 (x = [-2.2950370234399173, 0.27635818864481415, 2.0683559311092883, -2.674159607821279, 4.828769201168165, 3.644773416355987, 3.6129074314063447, -8.227899816128044, -1.4909294374819904], y = 2.309738293182408)
 (x = [0.1336408084938759, 0.008988856897296685, -0.22557272028034583, 0.17924253236061544, -0.329728077590868, -0.3016938590156232, -0.24231805116121152, 0.5060777830300773, -0.034372753755877304], y = -3.2124026703233137)
 (x = [0.5944823556739387, -0.6198404177451211, -0.025034286809376725, -0.18228249723200893, -1.1114187185408908, 1.4381335403709827, -0.8818203169424937, -1.354812430989392, 1.1633066391383227], y = 0.3754351461771439)
 (x = [-0.21574005926387096, 0.07583230115258943, -0.03515621748449502, -0.0117013561036526, 0.11081467073610858, 0.22878790844795144, 0.014367719173280974, 0.18330121965740334, -0.021187995235690767], y = -2.6967116057310014)
 (x = [-0.21574005926387096, 0.07583230115258943, -0.03515621748449502, -0.0117013561036526, 0.11081467073610858, 0.22878790844795144, 0.014367719173280974, 0.18330121965740334, -0.021187995235690767], y = -2.6967116057310014)
 (x = [-0.21574005926387096, 0.07583230115258943, -0.03515621748449502, -0.0117013561036526, 0.11081467073610858, 0.22878790844795144, 0.014367719173280974, 0.18330121965740334, -0.021187995235690767], y = -2.6967116057310014)
 (x = [-0.21574005926387096, 0.07583230115258943, -0.03515621748449502, -0.0117013561036526, 0.11081467073610858, 0.22878790844795144, 0.014367719173280974, 0.18330121965740334, -0.021187995235690767], y = -2.6967116057310014)
 ⋮
 (x = [-0.4548440119431446, -0.9673410338301413, -0.00972192118835419, 1.4180503522886232, -2.3447335604967776, -0.26660110975434503, -1.0912196826117995, -0.6482420206618467, -1.3558945366236572], y = 0.7418441286848443)
 (x = [0.3511031576097433, 0.8938934126197776, -0.5546675139910741, -1.4947859459362514, 2.186886670177737, -0.06841528853099776, 0.7687642260120467, 0.5130469780177379, 0.671102792433841], y = 0.3292187304112171)
 (x = [0.3511031576097433, 0.8938934126197776, -0.5546675139910741, -1.4947859459362514, 2.186886670177737, -0.06841528853099776, 0.7687642260120467, 0.5130469780177379, 0.671102792433841], y = 0.3292187304112171)
 (x = [-0.6949751926270217, -0.3269591757322304, 0.41044655806766883, 0.6455610141284149, -0.7026381508625529, 0.13152918368811592, -0.014617801702193777, 0.1610740496086987, -0.5067085782590613], y = -1.5175706872754657)
 (x = [0.3001926965952509, 0.5102261124212533, -0.20227820113021996, -0.5067344824083376, -0.27900260178784947, -0.10266899752939782, -0.40326889598595295, -0.44452291438183916, 0.334756795501539], y = -1.7953184030988183)
 (x = [-1.4085829050491836, -2.3351408864835412, 1.6226635779808136, 3.4125376593471444, 0.8375296260535924, 0.6717781711746094, 0.6057278721617685, 2.9437393656978426, -2.3717262751440544], y = 1.6670382944885112)
 (x = [0.7218948289306728, 0.42786537887286036, -0.5087152493136993, -0.815686340337932, -0.014244751643828886, -0.15471041520167356, -0.04018996197023151, -0.5462841585326942, 0.7907012087174106], y = -0.7424551609034191)
 (x = [1.186929119984339, -0.3477031141767526, -0.1664772341659399, -1.445346306817053, 0.9717480888080745, -1.2991130980459982, 0.5079725687660681, 2.5977955494004137, 6.771641142607306], y = 1.645499036950881)
 (x = [5.402237809289436, -5.931128727428508, -3.0159361390613246, -2.7934384677630812, 10.215115098762604, 5.272556541294822, 4.928003071879915, 11.325296745696445, -3.4457822654541306], y = 3.5828195860538123)

Note that for the returned call to work, the Neal_with_extras() model must have the same variable names as stored in chain. This means the submodel Neal() must not be prefixed, i.e. to_submodel() must be passed false as its second argument.
