Methodology · v2.0 · Experience Calibration Addendum

METRA Workforce Claims Forecast — Methodology v2.0

The Experience Calibration addendum. It specifies how a group's own paid medical and Rx experience is credibility-blended against a manual rate, and how the forward metabolic signal from the Workforce Exposure Forecast (Methodology v1.0) is then applied as a dollar offset to return a projected forward claims figure with a confidence band. Published in the open, in v2.0, for review and attestation by credentialed actuaries.

Version 2.0 · Published · Status For review & attestation · Permalink usemetra.com/methodology/v2.0/
Document version2.0
Methodology hashv2.0-2026.06
Claims credibility k400 lives
Default forward trend0.08 / yr
Attrition α0.15 / yr
Benchmark PEPY fallback$16,501
Benchmark sourceMercer National Survey of Employer-Sponsored Health Plans (2024)

Reading order — this is an addendum, not a replacement

v2.0 is additive to Methodology v1.0, which remains the published, version-pinned specification for the Workforce Exposure Forecast and is unchanged. v1.0 produces the forward metabolic dollar signal; v2.0 specifies how that finished signal is consumed as an offset against a credibility-blended claims projection. Read v1.0 first. Every symbol re-used from v1.0 carries its v1.0 meaning unless redefined here. The two credibility weightings in this system are distinct and are not stacked — see §8.

Contents

  1. Scope and Intent
  2. Notation
  3. Connected Experience and Data Quality
  4. Completion and IBNR Maturation
  5. Large-Claim Pooling
  6. Credibility Blend of Observed and Expected
  7. Forward Projection and the Metabolic Offset
  8. Non-Double-Counting of Credibility
  9. Confidence Band Construction
  10. Worked Example
  11. PHI, Aggregation, and Governance
  12. Conformance with Actuarial Standards of Practice
  13. Limitations and Refusal Conditions
  14. Version History and Change Control
  15. Bibliography

§1. Scope and Intent

What this addendum specifies. The procedure by which Metra calibrates a single employer cohort's forward claims expectation using that cohort's own paid medical and pharmacy experience. The procedure has three stages: (a) mature and pool the connected paid experience into a credibility base; (b) blend the resulting observed per-employee-per-year (PEPY) rate against a manual or expected rate using Bühlmann credibility; and (c) project the blended rate forward one year under a medical-trend and attrition assumption, then subtract the forward metabolic dollar signal produced by the v1.0 Workforce Exposure Forecast. The output is a projected forward claims figure with a confidence band.

What this addendum does not specify. It does not specify a rate filing, a stop-loss credibility table, or a replacement for the carrier's own renewal math. It is the buyer-side counterpart to that math: the same renewal arithmetic a carrier already runs on the employer's population, run by the employer, with the forward metabolic signal folded in as a disclosed, auditable offset. The output is a forecast the CFO can plan against and the actuary across the table can check; it is not raw material handed to the reader to build their own model.

Relationship to v1.0. v1.0 is consumed here as a finished input. The 12-month metabolic point estimate and its confidence interval enter this procedure as fixed dollar quantities. v2.0 does not re-open, re-weight, or re-credibilize the v1.0 result; it nets it against the gross claims projection.

§2. Notation

Symbols introduced in this addendum. Where a symbol also appears in v1.0, the v1.0 definition governs unless noted.

MM
Σ enrolled member-months across the connected experience period.
N
Average enrolled lives over the period, N = MM / 12. This is the exposure base for claims credibility.
P
Total paid claims in the period, P = paid medical + paid Rx.
c
Completion factor (0 < c ≤ 1, typically near 1). Immature paid claims are inflated to an incurred basis by dividing by c.
R
Explicit IBNR reserve add-on, used only when no completion factor is supplied.
C
Completed (incurred-basis) claims after maturation.
X
Claims dollars above the pooling point, excluded from the credibility base.
B
Credibility base, B = max(0, CX).
O
Observed PEPY, O = B / N.
E
Expected / manual PEPY (carrier manual rate, or national benchmark fallback).
k
Bühlmann credibility constant, k = 400 lives (the same constant family used in v1.0).
Z
Claims credibility weight, Z = N / (N + k).
π
Blended PEPY, π = Z·O + (1 − ZE.
α
Annual attrition assumption (default 0.15).
L
Forward lives, L = N · (1 − α).
t
Forward medical trend (default 0.08; override with the carrier renewal trend).
G
Gross projected claims, G = L · π · (1 + t).
M
Forward metabolic offset = v1.0 12-month point estimate (with CI bounds Mlo, Mhi).
Ĝ
Projected forward claims, Ĝ = GM.

§3. Connected Experience and Data Quality

The procedure consumes aggregate paid experience only: total paid medical, total paid Rx, and the enrolled member-months that generated them, for a defined incurred period with a stated paid-through date. No member-level claim records, diagnosis codes, or any other protected health information enter the procedure (see §10). The connected experience carries a source label (carrier report, TPA file, ASO summary) so that the provenance of the calibration is recorded with the output.

Data sufficiency is the actuary's first checkpoint. The procedure does not impute missing months, does not annualize a partial period by assumption, and does not silently substitute a benchmark when experience is thin — it surfaces what was supplied and what was defaulted. Where the group has not supplied a manual rate, the expected rate falls back to a sourced, dated national benchmark — the average total health benefit cost per employee from Mercer National Survey of Employer-Sponsored Health Plans, 2024 ($16,501) — which is carried verbatim into the output (with its source and year) and explicitly flagged as a benchmark to be replaced with the carrier manual rate. This benchmark is refreshed when Mercer publishes a new survey. A blend resting on the benchmark is weaker than a blend resting on the group's own manual rate, and the output says so.

§4. Completion and IBNR Maturation

Paid claims for a recent period are immature: claims incurred near the end of the period have not yet been paid. The procedure matures paid to an incurred basis in one of two mutually exclusive ways, in priority order:

Completion factor (preferred) C = P / c
Explicit IBNR add-on (used only when no completion factor is supplied) C = P + R

When neither is supplied, C = P and the experience is treated as fully mature — appropriate only for a run-out-complete period. The completion factor is the actuary's lever; it is supplied per group, not assumed by the platform, and it materially moves the observed rate. The procedure rejects a non-positive completion factor.

§5. Large-Claim Pooling

A single catastrophic claimant can dominate a mid-size group's experience and destroy the credibility of the observed rate. Where a pooling point is supplied, the dollars above it are excluded from the credibility base:

Pooled credibility base B = max(0, C − X)

The pooling point itself is informational and the excluded dollars X are reported alongside the result, so a reviewer can see how much of the raw experience was set aside and reconstruct the unpooled figure. Pooling is applied to the credibility base only; it does not alter the manual/expected rate, which is assumed to already sit on whatever basis the carrier quoted. Pooling is optional — when no excess is supplied, B = C and the result is flagged as an unpooled basis.

§6. Credibility Blend of Observed and Expected

The observed PEPY is the pooled, completed base per average enrolled life:

Observed PEPY O = B / N

It is blended against the expected/manual PEPY using the Bühlmann credibility weight, with the same constant family as v1.0:

Claims credibility weight Z = N / (N + k),   k = 400 lives
Blended PEPY π = Z·O + (1 − Z)·E

The interpretation is the standard renewal one: a small group (low N) receives a low Z and its forward rate leans on the manual; a large group (high N) receives a high Z and its forward rate leans on its own experience. At N = k = 400 the weight is exactly 0.5. The constant k is held common with v1.0's credibility treatment so that a reviewer encounters one credibility parameter across the whole instrument, not two unrelated ones.

§7. Forward Projection and the Metabolic Offset

The blended rate is carried forward one year on the surviving lives, trended, then reduced by the forward metabolic dollar signal:

Forward lives L = N · (1 − α)
Gross projected claims G = L · π · (1 + t)
Projected forward claims Ĝ = G − M

Here M is the v1.0 Workforce Exposure Forecast 12-month point estimate — the modelled forward healthcare exposure that the cohort's measured metabolic trajectory is expected to avoid over the year. It enters as finished dollars. The trend t defaults to a conservative single-digit anchor and is intended to be overridden with the carrier's own renewal trend; the attrition α defaults to the v1.0 value. The implied trend impact of the metabolic offset, M / G, is reported so the reader can read the offset as a fraction of gross.

§8. Non-Double-Counting of Credibility

This system contains two distinct credibility applications, applied to two different quantities, and they are never stacked. This is the single most important integrity property of the addendum and the first thing a reviewing actuary should verify.

WeightCredibilizesAgainstWhere it lives
ZclaimsThe group's observed paid PEPYThe manual / expected PEPYThis document, §6
ZmetabolicThe cohort biometric exposure signalA conservative priorv1.0, inside the metabolic offset M

The metabolic offset M arrives already credibility-weighted by v1.0's Zmetabolic. v2.0 consumes it as a fixed dollar figure and applies no further credibility weight to it. Zclaims touches only the observed-versus-expected PEPY blend and never touches M. The two weights operate on disjoint quantities; there is no path by which a single dollar of exposure is credibility-discounted twice.

§9. Confidence Band Construction

The confidence band on the projected figure is inherited entirely from the v1.0 metabolic offset's confidence interval. The gross projection G is treated as a point quantity for this purpose; the band reflects the uncertainty in the modelled metabolic offset, which is the quantity Metra estimates:

Lower / upper bound (sorted so low ≤ high) Ĝlo = G − Mhi,   Ĝhi = G − Mlo

The mapping is deliberately inverted: a larger avoided-exposure offset produces a lower projected claims figure, so the high end of the offset CI maps to the low end of the projected-claims band, and vice versa. The bounds are sorted after construction so the reported band is always ordered. The band intentionally does not attempt to layer a second source of statistical uncertainty onto the connected paid experience; the connected experience is the group's own and is reported as supplied. A reviewer who wishes to stress the trend or completion assumptions can do so directly, since both are disclosed inputs.

§10. Worked Example

A 322-life group with a full plan year of connected experience, a 0.92 completion factor, a $150,000 pooling point with $380,000 above it, a carrier manual rate of $16,500 PEPY, an 8% forward trend, 15% attrition, and a v1.0 metabolic offset of $240,000 (CI $160,000–$330,000).

Calibration walk-through

Lives N = 3,864 member-months ÷ 12 = 322
L = 322 × (1 − 0.15) = 273.7 forward lives
Mature total paid $5,300,000 ÷ 0.92 completion
C = 5,300,000 / 0.92 = 5,760,870
Pool exclude $380,000 above the pooling point
B = 5,760,870 − 380,000 = 5,380,870
Observed credibility base per life
O = 5,380,870 / 322 = 16,711 PEPY
Credibility Bühlmann weight at N = 322, k = 400
Z = 322 / (322 + 400) = 0.446 (44.6%)
Blend observed ⊕ manual $16,500
π = 0.446 × 16,711 + 0.554 × 16,500 = 16,594 PEPY
Gross forward lives × blended × trend
G = 273.7 × 16,594 × 1.08 = 4,905,121
Offset subtract the v1.0 metabolic point estimate
Ĝ = 4,905,121 − 240,000 = 4,665,121
Band inherited from the offset CI, inverted & sorted
Ĝ_lo = 4,905,121 − 330,000 = 4,575,121 Ĝ_hi = 4,905,121 − 160,000 = 4,745,121

Result. Projected forward claims $4,665,121, 95% band $4,575,121 – $4,745,121, at a claims credibility of 44.6% on the group's own experience. The metabolic offset represents 4.9% of gross projected claims.

§11. PHI, Aggregation, and Governance

The calibration is aggregate-only by construction. The connected-experience record stores period-level dollar totals and member-months — never a member-level claim line, diagnosis, procedure, or identifier. The forward metabolic signal it consumes is itself a cohort-level figure subject to the v1.0 small-cell governance floor. The calibration therefore never touches, joins, or re-identifies protected health information, and there is no path in the procedure by which a single member's claim or biometric history could be reconstructed from the output.

Access to the calibration and to the signed output is gated to the employer's forecast-authorized administrators (the primary administrator and HR administrators). Every generated calibration is recorded in an audit trail carrying the inputs' provenance, the resulting credibility weight, the methodology version, and a reproducibility digest of the calibration inputs, so any signed figure can be regenerated and checked.

§12. Conformance with Actuarial Standards of Practice

ASOPSubjectHow the addendum conforms
No. 23Data QualityConnected experience carries a source label and paid-through date; completion and IBNR are disclosed inputs, never silently assumed; benchmark fallback is explicitly flagged as a placeholder.
No. 25Credibility ProceduresBühlmann weight Z = N/(N+k) with a stated, common constant; observed and expected quantities and the resulting weight are reported; the two credibility applications are kept disjoint (§8).
No. 41Actuarial CommunicationsThe signed output names the methodology version, the signer tier, the assumptions used, and a reproducibility digest; an unsigned tier is labelled methodology-only.
No. 56ModelingThe engine is pure and deterministic, unit-tested, and reproducible from its digest; assumptions (trend, attrition, completion, pooling) are externalized and disclosed rather than embedded.

§13. Limitations and Refusal Conditions

  1. The blend is only as good as the manual rate.

    When no group manual rate is supplied, the expected term falls back to a national benchmark and the output is flagged. A benchmark-anchored blend is materially weaker than a manual-anchored one and should not be carried to a renewal table without the carrier's own manual rate.

  2. Completion and trend are supplied, not derived.

    The procedure does not estimate a completion factor or a trend from the data; it applies what the actuary supplies. A wrong completion factor moves the observed rate proportionally and a wrong trend moves the entire gross projection.

  3. The confidence band reflects metabolic uncertainty, not experience volatility.

    The band is inherited from the v1.0 offset CI. It does not capture random fluctuation in the group's own paid experience; a reviewer wishing to stress experience volatility must do so against the disclosed inputs directly.

  4. Cohort floor.

    Because the metabolic offset is drawn from the v1.0 forecast, the calibration inherits v1.0's minimum-cohort requirement (≥ 15 enrolled lives). Below the floor the procedure refuses to generate rather than return a non-credible figure.

  5. One period, one forward year.

    The procedure calibrates a single connected period to a single forward year. It is not a multi-year reserve model and does not chain successive renewals.

§14. Version History and Change Control

v2.0 is additive to v1.0 and introduces no change to the v1.0 specification, which remains published and version-pinned at /methodology/v1.0/. A change to any constant in the version block above, to the credibility constant, or to the offset-consumption rule constitutes a version bump and requires re-attestation. Attestations are version-pinned: an attestation against v1.0 does not transfer to v2.0, and a signed Workforce Claims Forecast records the methodology version it was generated under.

§15. Bibliography

Bühlmann, H. (1967). Experience rating and credibility. ASTIN Bulletin, 4(3). · Klugman, S., Panjer, H., & Willmot, G. Loss Models: From Data to Decisions. · Actuarial Standards Board. ASOP No. 23 (Data Quality), No. 25 (Credibility Procedures), No. 41 (Actuarial Communications), No. 56 (Modeling). · Society of Actuaries, group health renewal and manual-rating practice notes. · The metabolic-exposure derivation, per-marker coefficients, and bootstrap confidence-interval construction consumed as the offset M are specified in full in Methodology v1.0.

For credentialed actuaries

Critiques, corrections, and counter-derivations are welcomed and inform the next version. Attestation submissions and review correspondence: care@usemetra.com, subject line “Methodology Review — v2.0” or “Methodology Attestation — v2.0”.