Healthcare costs are the wildcard in financial planning. They're not the largest expense for most households — but they're the most variable, the most regulation-driven, and the most likely to derail an otherwise well-built plan. The Healthcare Benefits & HSA Pack is 250 households built for the planning tools that need to model healthcare seriously: HSA accumulation as a retirement strategy, Medicare bridge timing for early retirees, ACA marketplace subsidies for self-employed and gap-filling households, SSDI/LTD claim trajectories for disabled clients, and the IRMAA tier interactions that can quietly add four-figures to a retiree's annual Medicare premium.
Most planning engines treat healthcare as an opaque expense line. The household pays $X per month for premiums, $Y per year for out-of-pocket, and the plan moves on. This is fine until the household hits a transition: COBRA coverage from a job loss, ACA marketplace shopping during a sabbatical, Medicare enrollment at 65, IRMAA-tier-driving income decisions in the gap years. At each transition, the healthcare line item suddenly becomes a strategic variable — and most planning engines can't model it because the underlying data doesn't capture the structural distinctions.
For builders, the data problem is one of structural depth. A household's healthcare data isn't a single field; it's a structured profile of plan type (HDHP, PPO, HMO, Medicare, ACA marketplace), HSA / FSA balances and contribution history, benefit-eligibility transitions, disability claim histories where applicable, and the income-driven premium adjustments that interact with the household's tax planning. Most fixture data has none of this.
This Data Set provides 250 households where the full healthcare-benefits picture is structured: HDHP-with-HSA accumulators (where the HSA is being treated as a retirement account); COBRA / ACA marketplace gap-fillers; SSDI and LTD claimants; Medicare-bridge pre-retirees deciding when to retire vs. when to enroll; and IRMAA-exposed retirees whose income decisions affect Medicare premiums on a 2-year lag.
Validates the platform's HSA-as-retirement-account modeling — triple-tax-advantage projections, spending-vs-saving optimization, post-65 expanded use cases — against 250 households whose HSA contribution and balance histories span the realistic range from new-HSA-holders to long-time accumulators.
Demos the platform's Medicare bridge planning capability to advisor and prospect audiences using realistic households whose ages span 60-67 (the bridge-planning window), with the structured ACA/COBRA/Medicare cost trajectories pre-populated.
Tests the firm's disability-claim assistance workflow against households with pending or ongoing SSDI/LTD claims, including the realistic edge cases (denial-and-appeal trajectories, SSDI/LTD offset calculations, retroactive payments) that complicate standard income-replacement modeling.
Validates the firm's self-employed health insurance deduction and ACA APTC subsidy logic against gig-worker and small-business-owner households whose income volatility creates subsidy-recapture and -underestimation risks.
Tests the cost-of-care projection engine against retirees whose plan elections (Original Medicare with Medigap vs. Medicare Advantage) interact with their geographic location, income tier, and chronic-condition status to drive the actual annual healthcare cost.
The 250 households span twelve archetypes covering the full healthcare-benefits lifecycle. About 25% are early-career employees in HDHP plans (the HSA accumulator population); 20% are self-employed or gig workers navigating ACA marketplace plans; 25% are pre-retirees in the Medicare bridge window (60-65); 20% are recently-retired (65-70) with active Medicare planning decisions; 10% are in benefits-gap or disability-claim situations.
Each household has structured healthcare data: current plan type, premium and out-of-pocket cost structure, HSA balance and contribution history (where applicable), FSA participation, ACA marketplace subsidy eligibility (with APTC and PTC calculations), Medicare enrollment status and election, IRMAA tier history, and disability claim status (SSDI / private LTD). Where the household has had a recent benefits transition (job loss, retirement, disability onset), the structured event log captures the transition timing and the bridge-coverage details.
The Data Set ships as JSON and CSV. The WealthSynth Methodology PDF documents the plan-type taxonomy, the HSA-as-retirement-account modeling methodology, the Medicare and IRMAA calculation chain, the ACA APTC subsidy formula, and the SSDI / LTD offset structures. Calibration sources include CMS, KFF, the SSA, and the major HSA-administrator industry publications.
A redacted summary of one household from this Data Set — names, employers, exact balances, and metro area are stripped. Ages are bucketed, income and net worth are reported as bands. The full record (and all 250 like it) ships in the ZIP.
{
"insurance.health.plan_type": <value>,
"accounts.hsa.balance": <value>,
"accounts.hsa.contribution_history": <value>,
"insurance.disability.benefit_amount": <value>,
"taxes.irmaa_tier": <value>
}Returns HDHP-with-HSA households whose HSA contribution rate is at the family limit AND whose age suggests the HSA could meaningfully accumulate to age 65 — the population where HSA-as-retirement-account positioning matters.
households.filter(h => h.insurance.health.plan_type === 'HDHP' && h.accounts.hsa.contribution_history.current_year_pct === 1.0 && h.members[0].age < 55 )
Returns pre-retirees age 60-64 whose retirement is planned within 36 months — the planning window where COBRA, ACA marketplace, or early-retiree employer coverage decisions matter for the bridge to age 65.
households.filter(h => {
const age = h.members.find(m => m.role === 'primary').age;
return age >= 60 && age < 65 &&
h.events.life_events.some(e =>
e.type === 'planned_retirement' &&
monthsBetween(today(), e.date) <= 36);
})Returns retirees whose current-year income places them within $5K of the next IRMAA threshold — small income reductions or deferrals could keep them in the lower tier, saving meaningful Medicare premium dollars on the 2-year lag.
households.filter(h => {
const margin = h.taxes.next_irmaa_threshold -
h.taxes.modified_agi;
return margin > 0 && margin < 5000;
})Returns ACA-marketplace households whose actual income exceeds the income they reported for advance-premium-tax-credit calculation — these households face APTC clawback at tax filing.
households.filter(h =>
h.insurance.health.plan_type === 'ACA_marketplace' &&
h.taxes.modified_agi >
h.insurance.health.aca_aptc_assumed_income * 1.10
)Each household's healthcare profile is generated against archetype-specific patterns. HSA accumulators draw from realistic HDHP-population distributions (about 30% of W-2 workers under 55 have HDHP coverage). ACA marketplace households are concentrated in self-employed and gig-worker archetypes, with subsidy eligibility computed against current-year FPL thresholds. Medicare-eligible retirees split between Original Medicare with Medigap (~40%, weighted toward higher-income retirees) and Medicare Advantage (~45%, weighted toward middle-income retirees). IRMAA tier history uses the 2-year-lookback methodology with current Medicare Part B and Part D surcharge brackets. SSDI claim populations are sized from SSA program statistics; private LTD claim populations from disability-insurance industry data. The corpus passes the WealthSynth consistency validator (premium and benefit calculations are internally consistent; IRMAA tier history is correctly lagged; HSA contribution math reconciles with IRS limits) and the LLM-as-judge gate. Annual refresh tracks ACA FPL updates, Medicare bracket changes, HSA contribution limits, and any benefit-program statutory changes.
Yes. About 35% of the corpus's HSA-eligible households are explicitly modeling the HSA as a retirement account — paying out-of-pocket medical expenses from cash flow rather than the HSA, and treating the HSA as a triple-tax-advantage retirement vehicle. The structured data includes the medical-expense receipt log (so the post-65 reimbursement strategy can be modeled correctly).
Yes. Households with both SSDI and private LTD coverage have the structured offset calculation — typically the LTD benefit is reduced by the SSDI benefit dollar-for-dollar, with offset terms documented in the LTD policy. About 8% of the corpus has active SSDI claims, with structured trajectories including the 5-month elimination period, the 24-month Medicare-eligibility waiting period, and the appeal history where applicable.
APTC is calculated against the household's projected MAGI as a percentage of the federal poverty level, using the current year's premium tax credit formula. The corpus structures both the household's projected income (used for advance subsidy) and actual income (which determines actual subsidy at tax filing) — letting your tools model the reconciliation that surprises many ACA-marketplace households at year-end.
Yes. Medicare Advantage adoption rates and plan availability vary significantly by ZIP code. The corpus distributes Medicare Advantage participants across geographies in line with CMS regional enrollment data, with realistic plan-feature distributions (Part D included, gym membership, dental coverage levels).
Yes. The SECURE Act 2.0 inflation-indexed the IRMAA brackets going forward (previously they were fixed in nominal dollars); the corpus uses the current-year brackets with the indexing reflected. The Methodology PDF notes the bracket schedule and the political risk that these brackets could be re-fixed in the future.
The corpus uses current-year IRS limits ($4,150 single, $8,300 family for 2025; family limits and the catch-up contribution for 55+ are structured per household). Households at the contribution limit are flagged so your tools can surface the maxed-out planning case. About 18% of the HSA-eligible households are contributing at the family limit.
Yes — about 12% of the corpus has structured long-term-care insurance positions (traditional LTC, hybrid life-LTC, or self-insurance via dedicated reserves). The premium history, benefit period, and inflation-rider structures are documented. LTC is increasingly a planning priority for the 60-75 cohort and the corpus reflects this.
B03 focuses on the income side of retirement: Social Security, pension, RMD, withdrawal sequencing. B18 focuses on the healthcare side: HSA, Medicare, ACA, disability. They share the IRMAA / Medicare seam (where income decisions affect premium tiers) and the bridge-planning seam (where pre-Medicare healthcare costs interact with retirement-timing). Many planning-tool builders buy both — they cover complementary planning surfaces for the same client.
210 near-retiree and retiree households with multi-source income modeling: Social Security claiming ages, pension elections, RMD schedules, annuity payouts, and taxable/tax-deferred withdrawal sequencing. Includes Roth conversion windows and IRMAA tier calculations.
110 HNW and UHNW households with estate planning readiness scores, trust structures, gifting histories, charitable giving data, and GST exemption tracking. Complements B09 (Next-Gen Attrition) and B12 (Estate & Trust Planning).
90 active-duty, retired, and veteran households spanning enlisted to officer ranks. BRS pension elections, TSP allocations, VA loan entitlements, BAH/BAS allowances, SBP elections, VA disability compensation, CRDP/CRSC, and PSLF for nonprofit-employed veterans.
Purchases are for internal use only. Redistribution or resale of data is prohibited under the WealthSchema Data License.
View data license →