30 purpose-built Data Sets plus the WealthSynth Master Corpus. Every Data Set ships in JSON and CSV with a Methodology PDF documenting field derivations, statistical calibration, and integration guidance. Format availability per Data Set is shown on each card.
Audit-ready households for Reg BI, fiduciary review, fee benchmarking, and lending fairness. Built for compliance teams, supervisory testers, and RIA onboarding.
130 synthetic households tuned for Reg BI suitability testing — concentrated holdings, age 75+, recent inheritance, cognitive decline markers, and risk-mismatch flags. Each record carries the eligibility triggers required to exercise broker-dealer supervisory workflows end to end.
270 households with monthly cash-flow models including income shocks, expense spikes, and liquidity-stress scenarios. Variable-income earners, single parents, gig workers, distressed mortgages, and post-divorce rebuild cases. Each household has 96 monthly snapshots.
130 affluent and HNW households with detailed fee structures: AUM-based advisory fees, tiered breakpoints, fund expense ratios, transaction costs, and tax-drag estimates. Includes complex fee arrangements (multi-firm, family-office, performance-based).
400 prospect households covering RIA client variety from formation through retirement. KYC-complete records, goal-based planning fields, initial recommendation outputs, and CRM-compatible field naming. The broadest single bundle by archetype coverage.
100 households underserved by traditional financial services: thin-credit-file ITIN filers, post-bankruptcy rebuilders, low-income working families, recent immigrants, disability claimants, and cannabis-industry workers. Useful for CRA-aligned lending and inclusion analytics.
80 underbanked and underserved households: prepaid-card users, check-cashing customers, ITIN filers, post-bankruptcy unbanked, and cash-economy participants. Companion to B23 (CRA) but focused on transactional / banking inclusion rather than lending.
Tax-complex households for harvesting algorithms, equity comp, K-1 pass-throughs, QSBS, crypto, and multi-state planning. Field-level cost basis, lots, and tax events.
350 households with detailed taxable brokerage positions, lot-level cost basis, unrealized gain/loss schedules, and wash-sale tracking. Specific-ID lots, holding periods, QSBS attestations, and cross-account wash-sale flags — built for direct-indexing engines and TLH backtesters.
150 households with detailed equity compensation: RSU vesting calendars, ISO/NQSO grants, ESPP with lookback, 83(b) elections, AMT exposure, and exercise window expirations. Each grant has a structured grant_type, vesting schedule, and vested-to-date calculation.
140 small business owner households across LLC, S-Corp, and partnership structures. Each carries reasonable-salary calculations, K-1 distributions, QBI deduction modeling (with SSTB classification and phaseout), guaranteed payments, and capital account tracking.
60 founder and early-employee households with QSBS Section 1202 attestation chains, 5-year holding period satisfaction, gross-asset tests at issuance, and stacking via non-grantor trusts. Includes IPO and secondary-sale liquidity events.
50 crypto-heavy households with detailed wallet inventories, transaction histories, cost-basis lots (FIFO/specific-ID), DeFi positions (LP, staking, lending), NFT income, airdrops, and 1099-DA reconciliation. Self-custody and exchange-custody both represented.
320 households with multi-state tax exposure: HCOL-to-LCOL relocations, dual residency, MA millionaires tax, NY/CA convenience-of-employer rules, and digital nomads with no fixed domicile. Each carries source-state allocation history and residency change events.
Near-retiree, retiree, and benefits households with multi-source income modeling: Social Security, pensions, RMDs, IRMAA, HSAs, and withdrawal sequencing.
210 near-retiree and retiree households with multi-source income modeling: Social Security claiming ages, pension elections, RMD schedules, annuity payouts, and taxable/tax-deferred withdrawal sequencing. Includes Roth conversion windows and IRMAA tier calculations.
250 households across the healthcare-benefits lifecycle: HDHP-with-HSA accumulators, COBRA/ACA-marketplace gap-fillers, SSDI/LTD claimants, Medicare-bridge pre-retirees, and IRMAA-exposed retirees. HSA-as-retirement-account strategies fully modeled.
Estate, trust, and wealth-transfer households for legacy planning, dynasty trusts, beneficiary stress tests, and life-event transitions.
200 households across the wealth-transfer lifecycle: young families establishing wills, dual-income couples buying first homes, peak earners building trusts, retirees coordinating beneficiaries, and blended families with QTIP planning. Each carries a current estate document inventory.
110 HNW and UHNW households with estate planning readiness scores, trust structures, gifting histories, charitable giving data, and GST exemption tracking. Complements B09 (Next-Gen Attrition) and B12 (Estate & Trust Planning).
180 households representing the full estate-planning lifecycle: peak earners building first trusts, HNW with complex multi-trust structures, late-retirees simplifying for handoff, recent inheritors navigating step-up basis, and next-gen heirs receiving distributions.
220 households mid-transition: divorce in progress, post-bankruptcy recovery, medical-debt crisis, sandwich-generation caregivers, recent windowhood, sudden wealth, distressed mortgages, and blended-family formation. High behavioral-event density and asset reshuffling.
Underwriting, illustration, and stress-test households with insurance schedules, mortgage exposure, claims history, and cybersecurity risk overlays.
240 households with detailed insurance needs analysis: life, disability, LTC, and annuity illustration inputs. High deviation surfaces edge cases (sub-standard underwriting, dependent special needs, blended families, recent claims history) that stress illustration engines.
90 households spanning the mortgage lifecycle: first-time buyers stretched to DTI limit, young families with new mortgages, distressed homeowners evaluating modification, and underwater scenarios. Includes complete mortgage application data, current LTV, and reserve adequacy.
50 households with embedded fraud, claims, and cybersecurity scenario flags: medical claims disputes, account takeover patterns, predatory lending exposure, and disability claim documentation. High deviation for adversarial ML training.
Specialized client segments with distinct planning needs: ESG, next-gen, student debt, military, faith-based, gig income, and behavioral coaching cohorts.
230 accredited and qualified-purchaser households with private equity, hedge fund, real assets, and private credit exposure. SEC accreditation status, illiquidity tolerance scoring, capital-call schedules, vintage-by-vintage TVPI/DPI/IRR for PE positions.
25 households tagged for the conditional ESG / values overlay (one of two bundles where race/ethnicity and religion fields are populated, per the v4 privacy contract). MSCI ESG taxonomy alignment, sector exclusion lists, impact allocation percentages, and DAF holdings.
15 next-generation wealth-recipient households (teen/young-adult heirs from HNW families) with engagement metrics, custodial account structures, kiddie-tax modeling, and intergenerational communication scoring. Companion pack to B11 (Wealth Transfer Readiness).
180 households with detailed student loan data: loan types, servicers, IDR plan enrollment, PSLF qualifying-payment counts, refinancing history, and forgiveness-tax-bomb projections. Includes Parent PLUS borrowers and double-consolidation paths.
50 households with values-driven investment overlays: ESG screening, halal/biblically-responsible exclusions, impact bond allocations, DAF management, and shareholder advocacy participation. Companion to B08 (ESG Values Alignment) — fully populates the demographic conditional overlay.
90 active-duty, retired, and veteran households spanning enlisted to officer ranks. BRS pension elections, TSP allocations, VA loan entitlements, BAH/BAS allowances, SBP elections, VA disability compensation, CRDP/CRSC, and PSLF for nonprofit-employed veterans.
70 households with international or faith-based financial considerations: H-1B visa-dependent earners, recent immigrants navigating ITIN-to-SSN transitions, FBAR/FATCA filers, and faith-based investors with strict exclusions. Conditional populates religion field per PLAN privacy contract.
100 households with non-W-2 income patterns: gig workers, creators, freelancers, royalty-income artists, military with variable allowances, and remote-worker digital nomads. Variable cash flow modeling, quarterly estimated tax payments, self-employed retirement plans.
130 households with elevated behavioral risk: financial-anxiety avoiders, overconfident DIY investors with concentration, lifestyle-inflation spenders, post-divorce rebuilders, and student-loan tax-bomb candidates. Each carries a behavioral profile and intervention-trigger flags.
The full corpus — every household, every overlay, every document. The single deliverable that powers all other Data Sets.
Everything: all 1,451 households, all 71 archetypes, every bundle overlay (30 bundles), and 96 monthly longitudinal snapshots per household — delivered as structured JSON profiles. The single source-of-truth corpus that powers every WealthSynth bundle. Includes the full schema documentation and conditional-overlay guide.