Independent earners are the fastest-growing client segment in financial services. Gig workers, creators, freelancers, royalty-income artists, and digital nomads collectively now exceed traditional W-2 employees in some metros. Their financial lives don't fit the W-2 mold the modal wealth platform was designed around: variable cash flow, multiple platform-derived income streams, quarterly estimated tax payments, self-employed health insurance, Solo 401k contribution limits that depend on entity type, and the budget-and-savings patterns that emerge from income volatility rather than stable monthly paychecks. The Gig & Variable Income Pack is 100 households built for the platforms serving this segment correctly.
Variable-income clients break the budgeting and savings logic in most wealth platforms. The 'monthly savings' field assumes a stable monthly base. The 'tax-withholding' calculation assumes W-2 withholding rather than quarterly estimated payments. The 'health insurance' field assumes employer coverage rather than ACA marketplace or COBRA. The 'retirement contribution' field assumes 401k matching rather than Solo 401k or SEP-IRA self-funding. None of these defaults fit the gig / freelancer / creator client.
The production-data problem reinforces the platform-design problem. Production data from any individual platform reflects the platform's existing client base — which is biased toward W-2 clients because that's who the platform was designed for. A platform trying to expand into the variable-income segment can't validate the design choices against production data because the segment isn't in production data.
This Data Set provides 100 households with non-W-2 income patterns: gig workers (rideshare, delivery, task platforms), creators (YouTube, Substack, OnlyFans, Twitch), freelancers (graphic design, writing, consulting), royalty-income artists (musicians, authors, photographers), military with variable allowances (BAH-driven income variation), and remote-worker digital nomads. Each carries the structural data variable-income planning requires.
Validates the platform's variable-income cash-flow modeling, smoothing-savings logic, and quarterly-estimated-tax-payment automation against 100 realistic gig and creator profiles. Demos to gig-platform-partnership prospects without using real client data.
Tests the firm's quarterly estimated tax projection, self-employed health insurance deduction, Solo 401k optimization, and SEP-IRA modeling against realistic non-W-2 client profiles. Surfaces the cases where naive 'apply employer W-2 rules' logic produces wrong tax outcomes.
Tests the platform's creator-monetization-and-financial-management features against realistic creator profiles spanning the income range from $30K side-income to $500K full-time creator. Ensures features handle the long-tail creator before scaling.
Validates the firm's variable-income planning workflow against client profiles whose income volatility, multi-platform aggregation, and self-employed-tax-structure complexity match the modal independent-earner client.
Tests the tool's plan-selection logic against gig-and-self-employed households whose income volatility creates ACA APTC subsidy reconciliation risks at year-end. Surfaces the cases where the income-projection input drives the subsidy calculation.
The 100 households span eight archetypes covering the variable-income spectrum: F-01 new-graduate tech workers (where some are 1099 contractors); F-02 gig economy starters (rideshare, delivery); F-04 first-generation wealth builders (often gig-economy participants); F-05 active-duty enlisted (with BAH-driven variable income); F-06 international workers on visa; X-01 remote workers / digital nomads; X-02 creator economy / influencers; AR-01 artist / creative with royalty and irregular income.
Every household has structured income-stream data: each income stream identified by source, type (W-2, 1099-MISC, 1099-NEC, 1099-K, royalty, business-distribution, platform-revenue), platform identifier where applicable (Uber, DoorDash, YouTube, Patreon, Etsy, Upwork, Substack, etc.), and the realistic monthly-volatility distribution. Multi-platform income aggregation is structured for creators with 3+ revenue sources. Quarterly estimated tax payment history is documented (the dates and amounts of the prior 8 quarters of estimated payments). Self-employed retirement-plan contributions (Solo 401k, SEP-IRA, or SIMPLE-IRA) are structured with the contribution-limit calculation given the household's entity type. Self-employed health insurance is structured with the deduction calculation.
The Data Set ships as JSON and CSV. The WealthSynth Methodology PDF documents the income-stream taxonomy, the multi-platform aggregation methodology, the quarterly-estimated-tax-payment framework, the Solo 401k/SEP-IRA contribution-limit math, and the calibration source for typical variable-income profiles.
A redacted summary of one household from this Data Set — names, employers, exact balances, and metro area are stripped. Ages are bucketed, income and net worth are reported as bands. The full record (and all 100 like it) ships in the ZIP.
{
"income.income_streams[]": <value>,
"income.volatility_pct": <value>,
"income.platform_revenue": <value>,
"taxes.quarterly_estimated_payments": <value>,
"accounts.solo_401k_balance": <value>
}Returns households whose YTD estimated tax payments are below the safe-harbor threshold (the lower of 110% of prior-year tax or 100% of estimated current-year tax) — the queue for advisor outreach to avoid underpayment penalties.
households.filter(h => {
const safeHarbor = Math.min(
h.taxes.prior_year_tax * 1.10,
h.taxes.estimated_current_year_tax
);
return h.taxes.quarterly_estimated_payments_ytd < safeHarbor;
})Returns self-employed households where current Solo 401k contribution is below the IRS maximum given their income — the planning queue for end-of-year contribution-decision conversations.
households.filter(h => {
if (!h.business.entity_type) return false;
const maxContribution = h.business.entity_type === 'S-Corp'
? Math.min(23000, h.business.reasonable_salary) +
h.business.reasonable_salary * 0.25
: 23000 + h.business.k1_distributions * 0.20;
return h.accounts.solo_401k_balance.ytd_contribution <
Math.min(maxContribution, 69000);
})Returns creators with 3+ active revenue platforms, with the structured platform-by-platform breakdown — useful for testing aggregation tools and for surfacing the platforms where revenue is concentrating.
households.filter(h =>
h.income.income_streams.filter(s =>
s.platform_identifier).length >= 3
).map(h => ({
id: h.id,
platforms: h.income.income_streams
.filter(s => s.platform_identifier)
.map(s => ({
platform: s.platform_identifier,
ytd_revenue: s.ytd_revenue
}))
}))Returns households whose monthly income coefficient-of-variation exceeds 30% — the population where smoothing-savings and reserve-fund recommendations matter most.
households.filter(h => {
const monthly = h.longitudinal.monthly.map(m => m.income);
const cv = stddev(monthly) / mean(monthly);
return cv > 0.30;
})Each household's income-stream profile is generated against archetype-specific patterns. Gig economy starters concentrate in 1-2 platforms with realistic revenue distributions per platform (Uber/Lyft drivers tend to have hourly-rate-and-hours-driven distributions; DoorDash drivers slightly different). Creators typically have 3-7 platforms with skewed distributions (one or two revenue-dominant platforms plus several small contributors). Royalty-income artists have realistic seasonal distributions reflecting the actual royalty-payment cycles of music, book, and licensing income. Quarterly estimated tax payment behaviour is calibrated against IRS data on self-employed compliance (about 30% of self-employed taxpayers under-pay quarterly estimates). Solo 401k and SEP-IRA contribution decisions are calibrated against industry data showing about 35% of eligible self-employed contribute to retirement accounts. The corpus passes the WealthSynth consistency validator (income-stream aggregation reconciles with reported income; quarterly payment math is correct; retirement contribution limits respect IRS thresholds) and the LLM-as-judge gate. Annual refresh tracks IRS rule changes and gig-platform policy changes.
The distributions are calibrated against published creator-economy and gig-economy income data: Patreon and Substack creator-revenue distributions; YouTube partner-program revenue distributions; Uber/Lyft and DoorDash driver-earnings data. The distributions reflect the realistic long-tail (most creators earn modestly, a small percentage earn substantially).
Yes. About 25% of the corpus has at least one platform issuing both 1099-K and 1099-NEC for overlapping revenue (a recurring source of double-counting in self-employed tax filings). The structured platform-revenue data lets your tools test the 1099-K-versus-1099-NEC reconciliation logic.
Yes. The 2024 IRS threshold for 1099-K issuance ($5,000 down from $20,000) is reflected — about 65% of the corpus's gig and creator households cross the lowered threshold. Annual refresh tracks IRS threshold changes (the threshold is expected to step down further in subsequent years).
Yes. The safe-harbor methodology applies the lower of 110% prior-year tax (for higher-income households) or 100% of estimated current-year tax. About 30% of the corpus is in safe-harbor compliance; about 35% is at risk of underpayment penalty. The structured penalty calculation is documented per household.
Yes. About 18% of the corpus is military with BAH-driven variable income — a source of income variability that comes from PCS moves and rank changes. The structured BAH/BAS data is shared with the B25 Military & Veterans Pack but with the variable-income lens applied here.
Yes. The structured health-insurance data covers the realistic options for self-employed households: ACA marketplace with APTC subsidy (with the income-volatility-driven subsidy reconciliation risk), employer-spouse coverage, COBRA continuation, and the rare cases of cash-pay healthcare.
Yes. Royalty income for music (typically 6-month publisher accounting cycles), books (typically 6-month or annual accounting cycles), and licensing (variable cycles depending on contract) follows realistic distributions calibrated against published industry data. About 12% of the corpus has royalty income as a primary or significant secondary income source.
B17 focuses on entity-type small business owners — LLCs, S-Corps, partnerships with structured K-1 distributions, QBI deductions, and reasonable-salary determinations. B28 focuses on the broader independent-earner population — gig workers, creators, freelancers — many of whom operate as sole proprietors without entity structures. There's overlap in the LLC-as-sole-prop space; many tax-tech buyers purchase both for the integrated coverage.
140 small business owner households across LLC, S-Corp, and partnership structures. Each carries reasonable-salary calculations, K-1 distributions, QBI deduction modeling (with SSTB classification and phaseout), guaranteed payments, and capital account tracking.
270 households with monthly cash-flow models including income shocks, expense spikes, and liquidity-stress scenarios. Variable-income earners, single parents, gig workers, distressed mortgages, and post-divorce rebuild cases. Each household has 96 monthly snapshots.
320 households with multi-state tax exposure: HCOL-to-LCOL relocations, dual residency, MA millionaires tax, NY/CA convenience-of-employer rules, and digital nomads with no fixed domicile. Each carries source-state allocation history and residency change events.
Purchases are for internal use only. Redistribution or resale of data is prohibited under the WealthSchema Data License.
View data license →