wealthschema/data sets/gig-variable-income-pack
All Data Sets

Gig & Variable Income Pack

Independent earners are the fastest-growing client segment in financial services. Gig workers, creators, freelancers, royalty-income artists, and digital nomads collectively now exceed traditional W-2 employees in some metros. Their financial lives don't fit the W-2 mold the modal wealth platform was designed around: variable cash flow, multiple platform-derived income streams, quarterly estimated tax payments, self-employed health insurance, Solo 401k contribution limits that depend on entity type, and the budget-and-savings patterns that emerge from income volatility rather than stable monthly paychecks. The Gig & Variable Income Pack is 100 households built for the platforms serving this segment correctly.

Households
100
Archetypes
8
Formats
JSON, CSV
Deviation
Moderate

Why this Data Set exists

Variable-income clients break the budgeting and savings logic in most wealth platforms. The 'monthly savings' field assumes a stable monthly base. The 'tax-withholding' calculation assumes W-2 withholding rather than quarterly estimated payments. The 'health insurance' field assumes employer coverage rather than ACA marketplace or COBRA. The 'retirement contribution' field assumes 401k matching rather than Solo 401k or SEP-IRA self-funding. None of these defaults fit the gig / freelancer / creator client.

The production-data problem reinforces the platform-design problem. Production data from any individual platform reflects the platform's existing client base — which is biased toward W-2 clients because that's who the platform was designed for. A platform trying to expand into the variable-income segment can't validate the design choices against production data because the segment isn't in production data.

This Data Set provides 100 households with non-W-2 income patterns: gig workers (rideshare, delivery, task platforms), creators (YouTube, Substack, OnlyFans, Twitch), freelancers (graphic design, writing, consulting), royalty-income artists (musicians, authors, photographers), military with variable allowances (BAH-driven income variation), and remote-worker digital nomads. Each carries the structural data variable-income planning requires.

Use Cases

Variable income cash-flow modeling
Quarterly estimated tax planning
Solo 401k / SEP-IRA optimization
Self-employment health insurance

Who uses this Data Set

Gig-Economy-Focused FinTech Builder

Validates the platform's variable-income cash-flow modeling, smoothing-savings logic, and quarterly-estimated-tax-payment automation against 100 realistic gig and creator profiles. Demos to gig-platform-partnership prospects without using real client data.

Self-Employed Tax Specialist

Tests the firm's quarterly estimated tax projection, self-employed health insurance deduction, Solo 401k optimization, and SEP-IRA modeling against realistic non-W-2 client profiles. Surfaces the cases where naive 'apply employer W-2 rules' logic produces wrong tax outcomes.

Creator Economy Platform Designer

Tests the platform's creator-monetization-and-financial-management features against realistic creator profiles spanning the income range from $30K side-income to $500K full-time creator. Ensures features handle the long-tail creator before scaling.

RIA Building Variable-Income Planning

Validates the firm's variable-income planning workflow against client profiles whose income volatility, multi-platform aggregation, and self-employed-tax-structure complexity match the modal independent-earner client.

ACA Marketplace Plan Selection Tool Builder

Tests the tool's plan-selection logic against gig-and-self-employed households whose income volatility creates ACA APTC subsidy reconciliation risks at year-end. Surfaces the cases where the income-projection input drives the subsidy calculation.

What's inside

The 100 households span eight archetypes covering the variable-income spectrum: F-01 new-graduate tech workers (where some are 1099 contractors); F-02 gig economy starters (rideshare, delivery); F-04 first-generation wealth builders (often gig-economy participants); F-05 active-duty enlisted (with BAH-driven variable income); F-06 international workers on visa; X-01 remote workers / digital nomads; X-02 creator economy / influencers; AR-01 artist / creative with royalty and irregular income.

Every household has structured income-stream data: each income stream identified by source, type (W-2, 1099-MISC, 1099-NEC, 1099-K, royalty, business-distribution, platform-revenue), platform identifier where applicable (Uber, DoorDash, YouTube, Patreon, Etsy, Upwork, Substack, etc.), and the realistic monthly-volatility distribution. Multi-platform income aggregation is structured for creators with 3+ revenue sources. Quarterly estimated tax payment history is documented (the dates and amounts of the prior 8 quarters of estimated payments). Self-employed retirement-plan contributions (Solo 401k, SEP-IRA, or SIMPLE-IRA) are structured with the contribution-limit calculation given the household's entity type. Self-employed health insurance is structured with the deduction calculation.

The Data Set ships as JSON and CSV. The WealthSynth Methodology PDF documents the income-stream taxonomy, the multi-platform aggregation methodology, the quarterly-estimated-tax-payment framework, the Solo 401k/SEP-IRA contribution-limit math, and the calibration source for typical variable-income profiles.

Preview a sample household

A redacted summary of one household from this Data Set — names, employers, exact balances, and metro area are stripped. Ages are bucketed, income and net worth are reported as bands. The full record (and all 100 like it) ships in the ZIP.

F-01·New Graduate Tech Worker
representative archetype household
Household
Single
State
WV
Gross income (band)
$50k–$100k
Net worth (band)
Dependents
0
Income source types
w2 salary, w2 bonus
Members (1)
primary
Age 25–29
professional services

Technical Highlights

Multi-platform income aggregation
Income volatility scoring
Self-employment tax accruals
Solo 401k contribution limits

Sample Schema Fields

sample_record.json
{
  "income.income_streams[]": <value>,
  "income.volatility_pct": <value>,
  "income.platform_revenue": <value>,
  "taxes.quarterly_estimated_payments": <value>,
  "accounts.solo_401k_balance": <value>
}

Sample queries

Find under-quartered estimated-tax payment cases

Returns households whose YTD estimated tax payments are below the safe-harbor threshold (the lower of 110% of prior-year tax or 100% of estimated current-year tax) — the queue for advisor outreach to avoid underpayment penalties.

households.filter(h => {
  const safeHarbor = Math.min(
    h.taxes.prior_year_tax * 1.10,
    h.taxes.estimated_current_year_tax
  );
  return h.taxes.quarterly_estimated_payments_ytd < safeHarbor;
})
Surface Solo 401k optimization opportunities

Returns self-employed households where current Solo 401k contribution is below the IRS maximum given their income — the planning queue for end-of-year contribution-decision conversations.

households.filter(h => {
  if (!h.business.entity_type) return false;
  const maxContribution = h.business.entity_type === 'S-Corp'
    ? Math.min(23000, h.business.reasonable_salary) +
      h.business.reasonable_salary * 0.25
    : 23000 + h.business.k1_distributions * 0.20;
  return h.accounts.solo_401k_balance.ytd_contribution <
    Math.min(maxContribution, 69000);
})
Identify multi-platform creator income aggregation

Returns creators with 3+ active revenue platforms, with the structured platform-by-platform breakdown — useful for testing aggregation tools and for surfacing the platforms where revenue is concentrating.

households.filter(h =>
  h.income.income_streams.filter(s =>
    s.platform_identifier).length >= 3
).map(h => ({
  id: h.id,
  platforms: h.income.income_streams
    .filter(s => s.platform_identifier)
    .map(s => ({
      platform: s.platform_identifier,
      ytd_revenue: s.ytd_revenue
    }))
}))
Track income-volatility for cash-flow planning

Returns households whose monthly income coefficient-of-variation exceeds 30% — the population where smoothing-savings and reserve-fund recommendations matter most.

households.filter(h => {
  const monthly = h.longitudinal.monthly.map(m => m.income);
  const cv = stddev(monthly) / mean(monthly);
  return cv > 0.30;
})

Methodology

Each household's income-stream profile is generated against archetype-specific patterns. Gig economy starters concentrate in 1-2 platforms with realistic revenue distributions per platform (Uber/Lyft drivers tend to have hourly-rate-and-hours-driven distributions; DoorDash drivers slightly different). Creators typically have 3-7 platforms with skewed distributions (one or two revenue-dominant platforms plus several small contributors). Royalty-income artists have realistic seasonal distributions reflecting the actual royalty-payment cycles of music, book, and licensing income. Quarterly estimated tax payment behaviour is calibrated against IRS data on self-employed compliance (about 30% of self-employed taxpayers under-pay quarterly estimates). Solo 401k and SEP-IRA contribution decisions are calibrated against industry data showing about 35% of eligible self-employed contribute to retirement accounts. The corpus passes the WealthSynth consistency validator (income-stream aggregation reconciles with reported income; quarterly payment math is correct; retirement contribution limits respect IRS thresholds) and the LLM-as-judge gate. Annual refresh tracks IRS rule changes and gig-platform policy changes.

Included Archetypes (8)

Frequently asked questions

How realistic are the platform-by-platform income distributions?+

The distributions are calibrated against published creator-economy and gig-economy income data: Patreon and Substack creator-revenue distributions; YouTube partner-program revenue distributions; Uber/Lyft and DoorDash driver-earnings data. The distributions reflect the realistic long-tail (most creators earn modestly, a small percentage earn substantially).

Are 1099-NEC and 1099-K reconciliation issues represented?+

Yes. About 25% of the corpus has at least one platform issuing both 1099-K and 1099-NEC for overlapping revenue (a recurring source of double-counting in self-employed tax filings). The structured platform-revenue data lets your tools test the 1099-K-versus-1099-NEC reconciliation logic.

Does the corpus reflect the new $5K 1099-K threshold?+

Yes. The 2024 IRS threshold for 1099-K issuance ($5,000 down from $20,000) is reflected — about 65% of the corpus's gig and creator households cross the lowered threshold. Annual refresh tracks IRS threshold changes (the threshold is expected to step down further in subsequent years).

Are estimated-tax safe-harbor calculations realistic?+

Yes. The safe-harbor methodology applies the lower of 110% prior-year tax (for higher-income households) or 100% of estimated current-year tax. About 30% of the corpus is in safe-harbor compliance; about 35% is at risk of underpayment penalty. The structured penalty calculation is documented per household.

Do military households with BAH variable allowances appear here?+

Yes. About 18% of the corpus is military with BAH-driven variable income — a source of income variability that comes from PCS moves and rank changes. The structured BAH/BAS data is shared with the B25 Military & Veterans Pack but with the variable-income lens applied here.

Are health-insurance solutions for self-employed handled?+

Yes. The structured health-insurance data covers the realistic options for self-employed households: ACA marketplace with APTC subsidy (with the income-volatility-driven subsidy reconciliation risk), employer-spouse coverage, COBRA continuation, and the rare cases of cash-pay healthcare.

Are royalty income payment cycles realistic?+

Yes. Royalty income for music (typically 6-month publisher accounting cycles), books (typically 6-month or annual accounting cycles), and licensing (variable cycles depending on contract) follows realistic distributions calibrated against published industry data. About 12% of the corpus has royalty income as a primary or significant secondary income source.

How does this fit alongside B17 (Small Business / K-1)?+

B17 focuses on entity-type small business owners — LLCs, S-Corps, partnerships with structured K-1 distributions, QBI deductions, and reasonable-salary determinations. B28 focuses on the broader independent-earner population — gig workers, creators, freelancers — many of whom operate as sole proprietors without entity structures. There's overlap in the LLC-as-sole-prop space; many tax-tech buyers purchase both for the integrated coverage.

Related Wealth Data Sets

$3,500
one-time purchase
100 households (ZIP)
Methodology PDF
JSON, CSV formats
Account required to purchase

Purchases are for internal use only. Redistribution or resale of data is prohibited under the WealthSchema Data License.

View data license →