wealthschema/data sets/multi-state-tax-optimization-pack
All Data Sets

Multi-State Tax Optimization Pack

State tax planning used to be a niche practice. Today it's a default question for nearly every wealth client: should they relocate to a no-income-tax state before the liquidity event, what's the convenience-of-employer rule going to do to their NY tax bill if they move to Florida but the company headquarters stays in Manhattan, what triggers MA's millionaires tax surcharge, and how does dual residency between two states actually work? The Multi-State Tax Optimization Pack is 320 households built for the planning tools and tax engines that need to handle these questions correctly — with the residency change events, source-state allocation, and state-specific surtax structures fully populated.

Households
320
Archetypes
12
Formats
JSON, CSV, Parquet
Deviation
Moderate

Why this Data Set exists

Multi-state tax used to be solvable with a residency calendar and an apportionment formula. Three things changed that. First, remote work made dual-state employment normal — a worker living in NJ but employed by an NY-headquartered firm now triggers NY's convenience-of-employer rule, which most planning tools don't handle. Second, MA's millionaires tax surcharge created a state-specific bracket structure that interacts non-trivially with relocation timing. Third, digital nomads — workers without a fixed-domicile state at all — created a planning category that simply didn't exist five years ago.

For planning tools, this means state tax logic that used to be 'pick the residency state, apply that state's rates' now needs to handle: residency-change events with prorated multi-state filings; source-state income allocation that separates W-2 wages from RSU vesting from K-1 distributions from real-estate-located income; convenience-of-employer rules that override the location-of-work analysis for specific state pairs; and state-specific surtax brackets. None of this fits the generic 'apply state rate' approach most tools use.

This Data Set provides 320 households where this complexity is structurally expressed: residency-change event histories, source-state allocation per income type, properties-by-state real estate inventories, and state-specific flags for the surtax structures (MA, CA, NY-convenience-of-employer) that drive most state-level planning value.

Use Cases

State tax engine validation
Residency change modeling
Convenience-of-employer rule testing
MA millionaires tax calculation

Who uses this Data Set

State Tax Engine Engineer

Validates the engine's multi-state apportionment, residency-change handling, and convenience-of-employer logic against 320 households whose state-tax structures span the realistic range — from straightforward single-state filers to digital nomads with no fixed domicile.

Wealth Platform PM Building Relocation Tools

Tests the relocation-modeling tool against households mid-relocation (HCOL-to-LCOL, NY-to-FL, CA-to-TX) ensuring the projected after-relocation tax savings calculations correctly handle convenience-of-employer overrides where applicable.

RIA Serving Mobile Professionals

Tests the firm's multi-state planning workflow against representative client profiles before client engagement: tech workers relocating before IPO, retirees splitting time between two states, dual-residency UHNW couples managing residency tests.

CPA Building Multi-State Filing Tools

Validates the firm's multi-state filing software against households whose source-state income allocation breaks naive 'wages-where-you-live' assumptions — particularly for stock-option exercise income tied to a former state of employment.

MA / CA / NY Surtax Planning Specialist

Tests planning recommendations specifically for clients exposed to MA's millionaires tax, CA's mental health services tax, or NY's high-earner surtax. The corpus pre-populates the surtax-flag fields so the planning tool can offer state-specific recommendations.

What's inside

The 320 households span twelve archetypes intentionally chosen for state-tax complexity: tech workers in HCOL states, dual-income professionals across state lines, peak earners with concentrated equity tied to their employment state, HNW retirees considering relocation, FIRE achievers who moved to no-income-tax states, real estate investors with multi-state property, digital nomads, and pre-IPO founders timing relocations against liquidity events.

Every household has structured tax-residency data: primary state, residency history (with each change event timestamped and reasoned), and where applicable, dual residency status. Source-state income allocation breaks down income by type and source state — W-2 wages allocated to the location-of-work state (with convenience-of-employer overrides flagged), stock-option exercise income tied to the state-where-vested, K-1 distributions allocated to the entity's state, real estate income located to the property state. Properties-by-state real estate inventories are structured with current valuations and rental-income status. State-specific surtax flags pre-populate for households whose income or wealth places them above the relevant thresholds (MA $1M, CA $1M for mental health services tax, NY high-earner surcharge). Digital-nomad households carry a `domicile_state: null` flag with the 12-month state-presence history.

The Data Set ships as JSON, CSV, and Parquet. The WealthSynth Methodology PDF documents the residency-change event taxonomy, the source-state allocation methodology (with citations to the FTA's apportionment-rule database), the convenience-of-employer rule states (currently NY, NE, PA, AR, DE — though this list changes), and the surtax structures by state.

Preview a sample household

A redacted summary of one household from this Data Set — names, employers, exact balances, and metro area are stripped. Ages are bucketed, income and net worth are reported as bands. The full record (and all 320 like it) ships in the ZIP.

F-01·New Graduate Tech Worker
representative archetype household
Household
Single
State
WV
Gross income (band)
$50k–$100k
Net worth (band)
Dependents
0
Income source types
w2 salary, w2 bonus
Members (1)
primary
Age 25–29
professional services

Technical Highlights

Multi-state income source allocation
Residency change event history
State-specific surtax flags (MA, CA)
No-domicile / digital nomad markers

Sample Schema Fields

sample_record.json
{
  "tax_residency.primary_state": <value>,
  "tax_residency.history[]": <value>,
  "income.source_state_allocation": <value>,
  "real_estate.properties_by_state": <value>,
  "events.relocation_events": <value>
}

Sample queries

Find HCOL-to-LCOL relocation candidates with equity events

Returns households living in CA/NY/NJ whose equity-comp grants vest within 12 months and who have not yet relocated — the planning population where pre-vesting relocation could meaningfully reduce state tax on the vest event.

households.filter(h =>
  ['CA', 'NY', 'NJ'].includes(h.tax_residency.primary_state) &&
  h.equity_comp.grants?.some(g =>
    g.vesting_schedule.some(v =>
      monthsBetween(today(), v.vest_date) <= 12)) &&
  !h.events.relocation_events?.some(e =>
    daysSinceEvent(e) < 180)
)
Surface convenience-of-employer rule exposures

Returns households whose state-of-residence differs from their employer's HQ state where the HQ state has a convenience-of-employer rule (NY is the canonical example) — the population where residing in the lower-tax state may not actually reduce state tax.

households.filter(h => {
  const hqState = h.income.employer_hq_state;
  const homeState = h.tax_residency.primary_state;
  const convenienceStates = ['NY', 'NE', 'PA', 'AR', 'DE'];
  return convenienceStates.includes(hqState) && hqState !== homeState;
})
Track residency-change events for proration

Returns households with a residency change in the current tax year, with the change date and the prorated days in each state — the canonical input for state-by-state tax filings under part-year residency.

households.flatMap(h =>
  h.tax_residency.history.filter(c =>
    c.change_date.startsWith(currentTaxYear)
  ).map(c => ({
    household: h.id,
    from_state: c.from_state,
    to_state: c.to_state,
    change_date: c.change_date,
    days_in_from_state: daysInYearBefore(c.change_date),
    days_in_to_state: daysInYearAfter(c.change_date)
  }))
)
Identify digital nomads with no clear domicile

Returns households whose `domicile_state` is null AND whose 12-month state-presence history shows no single state above the residency-test threshold (typically 183 days). These households need bespoke planning since standard state-tax tools assume a domicile.

households.filter(h =>
  h.tax_residency.primary_state === null &&
  Math.max(...Object.values(h.tax_residency.days_by_state)) < 183
)

Methodology

Each household's state-tax profile is generated against archetype-specific patterns. Tech workers and dual-income professionals are sampled with realistic distributions of HCOL state residency (CA, NY, MA, WA dominant) plus the realistic relocation-rate distribution (about 6% of high-income tech workers relocate per year, weighted toward FL, TX, NV destinations). Source-state income allocation uses the FTA's apportionment-rule database for each state pair. Convenience-of-employer rule states (NY, NE, PA, AR, DE) are correctly flagged where the household's home state differs from the employer HQ state. State-specific surtax thresholds (MA $1M, CA $1M, NY high-earner) are applied based on actual household income. Digital-nomad households (about 8% of the corpus) carry the 12-month state-presence history needed for the planning question. The corpus passes the WealthSynth consistency validator (state tax math reconciles, residency-change events have valid date sequencing, source-state allocation sums correctly across income types) and the LLM-as-judge gate. Annual refresh tracks state-level tax changes (rate increases, surtax brackets, conformity changes).

Included Archetypes (12)

Frequently asked questions

Are state rates current?+

Yes. The corpus uses the rates and brackets effective for the current tax year, including any state-level surtaxes (MA millionaires tax 4% surcharge, CA mental health services tax 1% over $1M, NY high-earner surcharge). Annual refresh updates against the prior year's state-legislation changes.

How is the convenience-of-employer rule represented?+

Households whose home state differs from the employer HQ state — where the HQ state has a convenience-of-employer rule — are flagged with a structured `convenience_of_employer_override` field on the W-2 income allocation. The current convenience-of-employer states are NY, NE, PA, AR, and DE; the Methodology PDF notes the rule's interpretation differs slightly across these states.

Are digital nomads' state-presence histories realistic?+

Digital-nomad households (about 8% of the corpus) have 12-month state-presence histories generated from realistic patterns: rotating among 4–8 states per year with no single state above 183 days. The patterns reflect the Remote Year, Cohorts, and other digital-nomad-program demographic data.

Does the corpus handle the IRS-vs-state-residency distinction?+

Yes. State residency tests differ from federal residency (a US tax resident might be a state nonresident or vice versa). The corpus structures both: each household has a federal residency status (the §7701(b) test) and a state residency status (state-by-state, since each state's test is different). For most households these are the same; for some (international workers, recent expats, Puerto Rico residents) they're not.

Are stock-option exercise income allocations handled correctly?+

Yes. Stock-option exercise income is allocated based on the location of work during the vesting period, not the location of work at exercise. About 22% of the corpus's relocated households have stock options that vested partially in two states; the structured `equity_comp.grants[].vesting_state_allocation` field shows the apportionment.

How are real estate properties handled?+

Real estate is structured by state with current valuation, ownership percentage, rental status (primary, secondary, rental, investment), and the rental income generated. Multi-state real estate investors in the corpus typically have 2–4 properties spread across investment-friendly states (FL, TX, AZ, NC). The state-of-property income allocation flows into the source-state allocation correctly.

Does the corpus include the QBI state-conformity issues?+

Yes — the QBI deduction has different state-level conformity (some states fully conform, some partially conform, some don't allow QBI for state purposes). About 30% of the corpus is in states with QBI non-conformity issues; the Methodology PDF documents the state-by-state status.

How does this differ from B17 (Small Business / K-1)?+

B17 focuses on entity-level tax for small business owners — entity type, reasonable salary, QBI deduction, partnership K-1 mechanics. B22 focuses on multi-state issues, which apply to W-2 earners as much as to business owners. A multi-state business owner might appear in both bundles. Most state-tax-focused buyers purchase B22; tax-tech tools serving small business owners across multiple states often buy both.

Related Wealth Data Sets

$4,500
one-time purchase
320 households (ZIP)
Methodology PDF
JSON, CSV, Parquet formats
Account required to purchase

Purchases are for internal use only. Redistribution or resale of data is prohibited under the WealthSchema Data License.

View data license →