Data Dictionary

Reference guide for all fields available in the 5500Alpha dataset.

129 fields across 11 categories

Firmtext
firm
Plan sponsor name.
Plan Nametext
plan_name
Official plan name as filed.
EINidentifier
ein
Employer identification number.
Plan Yeardate
plan_year
Plan year start date.
Participant Sizeenum
sizes
Participant size tier.
Under 2525-99100-249250-9991000-49995000 plus
History Tierenum
history_tiers
Years-of-history tiers.
Under 4 yrs4-6 yrs7-13 yrs14-25 yrs25 yrs plusnullOther
Plan Sincedate
plan_history_start
Earliest filing year for the plan.
Filing Datedate
date_received
Date the filing was received by the Department of Labor. Used to filter and sort by filing recency.
Plan PINidentifier
pin
Plan identifier paired with EIN for year-over-year changes.
Filing Typeenum
filing_type
Filing category with schedules H or I where applicable.
5500-SF5500-H5500-I5500
ACK IDidentifier
ack_id
Unique filing ID.

Plan Type Categoryenum
plan_type_category
Broadest classification of the plan's payout structure (Defined Contribution vs. Defined Benefit).
DCDB
Plan Structureenum
plan_structure
The specific funding or regulatory model.
Salary Deferral DCEmployer-Driven DCEmployer-Mandated DCStock-Based DCIRA-Based Employer DCOther DCTraditional DBCash Balance DB
Plan Typeenum
retirement_plan_type
The common legal or marketing category of the plan.
401k403bProfit SharingCash BalanceDBESOPDC-OtherMoney PurchaseSEP
Enrollment Typeenum
enrollment_type
Enrollment type category.
AutoVoluntaryUnknown
Vesting Scheduleenum
vesting_contribs
Vesting schedule for employer contributions.
VestingNon-VestingUnknown
Safe Harbor 401(k)boolean
safe_harbor_401k
Safe harbor plan flag.
01
QDIAenum
qdia
Qualified default investment alternative (QDIA). Indicates if the plan uses an ERISA-compliant default investment for participants.
YesNoUnknown
Union Planboolean
is_union_plan
Indicates whether the plan's terms are maintained under one or more Collective Bargaining Agreements (CBAs).
01
Self-Directed Optionboolean
has_self_directed_option
Indicates if the plan allows a separate brokerage account to invest in assets outside the plan's core lineup.
01

Sectortext
sector
Uses NAICS to group employer into one of 21 broad economic blocks (e.g., Manufacturing, Finance)
Industry Grouptext
industry_group
Mid-level industry grouping within a subsector (98); example: Legal Services.
Industrytext
industry
The most granular industry classification (484); e.g., Offices of Dentists.
Fortune 500boolean
is_fortune_500
Fortune 500 sponsor flag.
01
NAICS Codeidentifier
naics
NAICS industry code.
Regionenum
region
Geographic region (4 region BLS model)
NortheastSouthMidwestWest
Metro Areatext
cbsa
The Core Based Statistical Area (CBSA), or Metro Area, consisting of one or more counties anchored by a high-density urban core.
Addresstext
address
Street address of the plan sponsor.
Citytext
city
City of the plan sponsor.
Statetext
state
State of the plan sponsor.
ZIPidentifier
zip
ZIP code of the plan sponsor.

Active Participantsinteger
active_ees_eoy
Active participants at year-end.
Participants with Accountsinteger
ees_w_accounts
Active participants with account balances.
Implied Median Ageinteger
implied_median_age
Estimated median participant age from TDF vintage with highest assets.
Participation Ratepercent
participation_rate
Active participants making contributions divided by eligibles (bounded by 0-1).
Participation Rankfloat
participation_rate_rank
Percentile rank of participation_rate for 401k and 403b plans only using cascading peer cohorts.
Participation P25percent
p25_participation_rate
25th percentile participation_rate for the applicable 401k and 403b peer cohort under the participation cascade.
Participation P50percent
p50_participation_rate
Median participation_rate for the applicable 401k and 403b peer cohort under the participation cascade.
Participation P75percent
p75_participation_rate
75th percentile participation_rate for the applicable 401k and 403b peer cohort under the participation cascade.
Employee Contributioncurrency
ee_contrib_per_part
Annual employee dollar contributions per participant.
Employee Contribution Rankfloat
ee_contribs_rank
Percentile rank of average employee contributions partitioned by Participant Size, Industry Group, and Geographic Region.
Employee Contribution P25currency
p25_ee_contribs
25th percentile employee contribution.
Employee Contribution P50currency
p50_ee_contribs
Median employee contribution.
Employee Contribution P75currency
p75_ee_contribs
75th percentile employee contribution.
Employer Contributioncurrency
firm_contribs_per_avg_part
Annual firm contributions per participant (firm dollar match).
Employer Contribution Rankfloat
firm_match_rank
Percentile rank of the employer's total annual contributions per participant, partitioned by Size, Industry Group, and Region.
Employer Contribution P25currency
p25_firm_match
25th percentile firm match.
Employer Contribution P50currency
p50_firm_match
Median firm match.
Employer Contribution P75currency
p75_firm_match
75th percentile firm match.
Total Contributioncurrency
total_contrib_per_part
The combined annual sum of employee deferrals and employer matching/profit-sharing contributions per active participant.
Total Contribution Rankfloat
total_contrib_rank
Percentile rank of the total annual contributions (EE + ER) per participant, partitioned by Size, Industry Group, and Region.
Total Contribution P25currency
p25_total_contrib
25th percentile total contribution per participant.
Total Contribution P50currency
p50_total_contrib
Median total contribution per participant.
Total Contribution P75currency
p75_total_contrib
75th percentile total contribution per participant.
Employee Contribution % of Totalpercent
ee_pct_of_total_contrib
The percentage of the total annual contribution amount that originated from employee deferrals.
Employee Contribution % of Total Rankfloat
ee_pct_of_total_contrib_rank
Percentile rank of the employee's share of total contributions, partitioned by Size, Industry Group, and Region.
Employee Contribution % of Total P25percent
p25_ee_pct_of_total_contrib
25th percentile employee share of total contributions.
Employee Contribution % of Total P50percent
p50_ee_pct_of_total_contrib
Median employee share of total contributions.
Employee Contribution % of Total P75percent
p75_ee_pct_of_total_contrib
75th percentile employee share of total contributions.
Contribution Cohortenum
rank_cohort_level_contributions
The specific peer-grouping used to calculate contribution ranks, based on a cascading match of Size, Industry Group, and Region.
SizeIndustry GroupRegion
Loans Allowedboolean
allows_loans
Plan permits participant loans.
01
Assets on Loan %percent
loan_percentage_eoy
Loan balances as a share of assets.
Loan Balance Rankfloat
assets_on_loan_rank
Percentile rank of the plan's total outstanding loans as a percentage of assets, partitioned by Avg. Balance Tier and Sector.
Loan Balance P25percent
p25_avg_assets_on_loan
25th percentile of loans as a percentage of assets.
Loan Balance P50percent
p50_avg_assets_on_loan
50th percentile of loans as a percentage of assets.
Loan Balance P75percent
p75_avg_assets_on_loan
75th percentile of loans as a percentage of assets.
Loans per Participantcurrency
loans_per_active
The total outstanding loan balance of the plan divided by the total number of active participants. A key sign of workforce stress.
Loans per Participant Rankfloat
loans_per_active_rank
Percentile rank of the plan-wide debt density, partitioned by Avg. Balance Tier and Sector.
Loans per Participant P25float
p25_loans_per_active
25th percentile loans per active participant.
Loans per Participant P50float
p50_loans_per_active
Median loans per active participant.
Loans per Participant P75float
p75_loans_per_active
75th percentile loans per active participant.

3Y Asset CAGRpercent
assets_eoy_3yr_cagr
Compounded annual growth rate of total assets over the trailing 3-year period.
3Y Participant CAGRpercent
active_ees_eoy_3yr_cagr
Compounded annual growth rate of active participants over the trailing 3-year period.

Total Assets (EOY)currency
assets_eoy
Total plan assets at year-end.
Asset Tierenum
asset_tiers
Categorizes plans into asset-value ranges for peer benchmarking.
Under $1M$1M-$5M$5M-$25M$25M-$100M$100M-$500M$500M plus
Average Balancecurrency
avg_balance
The average account balance per plan participant.
Average Balance Rankfloat
avg_balance_rank
Percentile rank of average participant balances partitioned by Asset Tier and Sector.
Avg Balance Tierenum
avg_balance_tier
Average balance tier by quintiles.
Under $13k$13-39k$39-70k$70-160k$160k plusOther
Average Balance P25currency
p25_avg_balance
25th percentile average balance.
Average Balance P50currency
p50_avg_balance
Median average balance.
Average Balance P75currency
p75_avg_balance
75th percentile average balance.

Admin Fee PAPMcurrency
admin_papm
Explicit administrative fees per participant per month, expressed in dollars. Used for direct cost comparisons and narratives.
Admin Fee PAPM Rankfloat
papm_admin_fees_rank
Percentile rank of administrative fees (PAPM) relative to comparable plans. Lower values indicate lower fees. Calculated on the eligible peer cohort; plans with missing or invalid fee data are excluded.
Admin Fee PAPM P25currency
p25_admin_papm
Peer cohort percentile benchmarks for administrative fees (PAPM). Values reflect the distribution of comparable plans after applying ranking eligibility rules.
Admin Fee PAPM P50currency
p50_admin_papm
Peer cohort percentile benchmarks for administrative fees (PAPM). Values reflect the distribution of comparable plans after applying ranking eligibility rules.
Admin Fee PAPM P75currency
p75_admin_papm
Peer cohort percentile benchmarks for administrative fees (PAPM). Values reflect the distribution of comparable plans after applying ranking eligibility rules.
BPS Admin Fees (basis points)float
bps_admin_avg_assets
Explicit administrative fees expressed as basis points of average plan assets. Primary metric for size-normalized fee comparisons across plans. Benchmark values are derived from the eligible peer cohort and may be null where fee data is incomplete or not comparable.
BPS Admin Fee Rankfloat
bps_admin_fees_rank
Percentile rank of administrative fees (basis points) relative to comparable plans. Lower values indicate lower fees. Calculated on the eligible peer cohort; excluded for plans without valid fee data.
BPS Admin Fee P25float
p25_admin_bps
Peer cohort percentile benchmarks for administrative fees in basis points. Reflect the distribution of comparable plans after applying ranking eligibility rules.
BPS Admin Fee P50float
p50_admin_bps
Peer cohort percentile benchmarks for administrative fees in basis points. Reflect the distribution of comparable plans after applying ranking eligibility rules.
BPS Admin Fee P75float
p75_admin_bps
Peer cohort percentile benchmarks for administrative fees in basis points. Reflect the distribution of comparable plans after applying ranking eligibility rules.

1-Year IRRpercent
irr
One-year estimated internal rate of return (IRR).
1-Year IRR Rankfloat
irr_rank
Percentile rank of one-year IRR partitioned by Plan Year and Sector.
1-Year IRR P25percent
p25_irr
25th percentile one-year return.
1-Year IRR P50percent
p50_irr
Median one-year return.
1-Year IRR P75percent
p75_irr
75th percentile one-year return.
3-Year IRRpercent
irr_3yr
Three-year estimated internal rate of return (IRR).
3-Year IRR Rankfloat
irr_3yr_rank
Percentile rank of three-year IRR partitioned by Plan Year and Sector.
3-Year IRR P25percent
p25_irr_3yr
25th percentile three-year return.
3-Year IRR P50percent
p50_irr_3yr
Median three-year return.
3-Year IRR P75percent
p75_irr_3yr
75th percentile three-year return.
5-Year IRRpercent
irr_5yr
Five-year estimated internal rate of return (IRR).
5-Year IRR Rankfloat
irr_5yr_rank
Percentile rank of five-year IRR partitioned by Plan Year and Sector.
5-Year IRR P25percent
p25_irr_5yr
25th percentile five-year return.
5-Year IRR P50percent
p50_irr_5yr
Median five-year return.
5-Year IRR P75percent
p75_irr_5yr
75th percentile five-year return.

Passive %percent
passive_pct
Percentage of assets on Schedule D in passive funds.
Dominant Fund Familytext
dominant_family
Fund family with highest asset concentration (e.g., Schwab, Fideilty, Vanguard)
Menu Breadth Tierenum
menu_breadth_tier
Categorizes the plan based on the total number of unique investment options offered on the menu.
1-56-1011-1516-2021-2526 plusnull
TDF Index Series Countinteger
tdf_index_series_count
Total number of unique index-based Target-Date Fund (TDF) series offered in the plan.
TDF Peak Vintage Yearyear
tdf_peak_vintage_year
Target-date vintage ( (e.g., 2030, 2045) with the highest assets in the plan.
Employer Stock %percent
employer_sec_pct
Percentage of plan assets held in employer stock or related securities.
Master Trust Countinteger
master_trust_count
Number of reported master trusts in the plan.
Asset Coverage %percent
asset_coverage_pct
The proportion of total plan assets that are mapped to specific investment categories or schedules.
DC Data Quality Flagenum
dc_data_quality_flag
Schedule D data quality flag identifying gaps in investment detail.
oklownone

Recordkeepertext
recordkeeper_name
Canonical plan recordkeeper name.
Recordkeeper Typeenum
recordkeeper_type
Recordkeeper classification (Schedule C).
BundledPayroll IntegratedStandaloneCustodian
Investment Advisortext
main_advisor_name
Primary investment advisor name.
Admin Platformtext
admin_platform
The primary retirement administration ecosystem or bundled provider (e.g., Human Interest, Guideline, Pentegra).
Auditortext
accountant_firm_name
Plan audit firm name.
Auditor Aliastext
accountant_firm_alias
Canonical normalized audit firm name.
Auditor EINidentifier
accountant_firm_ein
EIN of the plan audit firm.

Fee Risk Flagboolean
fee_risk_flag
Fee risk indicator based on minimum AUM and fees per account. 1 = risk flagged, 0 = no flag.
01
Corrective Distributioncurrency
corrective_distrib
Sum of corrective distribution dollars reported for the filing, expressed in absolute value. Tied to failed nondiscrimination testing or excess contribution refunds.
Corrective Distribution %percent
corrective_ee_percent
Corrective distributions as a percentage of employee contributions.
Auditor Changeboolean
new_auditor
Auditor change signal.
YesNo
Decision Makertext
signer
Plan signer or decision maker. Could be TPA or individual.
Returning Signerenum
signer_match
Indicates if the current year's authorized signer matches the previous year's filing history.
10New
Phonetext
phone_number
Sponsor or administrator phone number.