Skip to content

MindweaveTech/us-sme-dataset-sample

Repository files navigation

US Sample Business Database — AdventureWorks Alternative (IRS/FICA Compliant)

Looking for a realistic US sample database for SQL practice, software testing, or demo data? This is a modern alternative to AdventureWorks and Northwind — with balanced double-entry accounting, IRS tax compliance, and 3 years of simulated business operations.

A realistic, relational business dataset generated by simulating a US retail SME day-by-day over 3 financial years. Every transaction flows through double-entry accounting. Not random data — simulated business operations.

Free to use for development, testing, training, and evaluation.

Browse all datasets →

Full dataset: 42 tables, 78,000+ rows, CSV + SQL + Parquet + SQLite — $49 on Gumroad


Simulated Business

Field Value
Company Summit Outdoor Gear LLC
Industry Retail (outdoor/camping equipment)
Country US (Colorado)
Structure LLC with EIN
Period 1 January 2024 – 31 December 2025
Financial Year Calendar year (January–December)

IRS/FICA Tax Compliance (2024)

  • Federal income tax — 2024 marginal brackets (10%–37%), $14,600 standard deduction
  • FICA — Social Security 6.2% (up to $168,600) + Medicare 1.45%
  • State sales tax — ~7.5% on all sales
  • 401(k) — 4% employer match
  • Fortnightly payroll with real withholding calculations

Every payslip satisfies: Gross = Net + Tax + FICA. Every journal entry balances: debits = credits.

Quick Start

git clone https://github.com/MindweaveTech/us-sme-dataset-sample.git
cd us-sme-dataset-sample

sqlite3 :memory: <<'SQL'
.mode csv
.import companies.csv companies
.import sales_orders_sample.csv sales_orders
SELECT name, country, structure FROM companies;
SELECT count(*) as total_sales FROM sales_orders;
SQL

Sample Contents

Full reference tables: companies, departments, chart_of_accounts (69 accounts — GAAP-style), products (25 SKUs), customers

Sampled transactions: sales_orders (200 rows), journal_entries (200 rows), employees (full)

Full Dataset & Other Variants

View all products at mindweave.tech/datasets.

Product Country Tables Rows Price
US Complete US 42 78,000+ $49
US Multi-Company US 126 230,000+ $99
AU Complete AU 42 83,000+ $49
UK Complete UK 42 39,000+ $49

Industry Verticals

Product Industry Rows Price
US Restaurant & Bar Hospitality 259,000+ $79
US Consulting Firm Professional Services 156,000+ $79

Also on Payhip, Kaggle, and Hugging Face.

Keywords

Sample database, test data, dummy data, synthetic data, AdventureWorks alternative, Northwind alternative, SQL practice database, US business dataset, IRS test data, FICA, accounting test data, mock data, demo database.

License

Free for development, testing, training, and evaluation. Full dataset commercially licensed — see mindweave.tech/datasets.


Built by Mindweave Technologies | Dev.to: AdventureWorks Is Dead | Dev.to: 5 Reasons AdventureWorks Is Terrible

About

Free sample: 42-table synthetic US business dataset. IRS/FICA compliant, double-entry accounting. AdventureWorks alternative. CSV + SQL + Parquet + SQLite.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors