dbt-BigQuery Package for Salesforce Campaign Funnel Data

A production ready dbt package that transforms raw Salesforce data integrated with Windsor.ai into clean, analytics ready tables in BigQuery following standardized architecture patterns.

You can find a complete list of available Salesforce fields here.

🚀 Features of this dbt package:

Multi level data modeling: Structured models for campaigns, leads, contacts, and opportunities
Campaign funnel analytics: Complete lead → contact → opportunity attribution tracking
Business KPIs out of the box: Pre calculated metrics like conversion rates, campaign ROI, and lead progression timing
Reusable macros: Modular macros for consistent metric calculation and data transformation
Custom tests: Built in tests to ensure data quality and prevent duplicates
Type safety: Safe and consistent casting of strings to numeric types using safe_cast
Performance optimized: Designed for BigQuery efficiency with native data types and filter logic
Built for Windsor.ai: Tailored to Windsor.ai's Salesforce schema and sync behavior
Executive dashboards: Ready to use models for business intelligence

What does this dbt package do?

This package transforms raw Salesforce data into clean, analytics ready tables modeling the complete Campaign → Lead → Contact → Campaign Member → Opportunity funnel. It provides:

Campaign ROI analysis: Track performance metrics across campaigns
Lead conversion tracking: Monitor progression with timing analytics
Multi touch attribution: Understand campaign touchpoint impact
Executive dashboards: Ready to use models for business intelligence
Data quality assurance: Built in validation and testing

⚙️ Prerequisites:

Before using this package, you have to integrate Salesforce data into BigQuery using the Windsor.ai connector to ensure the schema matches the expected format:

Sign up for Windsor.ai's free trial.
Connect your Salesforce account(s).
Choose BigQuery as a data destination.
Create and run a destination task for each of the 5 required tables by selecting specific fields. You can use the Report Presets dropdown to automatically select the necessary fields for each model (campaigns, leads, contacts, campaign_members, opportunities).

✅ Required BigQuery tables

Windsor.ai will stream your Salesforce data to your BigQuery project in minutes. The following tables must be available with the field structure defined in the sources.yml file:

Table	Status	Description	Key Fields
`campaigns`	Required	Campaign level info such as names, types, status, and budgets	`id`, `name`, `type`, `status`, `start_date`, `end_date`, `budgeted_cost`, `actual_cost`, `is_active`
`leads`	Required	Lead records with contact information and source tracking	`id`, `email`, `first_name`, `last_name`, `company`, `status`, `lead_source`, `created_date`, `converted_date`, `converted_contact_id`, `converted_opportunity_id`
`contacts`	Required	Contact master data with account relationships	`id`, `email`, `first_name`, `last_name`, `account_id`, `created_date`, `lead_source`
`campaign_members`	Required	Campaign membership associations linking campaigns to leads/contacts	`id`, `campaign_id`, `lead_id`, `contact_id`, `status`, `first_responded_date`, `created_date`
`opportunities`	Required	Sales opportunity pipeline data with campaign attribution	`id`, `name`, `account_id`, `amount`, `stage_name`, `close_date`, `campaign_id`, `created_date`, `is_closed`, `is_won`

After verifying that the data is present, you're ready to start transforming it using this dbt package.

📋 Requirements

Software versions

dbt Core >= 1.0.0 (tested up to 1.8.x)
Python >= 3.7 (required for dbt Core)

Supported data warehouses

BigQuery (primary support) - All features tested and optimized

🚀 Quick start

Step 1: install the package

Add to your packages.yml:


packages:

- git: "https://github.com/windsor-ai/dbt-bigquery-package-for-salesforce.git"
revision: main

Step 2: install dependencies


dbt deps

Step 3: configure source tables

Define your Salesforce source tables in models/sources.yml:


version: 2

sources:

- name: salesforce
description: "Raw Salesforce data tables"
tables:
    - name: campaigns
description: "Salesforce campaigns data"
    - name: leads
description: "Salesforce leads data"
    - name: contacts
description: "Salesforce contacts data"
    - name: campaign_members
description: "Campaign member associations"
    - name: opportunities
description: "Sales opportunities data"

Step 4: run the models



# Run all models

dbt run

# Run specific layers

dbt run --select +stg_salesforce    \# Staging only
dbt run --select +int_salesforce    \# Staging + Intermediate
dbt run --select +salesforce        \# All models

# Run tests

dbt test

🏗️ Package architecture

This package follows dbt best practices with a threenlayer architecture for scalable, maintainable data transformations:

Staging models (`stg_salesforce__*`)

Purpose: Clean and standardize raw Salesforce data from Windsor.ai integration

Transformations applied:

Data type standardization and safe casting
Consistent field naming conventions
Basic data validation and quality checks
Null value handling and default assignments
Deduplication where necessary

Models in this layer:

stg_salesforce__campaigns - Campaign master data
stg_salesforce__leads - Lead information and status
stg_salesforce__contacts - Contact details and account relationships
stg_salesforce__campaign_members - Campaign membership associations
stg_salesforce__opportunities - Sales opportunity pipeline data

Intermediate models (`int_salesforce__*`)

Purpose: Business logic, calculations, and relationship modeling

Transformations applied:

Lead journey progression tracking with timing analysis
Campaign performance metric calculations
Multi touch attribution logic implementation
Conversion funnel analysis
Historical trend calculations

Models in this layer:

int_salesforce__lead_journey - Lead lifecycle and conversion tracking
int_salesforce__campaign_performance - Campaign metrics and ROI calculations
int_salesforce__contact_touchpoints - Multi touch interaction history

Marts models (`salesforce__*`)

Purpose: Analytics ready tables optimized for business intelligence and reporting

Optimizations:

BigQuery partitioning and clustering for query performance
Executive dashboard ready data structures
Pre calculated KPIs and business metrics
Dimensional modeling for BI tool consumption

Models in this layer:

salesforce__campaign_lead_funnel - Complete funnel analysis with conversion metrics
salesforce__campaign_attribution_summary - Multi touch attribution reporting

📊 Models reference

Model	Layer	Description	Grain
`stg_salesforce__campaigns`	Staging	Cleaned and standardized campaign master data with consistent field naming and data types	One row per campaign
`stg_salesforce__leads`	Staging	Standardized lead information with safe casting and null handling	One row per lead
`stg_salesforce__contacts`	Staging	Clean contact records with account relationships and contact details	One row per contact
`stg_salesforce__campaign_members`	Staging	Campaign membership associations linking campaigns to leads/contacts	One row per campaign member
`stg_salesforce__opportunities`	Staging	Opportunity pipeline data with stage information and amounts	One row per opportunity
`int_salesforce__lead_journey`	Intermediate	Lead progression tracking with conversion timing and status changes	One row per lead with journey metrics
`int_salesforce__campaign_performance`	Intermediate	Campaign performance metrics including costs, responses, and conversion rates	One row per campaign with aggregated metrics
`int_salesforce__contact_touchpoints`	Intermediate	Contact interaction history across all campaigns and touchpoints	One row per contact campaign interaction
`salesforce__campaign_lead_funnel`	Marts	Complete funnel analysis from campaigns through leads to opportunities with conversion metrics	One row per campaign with full funnel data
`salesforce__campaign_attribution_summary`	Marts	Multi touch attribution reporting showing campaign influence on pipeline and revenue	One row per campaign with attribution metrics

🛠 How to use this dbt package

Configure your dbt_project.yml:

vars:  
  # Date range for processing (adjust as needed)
  start_date: '2020-01-01'
  end_date: '2024-12-31'
  
  # Currency settings
  target_currency: 'USD'
  
  # Data quality filters
  exclude_test_campaigns: true
  exclude_deleted_records: true

Make sure these source tables are available in your BigQuery project:

campaigns
leads
contacts
campaign_members
opportunities

Run the models:

# Run all models  
dbt run

# Run specific layers  
dbt run --select +stg_salesforce    # Staging only  
dbt run --select +int_salesforce    # Staging + Intermediate  
dbt run --select +salesforce        # All models

# Run tests  
dbt test

⚙️ Configuration options

Custom field mapping

Adapt to your Salesforce org schema by overriding field mappings in dbt_project.yml:

vars:
# Campaign field mappings
campaigns_id_field: 'campaign_id'
campaigns_name_field: 'campaign_name'
campaigns_type_field: 'campaign_type'
campaigns_status_field: 'campaign_status'
campaigns_actual_cost_field: 'campaign_actual_cost'
campaigns_budgeted_cost_field: 'campaign_budgeted_cost'

# Lead field mappings
leads_id_field: 'lead_id'
leads_email_field: 'lead_email'
leads_status_field: 'lead_status'
leads_source_field: 'lead_lead_source'
leads_converted_date_field: 'lead_converted_date'
leads_converted_contact_id_field: 'lead_converted_contact_id'

# Contact field mappings  
contacts_id_field: 'contact_id'
contacts_email_field: 'contact_email'
contacts_account_id_field: 'contact_accountid'
contacts_created_date_field: 'contact_createddate'

# Campaign Member field mappings
campaign_members_id_field: 'campaignmember_id'
campaign_members_campaign_id_field: 'campaignmember_campaign_id'
campaign_members_lead_id_field: 'campaignmember_lead_id'
campaign_members_contact_id_field: 'campaignmember_contact_id'

# Opportunity field mappings
opportunities_id_field: 'opportunity_id'
opportunities_name_field: 'opportunity_name'
opportunities_amount_field: 'opportunity_amount'
opportunities_stage_name_field: 'opportunity_stage_name'
opportunities_campaign_id_field: 'opportunity_campaign_id'

Picklist value configuration

Configure picklist values to match your Salesforce org:


vars:
salesforce_campaign_types:
- 'Email'
- 'Webinar'
- 'Trade Show'
- 'Social Media'

salesforce_lead_statuses:
- 'Open - Not Contacted'
- 'Working - Contacted'
- 'Qualified'
- 'Unqualified'

Schema configuration

Customize output schemas:


models:
salesforce_campaign_funnel:
+schema: my_salesforce_schema
staging:
+schema: my_staging_schema
intermediate:
+schema: my_intermediate_schema

🔧 Utility macros

The package includes helper macros for data transformations:

clean_boolean() - Convert string booleans to proper boolean type
clean_email() - Standardize and validate email addresses
safe_date_parse() - Robust date parsing with error handling
calculate_days_between() - Date difference calculations
clean_currency() - Numeric/currency field validation
clean_phone() - Phone number standardization

🧪 Data quality & testing

Built in data quality tests include:

Uniqueness: Primary key constraints on all models
Referential integrity: Foreign key relationships between tables
Data validation: Email format, date ranges, picklist values
Business logic: Conversion funnel consistency checks
Completeness: Required field validation

Run tests with:


dbt test
dbt test --select stg_salesforce  \# Test staging models only

📈 Use cases

Perfect for organizations looking to:

Measure marketing ROI across Salesforce campaigns
Optimize lead conversion with detailed funnel analysis
Understand attribution across multiple touchpoints
Build executive dashboards with campaign performance
Track sales pipeline from marketing source to closed won

📚 Additional resources

Package capabilities: Review analysis/docs/package_capabilities.md for feature documentation
Field mapping: Review analysis/docs/field_mapping.md for field documentation
Macros documentation: Review analysis/docs/macros_documentation.md for detailed macro usage and examples
Source documentation: Review models/staging/salesforce/sources.yml for field definitions
Model documentation: Check schema.yml files in each layer for model and column documentation
Salesforce to BigQuery integration documentation: Read this guide https://windsor.ai/connect/salesforce-google-bigquery-integration/ for available integration methods

🤝 Contributing

We welcome contributions! Please:

Fork the repository
Create a feature branch
Add tests for any new functionality
Submit a pull request with clear description

📝 License

This project is licensed under the MIT License see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
analysis/docs		analysis/docs
macros		macros
models		models
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE.md		LICENSE.md
README.md		README.md
dbt_project.yml		dbt_project.yml
package-lock.yml		package-lock.yml
packages.yml		packages.yml

Folders and files

Latest commit

History

Repository files navigation

dbt-BigQuery Package for Salesforce Campaign Funnel Data

🚀 Features of this dbt package:

What does this dbt package do?

⚙️ Prerequisites:

✅ Required BigQuery tables

📋 Requirements

Software versions

Supported data warehouses

🚀 Quick start

Step 1: install the package

Step 2: install dependencies

Step 3: configure source tables

Step 4: run the models

🏗️ Package architecture

Staging models (stg_salesforce__*)

Intermediate models (int_salesforce__*)

Marts models (salesforce__*)

📊 Models reference

🛠 How to use this dbt package

Configure your dbt_project.yml:

⚙️ Configuration options

Custom field mapping

Picklist value configuration

Schema configuration

🔧 Utility macros

🧪 Data quality & testing

📈 Use cases

📚 Additional resources

🤝 Contributing

📝 License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Staging models (`stg_salesforce__*`)

Intermediate models (`int_salesforce__*`)

Marts models (`salesforce__*`)

Packages