This project demonstrates how to build a data analytics and visualization workflow using Google BigQuery and Looker Studio.
Using historical U.S. flight data from the Bureau of Transportation Statistics, we analyze flight delays and visualize insights through interactive dashboards.
- Transform raw datasets
- Create analytical views
- Build visual dashboards
- Generate actionable business insights
The objective of this lab is to build a data analytics pipeline using BigQuery and Looker Studio to analyze airline flight delay patterns.
Key goals include:
- Transforming raw flight data using BigQuery SQL views
- Connecting BigQuery datasets to Looker Studio
- Building interactive data visualizations and dashboards
- Analyzing flight delay trends across airlines
Key Fields
| Field | Description |
|---|---|
| FL_DATE | Flight date |
| UNIQUE_CARRIER | Airline carrier |
| ORIGIN | Departure airport |
| DEST | Destination airport |
| DEP_DELAY | Departure delay |
| ARR_DELAY | Arrival delay |
| DISTANCE | Flight distance |
- This project answers the following analytical questions:
- Which airlines experience the highest departure and arrival delays?
- What is the relationship between departure delay and arrival delay?
- What percentage of flights are on time vs delayed?
- How do delay thresholds (10, 15, 20 minutes) impact flight performance?
- Which airlines consistently experience higher average delays?
The workflow follows a Cloud Data Analytics pipeline. Raw Data → BigQuery Table → BigQuery Views → Looker Studio → Interactive Dashboard
| Component | Purpose |
|---|---|
| BigQuery | Data storage and SQL transformations |
| BigQuery Views | Analytical datasets for visualization |
| Looker Studio | Dashboard creation |
| Scatter / Pie / Bar Charts | Data insights and reporting |
- The raw flight dataset was transformed using SQL views.
CREATE OR REPLACE VIEW dsongcp.flights AS SELECT FlightDate AS FL_DATE, Reporting_Airline AS UNIQUE_CARRIER, Origin AS ORIGIN, Dest AS DEST, CAST(DepDelay AS FLOAT64) AS DEP_DELAY, CAST(ArrDelay AS FLOAT64) AS ARR_DELAY, DISTANCE FROM dsongcp.flights_raw;
- Flights delayed by 10,20,30 minutes
- The BigQuery dataset was connected to Looker Studio to build an interactive analytics dashboard.
Looker studio can be directly connected from Bigquery or Big query data tables can be accessed from Looker studio data sources connectivity.
- Visualizes the relationship between departure delay and arrival delay.
- Airlines with high departure delays typically also experience high arrival delays.
- calculated field was created to classify flights.
CASE WHEN ARR_DELAY < 15 THEN "ON TIME" ELSE "LATE" END
Pie Chart Example
- Compares average delays across airlines.
- Some airlines consistently experience higher delays than others.
- Date range filter
- Interactive charts
- Dynamic data exploration
- Real-time filtering
- Users can select a time period and instantly view updated analytics.
- Flights departing late are likely to arrive late.
- Some carriers have consistently higher delay averages.
- However, delay thresholds increase rapidly during peak travel periods.
- Flights delayed more than 15–20 minutes indicate operational bottlenecks.
- Monitor airline operational performance
- Identify delay patterns
- Compare airline reliability
- Improve scheduling decisions
- Airline operations teams
- Airport management
- Transportation analysts
- Aviation regulators
Google BigQuery Data warehouse and SQL analytics Looker Studio Data visualization SQL Data transformation Google Cloud Cloud analytics platform
- Add geographic flight route maps
- Implement predictive delay modeling
- Add real-time flight data ingestion
- Build a machine learning delay prediction model








