Skip to Content
Home

Data Analytics Cookbook

Practical Data Analysis with BigQuery & Pandas

Learn how to solve the same problems using two approaches: SQL and PythonπŸš€ Get Started in 5 Minutes

Build Dashboards Like These!

πŸ“Š E-Commerce Analytics Dashboard

After completing this curriculum, you’ll be able to build a comprehensive business intelligence dashboard covering sales analysis, customer segmentation, marketing performance, and churn prediction.

πŸ“Š KPI CardsπŸ—ΊοΈ World MapπŸ‘₯ RFM AnalysisπŸ“ˆ Cohort Retention🚨 Churn PredictionπŸ§ͺ A/B TestingπŸ“‰ Hypothesis TestingπŸ“ Regression Analysis

🎯 View Live Demo

πŸ’° Finance Analytics Dashboard

The core of financial data analysis! Learn analytical techniques used in financial industry practice, from stock price forecasting to portfolio optimization and risk analysis.

πŸ“ˆ Stock AnalysisπŸ’Ό Portfolio⚠️ Risk AnalysisπŸ“… Time Series Forecasting

πŸ’° View Finance Demo

πŸ’Ž Premium Starter Kit

Includes fully executable Jupyter Notebooks, exercise solutions, and practical utility functions. Learn advanced topics such as memory optimization, Copy-on-Write, and large-scale data processing.

πŸ““ Jupyter Notebooksβœ… Exercise SolutionsπŸ”§ Practical UtilitiesπŸ“Š 85 Interview Questions

πŸ’Ž View Interview Questions

Choose Your Learning Track

πŸ—„οΈ SQL Track

SQL-based data analysis using BigQuery. Ideal for large-scale data processing and analysis in cloud environments.
  • Run directly in BigQuery Console
  • Serverless large-scale processing
  • Standard SQL syntax
Start SQL Track β†’

🐼 Pandas Track

Local data analysis using Python Pandas. Ideal for flexible data manipulation and building analysis pipelines.
  • Start immediately with CSV files
  • Leverage the rich Python ecosystem
  • Flexible data transformations
Start Pandas Track β†’

Common Sections

πŸ“Š Visualization

Data visualization with Matplotlib, Seaborn, and PlotlyGo β†’

πŸ“ˆ Statistical Analysis

Hypothesis testing, regression analysis, A/B testingGo β†’

πŸ€– Machine Learning

Clustering, predictive models, time series analysisGo β†’

Datasets Used

This Cookbook is based on Google BigQuery’s thelook_ecommerce public dataset.

TableDescriptionKey Columns
src_ordersOrder informationorder_id, user_id, created_at, status
src_order_itemsOrder itemsorder_id, product_id, sale_price
src_productsProduct informationproduct_id, category, brand, retail_price
src_usersCustomer informationuser_id, age, gender, country
src_eventsWeb eventssession_id, user_id, created_at
events_augmentedSession data (augmented)channel_key, device_key, landing_page
cs_tickets_dummyCS ticketsticket_id, issue_type, satisfaction_score
mkt_campaigns_dummyMarketing campaignscampaign_id, channel_key, budget

Quick Start

πŸš€ Get Started in 5 Minutes

  1. BigQuery Setup or Local Environment Setup
  2. Choose your track (SQL or Pandas)
  3. Follow the first recipe

Difficulty Guide

LevelDescription
BeginnerBasic syntax, simple aggregations
IntermediateJoins, subqueries, window functions
AdvancedComplex analysis, optimization, ML integration
Last updated on

πŸ€–AI λͺ¨μ˜λ©΄μ ‘μ‹€μ „μ²˜λŸΌ μ—°μŠ΅ν•˜κΈ°