Business Data Technologies

Wisconsin School of Business, University of Wisconsin - Madison, Fall 2021

Instructor: Emaad Manzoor (emanzoor[at]wisc.edu) Teaching Assistant: Yang Gao (ygao328[at]wisc.edu)
Office Hours: By appointment, at Grainger 4284B Office Hours: By appointment, at Grainger 4284A
Section Meeting Times, Locations, Canvas Sites: Format: Lecture (90m), Quiz (10m), Lab (50m)
    Section 010: Tue 2.30-5.00PM, Grainger 2335 (Canvas) Grading: HW (40%), Quizzes (20%), Project (40%)
    Section 011: Tue 5.40-8.10PM, Grainger 2520 (Canvas) Resources: Syllabus, Project, VM Setup
    Section 022: Tue 5.40-8.10PM, Zoom (Canvas) StackOverflow: Section 010, 011, 022

In this course, you will learn how to engineer and architect scalable, resilient, and extensible business data analytics pipelines with modern data management technologies. You will develop expertise in ingesting data from various sources, loading data into data lakes and warehouses, designing warehouse schemas, programmatically querying and analyzing warehoused data, and designing end-to-end data management architectures for both batch and streaming data.

Prerequisites: Proficiency in Python (by taking GEN BUS 705, for example)

Industry Partners:

Date Topic Resources Notes
Sep 14 Introduction & Project Discussion
Lab: VM setup, Linux Shell, Git
Linux Journey
Explain Shell
Learn Enough CLI
Pro Git
HW1 out (VM setup, Linux
shell, Git, data extraction)
Sep 21 Stages of Data Sophistication
Cloud Data Lakes on S3
Lab: APIs, Snowflake Setup, S3
Data Governance Webbook
Enterprise Data Lakes Book
S3 User Guide
Github REST API
 
Sep 28 Traditional & Modern Data
Warehouse Architectures
Lab: Snowflake Quickstart
DDIA Book Ch. 3
Databases Redbook
Snowflake Paper
C-Store Paper
 
Oct 05 Basic SQL
Lab: Basic SnowSQL
Learning SQL Book
SnowSQL Commands
SnowSQL Functions
StackOverflow Schema
StackOverflow Queries
HW1 due, HW2 out
(basic/advanced SQL)
Oct 12 Advanced SQL
Lab: Advanced SnowSQL
Learning SQL Book  
Oct 19 Performant SQL
Lab: PostgreSQL/psycopg
PostgreSQL Query
Optimization Book

Use The Index Luke
psycopg
 
Oct 26 Entity-Relationship Modeling
Schema Design

Lab: No Lab
  HW2 due, HW3 out
(schema design)
Nov 02 Normalization
Dimensional Modeling

Lab: Case Studies
Data Warehouse Toolkit  
Nov 09 Streaming Data Analytics
Lab: Kafka
DDIA Book Ch. 11 HW3 due
Nov 16 Network Data Analytics
Lab: neo4j
Graph Algorithms Book
Graph Databases Book
MMDS Book
NC&M Book
HW4 out
Nov 23 MapReduce
Lab: MapReduce
Text Processing
with MapReduce

DDIA Book Ch. 10
Project midterm
report due
Nov 30 Project Office Hours    
Dec 07 Project Presentations   Project presentation due
HW4 due
Dec 14 Project Presentations (if needed)