DATA 202 (Wrangling and Analytics), Fall 2023
Syllabus
RStudio
Notes
Class Meetings
Projects
Midterm Project
Final Project
Notes
Missing Data
pandas
Plotly Tips
Prayers for Data Science
Quarto Documents
scikit-learn
Tool References
Units
1: Introduction
Exercise 1: Warmup
Exercise 2: Bikeshare
Creating Quarto Documents
DATA 202 Week 1 Day 1: Welcome!
W02D1: Data, Questions, Tools
3: Visualization Design
Exercise 3: Visualization Design
Visualization 1: Principles
W3.2 Visualizing
4: Visualization 2
Exercise 4: Plot Types
Slides 4: Visualization Implementation
5: Wrangling 1
Exercise 5: Bikeshare Wrangling
Slides 5.1: Tabular Data with pandas
Slides 5.2: Wrangling Example
6: Wrangling 2
Exercise 6: Pivoting and Joining
Exercise 6: Pivoting and Joining - Added Notes
Joining Data
Tidying Data
Slides 7: Data Tidying and Visualizing
9: Supervised Learning 1
Exercise 9: Practice with Supervised Learning
Introduction to Supervised Learning
10: Trees
Exercise 10: Thresholds and Metrics
Trees
Regression and Classification Metrics
11: Model Zoo
Bayesian Networks: a short introduction
Exercise 11: Bayesian networks among other models
11.1 Model Types
12: Validation
Exercise 12: Logistic Regression, Regularization, and Cross-Validation
12.1 Linear Regression and Classification
13: LLMs
Exploring Language Models
13.1 LLMs
14: Unsupervised Learning, etc.
Exercise 13: Clustering
15.1 Interpreting and Explaining
14.1 Unsupervised Learning
Wrap-Up
Course Notes
Missing Data
Plotly Tips
Prayers for Data Science
Quarto Documents
Tool References
pandas
scikit-learn