Course Description
TL;DR:
-
This course requires/expects absolutely no background in coding
-
You’ll learn to use ChatGPT and similar tools to write code
-
You’ll be writing code yourself every week, in class and on homeworks
-
You can use ChatGPT on homeworks and on the exams
-
Exams will be in-class and 2 hours (midterm in week 5, final during finals week)
Let’s get this out of the way: A lot of code today is written by ChatGPT. This isn’t a bad thing per se, and it can be a useful adjunct. But ChatGPT is not error-free, so knowing something about coding is still important. In this course, you’ll learn enough computer science, statistics, data analysis, and, yes, generative AI to do data analysis at your job.
We’ll use two of the most popular programming languages today, Python and SQL. No prior experience in coding or data analytics is required. If you have any Python background, let me know and we’ll work on finding more challenging practice problems.
You will be writing code every single week. If that scares you a little, that’s ok, that’s what the instructor and TAs are here to help with! We will carry out analytics, including data cleaning, computing summary statistics, basic visualizations and basic linear regression. No math or statistics background is required, we’ll learn everything we need.
This course could be taken as a preparation for Business Statistics (41000), Applied Regression (41100), and Advanced Decision Models with Python (36109). Understanding how to work with data will also be helpful in other coursework even if programming is not required, such as Healthcare Data Analytics (40205).
Learning Outcomes:
-
Write basic code in Python and SQL to carry out fundamental data analysis tasks like data import, data cleaning, and data visualization
-
Prompt ChatGPT and similar genAI tools to optimize your coding workflow
-
Define data analytics fundamentals to enable you to talk intelligently with programmers and analysts on your team