Course Information

Course: DS 4400 — Machine Learning and Data Mining I
Term: Spring 2026
Instructor: Zohair Shafi
Email: shafi.z [at] northeastern [dot] edu (Please make sure to add [sp26_ds4400] in the subject line)
Meeting Times: Monday and Wednesday | 02:50 PM - 04:30 PM
Location: Snell 033
Office Hours:

Course Description

This course provides a broad introduction to machine learning with a focus on fundamental algorithms for supervised learning. Topics include regression, linear and non-linear classification, ensemble methods, and deep learning. Students will gain practical experience implementing algorithms, evaluating models, and understanding the trade-offs involved in real-world applications.

The course also introduces ethical considerations in machine learning, including fairness, transparency, and responsible use of data and models.

Prerequisites

Textbook and Resources

Primary text: An Introduction to Statistical Learning with Applications in Python (ISL)
(If the link does not work, download the textbook here)

Additional weekly resources are provided in the schedule below

Grading

Tentative Schedule

Week Date Topic Notes, Readings and Additional Resources
1 Wednesday - Jan 7th
  • Course Overview
  • Syllabus
  • Introduction to Machine Learning
  • Probability Review
  • Linear Algebra Review
2 Monday - Jan 12th
  • Linear Algebra Review
  • Calculus Review
  • Simple Linear Regression
2 Wednesday - Jan 14th
  • Simple Linear Regression
  • Practical Issues
  • Bias-Variance Tradeoff
  • Feature Normalization
Homework 1 Posted
Due January 30th
11:59PM EST
3 Monday - Jan 19th No Class - University Closed
Martin Luther King Jr. Day
3 Wednesday - Jan 21st
  • Gradient Descent
4 Monday - Jan 26th Recap
4 Wednesday - Jan 28th
  • Classification and Metrics
Homework 2 Posted
Due February 15th
11:59PM EST
5 Monday - Feb 2nd
  • Classification Metrics
  • K-Nearest Neighbors
5 Wednesday - Feb 4th
  • K-Nearest Neighbors
  • Logistic Regression
6 Monday - Feb 9th
  • Logistic Regression
Homework 3 Posted
Due March 9th
11:59PM EST
6 Wednesday - Feb 11th
  • Linear Discriminant Analysis
7 Monday - Feb 16th No Class - University Closed
Presidents' Day
7 Wednesday - Feb 18th Midterm Review Class
8 Monday - Feb 23rd
  • Naive Bayes' Classifier
  • Decision Trees
8 Wednesday - Feb 25th Mid-Term - In Class
9 Monday - March 2nd Spring Break - No Class
9 Wednesday - March 4th Spring Break - No Class
10 Monday - March 9th
  • Decision Trees
  • Ensemble Methods
10 Wednesday - March 11th
  • Ensemble Methods
  • Intro to Deep Learning
11 Monday - March 16th TBD TBD
11 Wednesday - March 18th TBD TBD
12 Monday - March 23rd TBD TBD
12 Wednesday - March 25th TBD TBD
13 Monday - March 30th TBD TBD
13 Wednesday - April 1st TBD TBD
14 Monday - April 6th TBD TBD
14 Wednesday - April 8th TBD TBD
15 Monday - April 13th TBD TBD
15 Wednesday - April 15th TBD TBD