Course Information
Course: DS 4400 — Machine Learning and Data Mining I
Term: Spring 2026
Instructor: Zohair Shafi
Email: shafi.z [at] northeastern [dot] edu
Meeting Times: Monday and Wednesday | 02:50 PM - 04:30 PM
Location: Snell 033
Course Description
This course provides a broad introduction to machine learning with a focus on fundamental algorithms for supervised learning. Topics include regression, linear and non-linear classification, ensemble methods, and deep learning. Students will gain practical experience implementing algorithms, evaluating models, and understanding the trade-offs involved in real-world applications.
The course also introduces ethical considerations in machine learning, including fairness, transparency, and responsible use of data and models.
Prerequisites
- Probability and statistics
- Calculus
- Linear algebra
- Programming experience in Python or a similar language
Textbook and Resources
Primary text: An Introduction to Statistical Learning with Applications in Python (ISL)
Grading
- Assignments — 50% (12.5% x 8 assignments)
- Midterm Exam — 20%
- Final Exam — 25%
- Class Participation — 5%
Tentative Schedule
| Week | Date | Topic | Readings / Notes |
|---|---|---|---|
| 1 | Wednesday - Jan 7th | Course overview, syllabus, and introduction to machine learning | ISL Chapters 1–2 |
| 1 | Monday - Jan 12th | Probability and linear algebra review | Supplemental review notes |
| 2 | Wednesday - Jan 14th | TBD | TBD |
| 2 | Monday - Jan 19th | TBD | TBD |
| 3 | Wednesday - Jan 21st | TBD | TBD |
| 3 | Monday - Jan 26th | TBD | TBD |
| 4 | Wednesday - Jan 28th | TBD | TBD |
| 4 | Monday - Feb 2nd | TBD | TBD |