Teaching assistants: Varun Agrawal, Vinod Chincholi, Paarth Iyer
Evaluation:
Assignments 30-40%, quizzes and midsemester exam 20-30%, final exam 40%
Copying is fatal
Text and reference books:
Web Data Mining by Bing Liu, 2nd edition, Springer (2011).
Foundations of Data Science by Avrim Blum, John Hopcroft and Ravi Kannan
Machine Learning by Tom Mitchell.
C4.5: Programs for Machine Learning by Ross Quinlan.
Artificial Intelligence: A Modern Approach by Stuart J Russell and Peter Norvig, 3rd edition, Pearson (2016).
Hands-On Machine Learning with Scikit-Learn, Keras and Tensorflow by Aurélien Géron, 3rd edition, O'Reilly (2022)
Reinforcement Learning: An Introduction, by Richard S. Sutton and Andrew G. Barto, MIT Press, 2nd ed (2018)
Here is a tentative list of topics.
Supervised learning: Association rules, regression, decision trees, naive Bayes, SVM, classifier evaluation, expectation maximization, ensemble classifiers.
Unsupervised learning: Clustering, outlier detection, dimensionality reduction.
Text mining: Basic ideas from information retrieval, TF/IDF model, Page Rank, HITS
Other topics (if time permits): Probabilistic graphical models, Bayesian networks, Markov models, neural networks, ranking and social choice, …
Assignment 1, 6 March 2024, due 17 March 2024.
Assignment 2, 21 March 2024, due 31 March 2024.
Assignment 3, 9 April 2024, due 21 April 2024.
Lecture 1: 9 Jan 2024
(Class Notes (pdf)),
Slides (pdf))
Introduction, market-basket analysis, frequent itemsets, Apriori algorithm
Lecture 2: 11 Jan 2024
(Class Notes (pdf),
Slides (pdf))
Apriori algorithm, association rules, class association rules
Lecture 3: 16 Jan 2024
(Class Notes (pdf),
Slides (pdf))
Supervised learning, decision trees, impurity measures (entropy, Gini index)
Lecture 4: 18 Jan 2024
(Class Notes (pdf),
Slides (pdf))
Decision trees: information gain ratio, handling numeric attributes
Evaluating classifiers: training/test sets, confusion matrix
Lecture 5: 23 Jan 2024
(Class Notes (pdf),
Slides (pdf))
Decision Trees in Python
Linear Regression: loss functions, normal equation, gradient descent
Lecture 6: 25 Jan 2024
(Class Notes (pdf),
Slides (pdf))
Linear Regression: gradient descent, probabilistic justification for SSE loss
Polynomial regression, regularization, the non-polynomial case
Lecture 7: 30 Jan 2024
(Class Notes (pdf),
Slides (pdf))
Regression using decision trees
Handling overfitting in decision trees
Lecture 8: 1 Feb 2024
(Class Notes (pdf),
Slides (pdf))
Regression for Classification
Regression Trees in Python
Linear, polynomial and logistic regression in Python
Lecture 9: 8 Feb 2024
(Class Notes (pdf),
Slides (pdf))
Naïve Bayesian classifiers
Naïve Bayes text classification
Lecture 10: 13 Feb 2024
(Class Notes (pdf),
Slides (pdf))
Ensemble Classifiers: Bagging
Lecture 11: 15 Feb 2024
(Class Notes (pdf),
Slides (pdf))
Ensemble Classifiers: Boosting
Lecture 12: 20 Feb 2024
(Class Notes (pdf),
Slides (pdf))
Unsupervised learning: Clustering — K-Means, Hierarchical, Density-based
Unsupervised learning: Local density based outlier detection
Lecture 13: 22 Feb 2024
(Class Notes (pdf),
Slides (pdf))
Dimensionality reduction: PCA, manifold learning, locally linear embeddings
Lecture 14: 5 Mar 2024
(Class Notes (pdf),
Slides (pdf))
Expectation maximization and applications
Lecture 15: 7 Mar 2024
(Class Notes (pdf),
Slides (pdf))
Applications of unsupervised learning: semi-supervised learning, image segmentation
Linear Separators — Perceptrons
Lecture 16: 14 Mar 2024
(Class Notes (pdf),
Slides (pdf))
Linear Separators — SVMs
Kernel methods
Lecture 17: 19 Mar 2024
(Class Notes (pdf),
Slides (pdf))
Kernel methods
Neural networks: Multilayer perceptrons, sigmoid neurons, network architecture, universality
Lecture 18: 21 Mar 2024
(Class Notes (pdf),
Slides (pdf))
Neural networks: Backpropagation
Lecture 19: 26 Mar 2024
(Class Notes (pdf),
Slides (pdf))
Bayesian networks: basic definitions, semantics, exact inference
Lecture 20: 28 Mar 2024
(Class Notes (pdf),
Slides (pdf))
Bayesian networks: Conditional independence, D-separation
Lecture 21: 2 Apr 2024
(Class Notes (pdf),
Slides (pdf))
Bayesian networks: approximate inference, sampling
Introduction to Markov chains
Lecture 22: 4 Apr 2024
(Class Notes (pdf),
Slides (pdf))
Markov Chain Monte Carlo: Gibbs Sampling
Lecture 23: 16 Apr 2024
(Class Notes (pdf),
Slides (pdf))
Introduction to reinforcement learning, multi-armed bandits
Lecture 24: 18 Apr 2024
(Class Notes (pdf),
Slides (pdf))
Markov Decision Processes: Basic definitions and examples, policies and value functions, Bellman equation, optimal policies
Lecture 25: 23 Apr 2024
(Class Notes (pdf),
Slides (pdf))
Markov Decision Processes: Police evaluation, policy iteration, value iteration