ML 1

Machine learning 1

Description: This course sets out the general framework of machine learning, allowing you to situate the different approaches in the field. It covers the notions of data pre-processing, an introduction to statistical learning theory (risks, overlearning, convex proxies, regularization), the difference between frequentist and Bayesian approaches, supervised, unsupervised, semi-supervised and reinforcement learning paradigms. Some approaches are detailed (Kernel methods, SVM, Boosting, Bagging, Decision trees…).

Learning outcomes: At the end of this course, students will be able to recognize the different classes of algorithms in the landscape of the many methods available on the shelf. They will also have the statistical notions that will enable them to make reasoned use of these methods, thus avoiding a black-box approach with blind parameter testing.

Means: The courses and practical work are given by Hervé Frezza-Buet, Arthur Hoarau, Jérémy Fix. The courses present theoretical aspects, mathematical proofs, but are also illustrated by demonstrations of algorithms. The practical work will be done in Python, using sickit-learn, in pairs.

Evaluation methods: 2h written test, can be retaken.

Evaluated skills:

Research and Development
Development

Course supervisor: Hervé Frezza-Buet

Geode ID: 3MD1540

CM:

Datasets and learning (1.5 h)
Frequentist, Bayesian, evaluation (1.5 h)
Risks (1.5 h)
C-SVC, Lagrange formulation (1.5 h)
Kernels, numerical resolution (1.5 h)
SVMs for regression, unsupervized learning, nu-versions of SVMs. (1.5 h)
Arbres de décision (1.5 h)
Bagging (1.5 h)
Boosting (1.5 h)

TP:

Data Science en Python (3.0 h)
Arbres de décision (3.0 h)
Bagging (3.0 h)
Forêts aléatoires (3.0 h)
TP+ 1/2 (4.0 h)
TP+ 2/2 (4.0 h)