Machine Learning B (MLB)
NDAK22001U - SCIENCE
Passed: 91%, Average grade: 8.1, Median grade: 10
Description
The course is a continuation of the Machine Learning A course and provides deeper theoretical foundations of machine learning, together with a number of advanced, theoretically grounded learning techniques. A tentative list of topics includes:
- Basics in Optimization Theory
- Basic properties of functions: convexity, Lipschitzness, gradients, subgradients, etc.
- Constrained optimization and the method of Lagrange multipliers
- Stochastic Gradient Descent (SGD)
- Convergence proof for SGD
- Alternating optimization methods
- Basics of Information Theory
- Entropy
- Relative entropy (the Kullback-Leibler divergence)
- The method of types
- kl inequality for concentration of measure
- Advanced techniques for analysing generalisation power of learning algorithms
- Vapnik-Chervonenkis (VC) analysis
- VC analysis of SVMs
- VC lower bound
- PAC-Bayesian analysis
- PAC-Bayesian analysis of majority vote
- Bernstein-type concentration inequalities, with applications to analysis of learning algorithms
- Kernel Methods
- Kernels and RKHS
- SVMs
- Ensemble classifiers and weighted majority vote
- Boosting technique
- AdaBoost
- XGBoost
- Bayesian inference
- Basic concepts
- Difference between Bayesian and frequentist views
WARNING: If you have not taken DIKU's Machine Learning A course, please check the "Recommended Academic Qualifications" box below carefully. Machine learning courses given elsewhere do not necessarily prepare you well for this course, because DIKU's machine learning courses have a stronger theoretical component than the average machine learning course offered elsewhere. Taking the course is not advised if you do not meet the recommended academic qualifications.
At course completion, the successful student will have:
Knowledge of
- advanced understanding of the concept of generalisation;
- advanced tools for analysis of generalisation power of machine learning algorithms;
- the mathematical foundations of selected advanced machine learning algorithms.
Skills in
- deriving advanced generalisation bounds for expected prediction quality;
- applying advanced linear and non-linear techniques for classification and regression;
- implementing selected advanced machine learning algorithms;
- visualising and evaluating results obtained with machine learning techniques;
- using software libraries for solving machine learning problems.
Competences in
- recognising and describing possible applications of machine learning;
- formalising and rigorously analysing machine learning problems;
- comparing, appraising and selecting machine learning methods for specific tasks;
- solving real-world data mining and pattern recognition problems by using machine learning techniques.
Recommended qualifications
It is assumed that students have successfully passed the Machine Learning A course. Machine learning courses given elsewhere do not necessarily prepare you well for this course. Please check the self-preparation assignment at https://sites.google.com/diku.edu/machine-learning-courses/mlb.
The course requires strong mathematical skills and a background corresponding to that obtained in the BSc in Machine Learning and Data Science. In particular:
1. Knowledge of Linear Algebra corresponding to the course Lineær algebra i datalogi (LinAlgDat).
2. Knowledge of Calculus corresponding to Introduktion til matematik i naturvidenskab (MatintroNat) or Matematisk analyse og sandsynlighedsteori i datalogi (MASD).
3. Knowledge of Probability Theory corresponding to Sandsynlighedsregning og statistik (SS), Grundlæggende statistik og sandsynlighedsregning (GSS), or Matematisk analyse og sandsynlighedsteori i datalogi (MASD) and Modelling and Analysis of Data (MAD).
4. Knowledge of Discrete Mathematics corresponding to Diskret matematik og formelle sprog (DMFS), Diskret Matematik og Algoritmer (IDMA), or Diskret Matematik og algoritmer (DMA).
5. Knowledge of programming corresponding to Programmering og problemløsning (PoP) and experience with programming in Python.
Coordinators
Nirupam Gupta
nigu@di.ku.dk
Exam
Continuous Assessment
Course Info
Department(s)
- Computer Science
Workload
Lectures: 36h
Preparation: 8h
Theory Exercises: 85h
Practical Exercises: 77h
Total: 206h