AMLS is a 5 ECTS master's course at Graz University of Technology, which is given at Aalborg University in reduced form as a two-day blocked course. This course covers the architecture and essential concepts of modern ML systems for both local and large-scale machine learning (ML). These architectures include systems for data-parallel execution (e.g., Spark, Dask, SystemDS), parameter servers and similar distribution strategies (e.g., TensorFlow, MXNet, PyTorch), ML lifecycle systems, and the integration of ML into database systems. The covered topics include both a microscopic view of internal compilation, execution, and data management techniques and a macroscopic view of entire ML pipelines.
In detail, the course covers the following topics, which also reflect the timeline. The individual lectures will take place August 29-30, 8am-5pm, in room 0.2.13 (Selma Lagerlöfs Vej 300, 9220 Aalborg).
A: ML Lifecycle Systems
B: ML System Internals
The lectures are accompanied by mandatory programming projects or exercises. For this blocked course, we recommend the exercise, which touches upon many aspects of ML pipelines. Alternatively, we also allow programming projects in Apache SystemDS (an open source ML system for the end-to-end data science lifecycle) or DAPHNE (an open and extensible system infrastructure for integrated data analysis pipelines, OSS release 03/2022).
Exercise details and a list of project proposals are available here: