AMLS is a 6 ECTS module, applicable to the master study courses computer science, computer engineering, information systems management, and electrical engineering, as well as the study areas data and software engineering, cognitive systems, and distributed systems and networks. Machine learning (ML) applications profoundly transform our lives, and many domains such as health care, finance, media, transportation, production, and information technology itself. In a narrow sense, ML systems are software systems underpinning theses ML applications. However, in a broad sense, ML systems comprise the entire systems from ML applications, over the compiler/runtime stack, to the underlying heterogeneous hardware devices.
This module covers the architecture and essential concepts of modern machine learning (ML) systems for both local and large-scale machine learning. These architectures include systems for data-parallel execution, parameter servers, ML lifecycle systems, and the integration of ML into database systems. The covered topics focus both on a microscopic view of internal compilation, execution, and data management techniques, as well as a macroscopic view of end-to-end ML pipelines. In detail, the module covers the following topics which also reflect the lecture calendar (with a separate 90-120min lecture per topic):
In detail, the course covers the following topics, which also reflects the course calendar. All slides will be made available prior to the individual lectures, which take place Thursday, 4pm-6pm in A 053 and virtually via zoom (call-in: first lecture, other lectures). Furthermore, we also offer weekly office hours, which take place Tuesday, 3pm-4.30pm in TEL 0811 and virtually via zoom (call-in: office hour, starting May 09)
A: Overview and ML System Internals
B: ML Lifecycle Systems
The lectures are accompanied by mandatory programming projects
(to the extend of 3 ECTS, i.e, roughly 80 working hours), preferably in
Apache SystemDS
(an open source ML system for the end-to-end data science lifecycle), or
DAPHNE
(an open and extensible system infrastructure for integrated data analysis pipelines).
A list of project proposals and details on alternative exercises
(programming contest or ML pipeline) are available here: