Course Description and Goals:

Statistical learning is a field that teaches students how to analyze and interpret data by applying statistical methods and machine learning algorithms to uncover patterns, make predictions, and gain insights from data. The syllabus includes:

• Statistical and machine learning methods, including linear and polynomial regression, logistic regression, and linear discriminant analysis.

• Model validation techniques such as cross-validation and bootstrap, model selection, and regularization methods (ridge and lasso).

• Nonlinear models, splines, and generalized additive models.

• Tree-based methods, including random forests and boosting.

• Support-vector machines and an introduction to causal inference.

• Unsupervised learning methods such as principal components analysis and clustering (k-means and hierarchical).

Examination Format: Report and Presentation (obligatory to receive participaation certificate)

General Information University: University of Duisburg-Essen

• Learning Platform: Moodle (Link: https://lehre.moodle.uni-due.de/course/view.php?id=1986)

• Hybrid Format: Yes, the course includes both in-person and online sessions via Zoom

• Study Program and Level: Master’s/PhD students

• Weekly Hours: The event consists of three block sessions (lecture with tutorial sessions) with two additional dates for tutorials:

• First session: 02.04. (9:00–14:00)

• Following sessions: 03.04. & 04.04. (9:00–17:00)

• Online session: 07.04.

• Final lecture: 25.04. (9:00–14:00)

• Language: German or English (depending on student preference)

Further information, including locations, and Zoom links, can be found on our homepage: Statistical Learning SS 25

Related Posts

Call for participation

Call for participation

Call for participation!

The Data Literacy Center Rhine-Ruhr (DKZ.2R) issues a call for participation in its “rent-an-expert” project! We offer support for ambitious research projects of PhD students and early postdocs dealing with Data Science and Artificial Intelligence, High Performance Computing and Simulation, and Research Data Management. As the DKZ.2R is funded by the German Federal Ministry of Education and Research (BMBF) as well as the EU, this offer is free of charge!

Read More
Documentation From User Experience

Documentation From User Experience

This post is a condensed version of a talk at our Data Compentcy College

If you regularly use scientific software written by others, or tried to replicate interesting research that relies on software, you have probably also invested weeks of work to solve a software problem or even given up on a software because of missing documentation. Finding a project that might be the solution to your problem and then failing to run the code is frustrating. Being unable to run a project you have built yourself years ago is even worse. Having experienced all those setbacks myself in the past I want to use this post to channel that frustration to fuel solutions for better documentation for our current and future projects.

Read More
How To: Good Scientific Practice

How To: Good Scientific Practice

“Scientific integrity forms the basis for trustworthy research”, so it says in the Guidelines for Safeguarding Good Research Practice of the DFG, the German Research Foundation. As a major funder of research in Germany the DFG, as well as many other funders of research in Germany and the European Union, requires researchers to follow a certain set of rules conducting their research. These rules are called “good scientific practice” and have to be followed by researchers to be viable for funding. According to the guidelines researchers are required to “document all information relevant to the production of a research result as clearly as is required by and is appropriate for the relevant subject area to allow the result to be reviewed and assessed”. But good scientific practice is not done by documenting your research. It also includes i.a. protecting the personality rights of your subjects and handling research data in an appropriate manner by e.g. “back(-ing) up research data and results made publicly available, as well as the central materials on which they are based and the research software used, by adequate means according to the standards of the relevant subject area, and retain them for an appropriate period of time.” This is where Research Data Management (RDM) comes in. Of course RDM is much more than just creating a backup of your data on a USB-Stick and handing it over to anyone asking for it. “Good scientific practice” in RDM follows the FAIR principles:

Read More