The SDM & RL core technology cluster provides a network for researchers working on sequential decision-making problems.

We, the SDM & RL cluster, focus on exploring the fascinating field of sequential decision-making. We are interested in developing algorithms and models that enable intelligent agents to make optimal decisions in dynamic and uncertain environments. Most of us research in the direction of applied or foundational (deep) reinforcement learning, while another large subset of us works with robotics and control theory.

Goal: The main purpose of the cluster is to provide a network for practitioners and researchers in the SDM & RL field, to advance our understanding of the fundamental principles underlying intelligent, sequential decision-making, to foster scientific discussion and collaboration, and to exchange practical knowledge.

Join the Cluster

The cluster uses the public Slack channel:

#ctc_sequential_decision_making_and_reinforcement_learning

To get started, simply fill out the cluster registration form and join us on Slack!

Cluster Activities

With respect to the goal of the cluster, we meet bi-weekly via zoom and typically engage in one of the following activities:

    • Research discussions: Cluster members are encouraged to present their research in SDM & RL, engaging in valuable scientific discourse.
    • Paper discussions: The cluster discusses SOTA papers from the field. Typically this involves one person presenting a summary of the paper, followed by a discussion.
    • Practical knowledge exchange: The cluster discusses implementational details, hyperparameters, and heuristics associated with the complex code underlying sequential decision-making problems and algorithms.
    • External speaker invitation: The cluster occasionally invites external (potential industrial) speakers to talk about their SDM & RL research.

Recent Publications from Cluster Members

A selection of the latest research published by members of our cluster:

Active Cluster Members

Keywords: Non-linear independent component analysis, Causal representation learning

Affiliation: Chalmers University of Technology

Website: https://selozhd.github.io/

Keywords: Autonomous anti-drone systems

Affiliation: KTH Royal Institute of Technology

Website: https://www.kth.se/profile/alesr

Keywords: Autonomous Vehicles

Affiliation: Chalmers University of Technology

Website: https://www.chalmers.se/en/persons/pathare/

Keywords: RL (offline, online, o2o), Multi-task Transfer Learning, Constrained RL, in-context RL

Affiliation: Örebro University

Website: https://www.finnrietz.dev/

Keywords: Heterogeneous Multi-Agent Reinforcement Learning, Safe RL

Affiliation: Luleå University of Technology

Website: https://www.ltu.se/en/staff/g/gabriele-calzolari

Keywords: Autonomous drug design, bandits, RL, active learning/online learning

Affiliation: Chalmers University of Technology

Website: https://www.chalmers.se/personer/hamsven/

Keywords: Goal-conditioned RL, Vision, Pose Estimation, Robotics, Sim2Real

Affiliation: Lund University

Website: https://portal.research.lu.se/sv/persons/hampus-åström/

Keywords: Multi-armed bandits, Gaussian process bandits, Bayesian optimization.

Affiliation: Chalmers University of Technology

Website: https://www.chalmers.se/en/persons/jacksa/

Keywords: Agentic systems for drug discovery

Affiliation: Uppsala University and AstraZeneca

Website: https://www.uu.se/en/contact-and-organisation/staff?query=N25-892

Keywords:

Affiliation: Umeå University

Website: https://www.umu.se/en/staff/marti-ejarque/

Keywords: Sequential descision making, classical planning

Affiliation: Linköping University

Website: https://martin36.github.io/

Keywords: Transfer and imitation learning, vision-based navigation for UAVs

Affiliation: Örebro University

Website: https://meraccos.com/

Keywords: Game theory and MARL for drones

Affiliation: Chalmers University of Technology and SAAB

Website: https://www.chalmers.se/en/persons/mikape/

Keywords: Sequential decision making, Theoretical guarantees

Affiliation: KTH Royal Institute of Technology

Website: https://www.kth.se/profile/bongole

Keywords: Curiosity driven exploration

Affiliation: Örebro University and Nexer

Website: https://www.oru.se/personal/samuel_blad

Keywords: DRL, RL in games, generalization, scalability

Affiliation: KTH Royal Institute of Technology

Website: https://www.kth.se/profile/sarakari

Keywords: Zero-shot RL, theoretical guarantees for RL

Affiliation: KTH Royal Institute of Technology

Website: https://www.kth.se/profile/stesto

Keywords: Learning for control, System identification

Affiliation: KTH Royal Institute of Technology

Website: https://www.kth.se/profile/yinwang

Keywords: Human motion prediction, dynamics mapping

Affiliation: Örebro University

Website: https://www.oru.se/personal/yufei_zhu

Current Cluster Leader

Stefan Stojanovic

PhD student, Division of Decision and Control Systems, KTH