Exploration and Evaluation of Reinforcement Learning in Production

Tuesday, April 9, 2019
10:30 AM 11:00 AM 10:30 11:00

Lindholmen Conference Hall 5 Lindholmspiren Västra Götalands län, 417 56 Sweden (map)

Google Calendar ICS

Abstract

The typical assertion is that RL does not work in production systems due to the exploratory nature of agents, but is it possible to mitigate some of these assumptions? This talk will be about issues with exploration and evaluation in RL production systems, but also about mitigation in terms of sample-efficiency (for ex. through transfer or distributed/federated learning), safe exploration, and off-policy evaluation.

Jesper Derehag

Senior Data Scientist @ Ericsson

Jesper is a senior engineer from Ericsson, with a career spanning software design, architecture, research, and, now, machine learning. At Ericsson, he has been busy working on using machine learning to improve processes, in areas such as fault prediction and statistical analysis of code complexity. Most recently he has been using reinforcement learning in production specifically for auto-scaling cloud resources. These experiences will be used as a basis for the talk at the conference.

Source:: https://www.linkedin.com/in/jesper-derehag/

Exploration and Evaluation of Reinforcement Learning in Production

Abstract

Jesper Derehag

Senior Data Scientist @ Ericsson

© 2023 GAIA

Action