Powered by RND
PodcastTecnologiaTalkRL: The Reinforcement Learning Podcast
Ascolta TalkRL: The Reinforcement Learning Podcast nell'app
Ascolta TalkRL: The Reinforcement Learning Podcast nell'app
(6 613)(250 156)
Radio preferite
Sveglia
Sleep timer

TalkRL: The Reinforcement Learning Podcast

Podcast TalkRL: The Reinforcement Learning Podcast
Robin Ranjit Singh Chauhan
TalkRL podcast is All Reinforcement Learning, All the Time. In-depth interviews with brilliant people at the forefront of RL research and practice. Guests ...

Episodi disponibili

5 risultati 64
  • NeurIPS 2024 - Posters and Hallways 1
    Posters and Hallway episodes are short interviews and poster summaries.  Recorded at NeurIPS 2024 in Vancouver BC Canada.   Featuring  Jiaheng Hu of University of Texas: Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning  Skander Moalla of EPFL: No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO  Adil Zouitine of IRT Saint Exupery/Hugging Face : Time-Constrained Robust MDPs  Soumyendu Sarkar of HP Labs : SustainDC: Benchmarking for Sustainable Data Center Control  Matteo Bettini of Cambridge University: BenchMARL: Benchmarking Multi-Agent Reinforcement Learning  Michael Bowling of U Alberta : Beyond Optimism: Exploration With Partially Observable Rewards  
    --------  
    9:32
  • Abhishek Naik on Continuing RL & Average Reward
    Abhishek Naik was a student at University of Alberta and Alberta Machine Intelligence Institute, and he just finished his PhD in reinforcement learning, working with Rich Sutton.  Now he is a postdoc fellow at the National Research Council of Canada, where he does AI research on Space applications.  Featured References  Reinforcement Learning for Continuing Problems Using Average Reward Abhishek Naik Ph.D. dissertation 2024  Reward Centering Abhishek Naik, Yi Wan, Manan Tomar, Richard S. Sutton 2024   Learning and Planning in Average-Reward Markov Decision Processes Yi Wan, Abhishek Naik, Richard S. Sutton 2020  Discounted Reinforcement Learning Is Not an Optimization Problem Abhishek Naik, Roshan Shariff, Niko Yasui, Hengshuai Yao, Richard S. Sutton 2019  Additional References Explaining dopamine through prediction errors and beyond, Gershman et al 2024 (proposes Differential-TD-like learning mechanism in the brain around Box 4)  
    --------  
    1:21:40
  • Neurips 2024 RL meetup Hot takes: What sucks about RL?
    What do RL researchers complain about after hours at the bar?  In this "Hot takes" episode, we find out!  Recorded at The Pearl in downtown Vancouver, during the RL meetup after a day of Neurips 2024.  Special thanks to "David Beckham" for the inspiration :)  
    --------  
    17:45
  • RLC 2024 - Posters and Hallways 5
    Posters and Hallway episodes are short interviews and poster summaries.  Recorded at RLC 2024 in Amherst MA.   Featuring:  0:01 David Radke of the Chicago Blackhawks NHL on RL for professional sports  0:56 Abhishek Naik from the National Research Council on Continuing RL and Average Reward  2:42 Daphne Cornelisse from NYU on Autonomous Driving and Multi-Agent RL  08:58 Shray Bansal from Georgia Tech on Cognitive Bias for Human AI Ad hoc Teamwork  10:21 Claas Voelcker from University of Toronto on Can we hop in general?  11:23 Brent Venable from The Institute for Human & Machine Cognition on Cooperative information dissemination  
    --------  
    13:17
  • RLC 2024 - Posters and Hallways 4
    Posters and Hallway episodes are short interviews and poster summaries.  Recorded at RLC 2024 in Amherst MA.   Featuring:  0:01  David Abel from DeepMind on 3 Dogmas of RL  0:55 Kevin Wang from Brown on learning variable depth search for MCTS  2:17 Ashwin Kumar from Washington University in St Louis on fairness in resource allocation  3:36 Prabhat Nagarajan from UAlberta on Value overestimation  
    --------  
    4:52

Altri podcast di Tecnologia

Su TalkRL: The Reinforcement Learning Podcast

TalkRL podcast is All Reinforcement Learning, All the Time. In-depth interviews with brilliant people at the forefront of RL research and practice. Guests from places like MILA, OpenAI, MIT, DeepMind, Berkeley, Amii, Oxford, Google Research, Brown, Waymo, Caltech, and Vector Institute. Hosted by Robin Ranjit Singh Chauhan.
Sito web del podcast

Ascolta TalkRL: The Reinforcement Learning Podcast, Acquired e molti altri podcast da tutto il mondo con l’applicazione di radio.it

Scarica l'app gratuita radio.it

  • Salva le radio e i podcast favoriti
  • Streaming via Wi-Fi o Bluetooth
  • Supporta Carplay & Android Auto
  • Molte altre funzioni dell'app