作者:李建平
链接:https://www.zhihu.com/question/516672871/answer/2350330143
来源:知乎
著作权归作者所有。商业转载请联系作者获得授权,非商业转载请注明出处。
Nan Jiang 课程主页提到的课程资料,下面这些老师应该也是做RL的
There are also many related courses whose material is available online. Here is an incomplete list (not in any particular order; list from 2019 and has not been updated since then):
R. Srikant. UIUC ECE 586.
Ron Parr. Duke CompSci 590.2.
Ben Van Roy. Stanford MS&E 338.
Ambuj Tewari and Susan Murphy. U Michigan STATS 710.
Susan Murphy. Harvard Stat 234.
Alekh Agarwal and Alex Slivkins. Columbia COMS E6998.001.
Daniel Russo. Columbia B9140-001.
Shipra Agrawal. Columbia IEOR 8100.
Emma Brunskill CMU 15-889e.
Philip Thomas. U Mass CMPSCI 687.
Michael Littman. Brown CSCI2951-F.
列一下在蒙特利尔 / Mila 的一些研究者:
McGill
- Joelle Pineau (McGill, FAIR) 现任FAIR的老大
- Doina Precup (McGill, DeepMind) Rich Sutton的亲传,在DeepMind内部级别也很高,做过很多hierarchical RL / option / semi-MDP 方面的早期工作
UdeM
Google Brain
Microsoft Research
Harm van Seijen
Remi Tachet des Combes
Romain Laroche
(如果想到更多的,之后再来补充)
作者:Narsil
链接:https://www.zhihu.com/question/516672871/answer/2740525392
来源:知乎
著作权归作者所有。商业转载请联系作者获得授权,非商业转载请注明出处。