Richard S. Sutton

Richard S. Sutton is a Canadian computer scientist. Currently, he is a distinguished research scientist at DeepMind and a professor of computing science at the University of Alberta. Sutton is considered[1] one of the founding fathers of modern computational reinforcement learning, having several significant contributions to the field, including temporal difference learning and policy gradient methods.

Richard S. Sutton
NationalityCanadian
CitizenshipCanadian
Alma materUniversity of Massachusetts Amherst
Stanford University
Known forTemporal difference learning, Dyna, Options, GQ(λ)
AwardsAAAI Fellow (2001)
President's Award (INNS) (2003)
Royal Society of Canada Fellow (2016)
Scientific career
FieldsArtificial Intelligence
Reinforcement Learning
InstitutionsUniversity of Alberta
ThesisTemporal credit assignment in reinforcement learning (1984)
Doctoral advisorAndrew Barto
Doctoral studentsDavid Silver, Doina Precup
Websiteincompleteideas.net

Life and Education

Richard Sutton was born in Ohio, and grew up in Oak Brook, Illinois, a suburb of Chicago.

Sutton received his B.A. in psychology from Stanford University in 1978 before taking an M.S. (1980) and Ph.D. (1984) in computer science from the University of Massachusetts Amherst under the supervision of Andrew Barto. His doctoral dissertation, Temporal Credit Assignment in Reinforcement Learning, introduced actor-critic architectures and "temporal credit assignment".[2]

Career

In 1984, Sutton was a postdoctoral researcher at the University of Massachusetts.

From 1985 to 1994, he was a principal member of technical staff in the Computer and Intelligent Systems Laboratory at GTE in Waltham, Massachusetts. In 1995, he returned to the University of Massachusetts as a senior research scientist.

From 1998 to 2002, Sutton worked at the AT&T Shannon Laboratory in Florham Park, New Jersey as principal technical staff member in the artificial intelligence department.

Since 2003, he has been a professor of computing science at the University of Alberta. He led the institution's Reinforcement Learning and Artificial Intelligence Laboratory until 2018.

While retaining his professorship, Sutton joined Deepmind in June 2017 as a distinguished research scientist and co-founder of its new Edmonton office.[2][3]

Sutton became a Canadian citizen in 2015 and renounced his US citizenship in 2017.

Selected publications

Awards and honors

Sutton is fellow of the Association for the Advancement of Artificial Intelligence (AAAI) since 2001.[4] In 2003 he received the President's Award from the International Neural Network Society[5] and in 2013, the Outstanding Achievement in Research award from the University of Massachusetts Amherst.[6]

Sutton's nomination as a AAAI fellow reads:[4]

For significant contributions to many topics in machine learning, including reinforcement learning, temporal difference techniques, and neural networks.

gollark: '11!¹!!one!¡¡!one1¹¹one1one¡¡one1one¹1!1¹¡oneone¡1one!¡one!¡one¡¡11¡¹¡¡one!one1!¡¡1one¹oneone1oneone¡¡¹one¹¡one¹¡11¹!1!11¹!one11one1one!¹!1!¹oneone¡1¹¹
gollark: How are you converting messages to sound?
gollark: > that long message by gollark just made a square wave sound sfxIn what way?
gollark: Ah, jabÜ.
gollark: Update your status accordingly.

References

  1. "Exclusive: Interview with Rich Sutton, the Father of Reinforcement Learning". 2018-01-11. Archived from the original on 2018-01-11. Retrieved 2018-12-17.
  2. "Brief Biography for Richard Sutton". incompleteideas.net. Retrieved 2018-12-17.
  3. "DeepMind expands to Canada with new research office in Edmonton, Alberta". DeepMind. Retrieved 2018-12-17.
  4. "Elected AAAI Fellows". www.aaai.org. Retrieved 2018-12-17.
  5. "INNS Award Recipients". www.inns.org. Retrieved 2018-12-17.
  6. "Outstanding Achievement and Advocacy Award Recipients". College of Information and Computer Sciences, University of Massachusetts Amherst. 2010-10-05. Retrieved 2018-12-17.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.