- Student Records
Programme & Unit Catalogues


CM50270: Reinforcement learning

Follow this link for further information on academic years Academic Year: 2019/0
Further information on owning departmentsOwning Department/School: Department of Computer Science
Further information on credits Credits: 6      [equivalent to 12 CATS credits]
Further information on notional study hours Notional Study Hours: 120
Further information on unit levels Level: Masters UG & PG (FHEQ level 7)
Further information on teaching periods Period:
Semester 2
Further information on unit assessment Assessment Summary: CW 100%
Further information on unit assessment Assessment Detail:
  • Written Project Report + Oral Presentation (CW 40%)
  • Programming Assignments (CW 60%)
Further information on supplementary assessment Supplementary Assessment:
Like-for-like reassessment (where allowed by programme regulations)
Further information on requisites Requisites:
Further information on descriptions Description: Aims:
This unit introduces the reinforcement learning problem and describes basic solution methods.

Learning Outcomes:
At the end of this unit, students will be able to:
1. describe how reinforcement learning problems differ from supervised learning problems such as regression and classification,
2. formulate suitable real-world problems as reinforcement learning problems by defining a state space, an action space, and a reward function appropriate for the context,
3. critically evaluate a range of basic solution methods to reinforcement learning problems,
4. analyse the difficulties encountered in solving large, complex reinforcement learning problems in practice.

Skills:
Intellectual skills:
* Develop algorithmic thinking for sequential decision making under uncertainty (T, F, A)
Transferable skills:
* Enhance perspective of decision making (T, F)
* Oral presentation of ones work (F,A)

Content:
Topics covered normally include: dynamic programming, Monte Carlo methods, temporal-difference algorithms, integration of planning and learning, value function approximation, and policy gradient methods.
Further information on programme availabilityProgramme availability:

CM50270 is Optional on the following programmes:

Department of Computer Science
  • RSCM-AFM51 : Integrated PhD Accountable, Responsible and Transparent Artificial Intelligence
  • TSCM-AFM51 : MRes Accountable, Responsible and Transparent Artificial Intelligence
  • TSCM-AFM52 : MSc Accountable, Responsible and Transparent Artificial Intelligence
  • TSCM-AFM45 : MSc Data Science
  • TSCM-AWM45 : MSc Data Science
  • TSCM-AFM48 : MSc Machine Learning and Autonomous Systems
  • TSCM-AWM48 : MSc Machine Learning and Autonomous Systems
Department of Electronic & Electrical Engineering

Notes: