production
Skip to Content

QC5003: EVALUATION OF AI SYSTEMS (2026-2027)

Last modified: 05 Dec 2025 16:46


Course Overview

One of the biggest challenges in Artificial Intelligence is evaluating how well AI systems work.   This course will provide students of our MSc in AI with knowledge of core evaluation concepts, approaches, tools, techniques and technologies; we will also look at software testing of AI systems.

Course Details

Study Type Postgraduate Level 5
Term First Term Credit Points 15 credits (7.5 ECTS credits)
Campus Offshore Sustained Study No
Co-ordinators
  • Dr Wei Zhao

What courses & programmes must have been taken before this course?

  • Any Postgraduate Programme (Studied)

What other courses must be taken with this course?

None.

What courses cannot be taken with this course?

None.

Are there a limited number of places available?

No

Course Description

The course will cover concepts, methods, techniques and tools/technologies for evaluating and testing AI systems. Students will be equipped with knowledge on statistical analysis and hypothesis testing (e.g., t-test, chi-square, correlations and regression) and learn to use software/tools for statistical analysis. The course will cover the design and execution of human evaluations of AI systems, including research ethics when working with human subjects.  It will also examine the use of automated metrics to evaluate AI systems, including best practice, common pitfalls, and avoiding bias; and methodologies for rigorous software testing of AI systems.


Contact Teaching Time

Information on contact teaching time is available from the course guide.

Teaching Breakdown

More Information about Week Numbers


Details, including assessments, may be subject to change until 31 August 2025 for 1st Term courses and 19 December 2025 for 2nd Term courses.

Summative Assessments

Report: Group

Assessment Type Summative Weighting 50
Assessment Weeks Feedback Weeks

Look up Week Numbers

Feedback
Learning Outcomes
Knowledge LevelThinking SkillOutcome
ReflectionAnalyseStudents will be familiar with common AI evaluation techniques, and can critically assess whether these techniques are being used appropriately.
ReflectionCreateStudents will be able to design, carry out, and write up experiments to test research questions and hypotheses
ReflectionCreateStudents will understand how to perform common statistical hypothesis tests.

Report: Individual

Assessment Type Summative Weighting 50
Assessment Weeks Feedback Weeks

Look up Week Numbers

Feedback
Learning Outcomes
Knowledge LevelThinking SkillOutcome
ReflectionAnalyseStudents will be familiar with common AI evaluation techniques, and can critically assess whether these techniques are being used appropriately.
ReflectionCreateStudents will understand how to perform common statistical hypothesis tests.

Formative Assessment

There are no assessments for this course.

Resit Assessments

Resubmission of failed elements (pass marks carried forward)

Assessment Type Summative Weighting
Assessment Weeks Feedback Weeks

Look up Week Numbers

Feedback
Learning Outcomes
Knowledge LevelThinking SkillOutcome
Sorry, we don't have this information available just now. Please check the course guide on MyAberdeen or with the Course Coordinator

Course Learning Outcomes

Knowledge LevelThinking SkillOutcome
ReflectionAnalyseStudents will be familiar with common AI evaluation techniques, and can critically assess whether these techniques are being used appropriately.
ReflectionCreateStudents will understand how to perform common statistical hypothesis tests.
ReflectionCreateStudents will be able to design, carry out, and write up experiments to test research questions and hypotheses

Compatibility Mode

We have detected that you are have compatibility mode enabled or are using an old version of Internet Explorer. You either need to switch off compatibility mode for this site or upgrade your browser.