production
Skip to Content

CS50AE: INFORMATION EXTRACTION AND TEXT ANALYTICS (2015-2016)

Last modified: 25 Mar 2016 11:39


Course Overview

An abundance of textual information is available on the Internet. As it is dispersed over web pages, it is difficult to extract the information and understand its overall meaning. In this course, students will learn information extraction and text mining theory and techniques, corpus construction, and programming tools (e.g. NLTK and GATE) in order to extract and structure information from text. The emphasis is hands-on and realistic. Using the techniques and tools, students will be able to start to unlock the economic, cultural, and social value of web-based textual information, gaining valuable skills in an expanding market.

Course Details

Study Type Postgraduate Level 5
Session First Sub Session Credit Points 15 credits (7.5 ECTS credits)
Campus Old Aberdeen Sustained Study No
Co-ordinators
  • Dr Adam Wyner
  • Dr Advaith Siddharthan

Qualification Prerequisites

None.

What courses & programmes must have been taken before this course?

  • Computing Science (CS) (Studied)
  • Any Postgraduate Programme (Studied)

What other courses must be taken with this course?

None.

What courses cannot be taken with this course?

None.

Are there a limited number of places available?

No

Course Description

Syllabus
 Background on NLTK and GATE, including an overview of computational linguistic cascade, rule-based systems, and machine learning
Accessing resources, corpus development, and processing raw text
Annotation standards, model specification and development
Application of annotation models
Adjudication, testing and evaluation
Auxiliary processing modules: opinion mining, ontologies, text classification, parsing etc.
Extracting structured information to a variety of data formats (ontologies, XML, logic)
Managing linguistic data

Contact Teaching Time

Information on contact teaching time is available from the course guide.

Teaching Breakdown

More Information about Week Numbers


Details, including assessments, may be subject to change until 31 August 2023 for 1st half-session courses and 22 December 2023 for 2nd half-session courses.

Summative Assessments

1st Attempt: 1 two-hour written examination (75%); continuous assessment (25%)

Resit: Will be available

Formative Assessment

There are no assessments for this course.

Feedback

None.

Course Learning Outcomes

None.

Compatibility Mode

We have detected that you are have compatibility mode enabled or are using an old version of Internet Explorer. You either need to switch off compatibility mode for this site or upgrade your browser.