Accessibility statement

DATA: Introduction to Data Science - COM00028I

« Back to module search

  • Department: Computer Science
  • Module co-ordinator: Dr. Tarique Anwar
  • Credit value: 20 credits
  • Credit level: I
  • Academic year of delivery: 2023-24
    • See module specification for other years: 2024-25

Module summary

Introduction to Data Science

Module will run

Occurrence Teaching period
A Semester 1 2023-24

Module aims

Students will be introduced to key concepts required to undertake rigorous and valid data analysis. Students will be introduced to processes for collecting, manipulating and cleaning data, while gaining experience in judging the quality of data sources. Students will be introduced to statistical analysis in data science, including correlation, inferential statistics and regression, and how to use these tests in a programming environment. Relational databases, SQL, and and other database paradigms such as NoSQL, are covered as a way of storing and accessing data. A key aim of the module is to solve complex problems and deliver insights about multi-dimensional data.

Module learning outcomes

  • Distinguish between different types of data that are generated in science, engineering and design, and employ strategies for ensuring data quality.
  • Retrieve data from a variety of different data sources in a variety of different formats.
  • Apply inferential statistics and statistical procedures to test hypotheses about features and relationships within data sets.
  • Use appropriate visualisations to present and explore data sets.
  • Use databases, both relational and of other paradigms, to store and query data.
  • Identify the ethical concerns regarding the provenance of data, the privacy of individuals, and the impact data analytics can have on society, and apply topics from the code of ethics of a professional data protection body.

Assessment

Task Length % of module mark
Essay/coursework
Essay: DATA
N/A 100

Special assessment rules

None

Reassessment

Task Length % of module mark
Essay/coursework
Essay: DATA
N/A 100

Module feedback

Feedback is provided through work in practical sessions, and after the final assessment as per normal University guidelines.

Indicative reading

*** Spiegelhalter, D., The Art of Statistics: Learning from Data, Pelican, 2019.

*** VanderPlas, J. Python Data Science Handbook: Essential Tools for Working with Data, O’Reilly, 2016.

** Igual, L. Segui, S. Introduction to Data Science: A Python Approach to Concepts, Techniques and Applications, Springer, 2017



The information on this page is indicative of the module that is currently on offer. The University is constantly exploring ways to enhance and improve its degree programmes and therefore reserves the right to make variations to the content and method of delivery of modules, and to discontinue modules, if such action is reasonably considered to be necessary by the University. Where appropriate, the University will notify and consult with affected students in advance about any changes that are required in line with the University's policy on the Approval of Modifications to Existing Taught Programmes of Study.