Master of Science in Data Science

Fully Online
18 months
2250 hours | 90 ECTS
Degree
Scaler Neovarsity
Accreditation:
EQF7

About

The course teaches students comprehensive and specialised subjects in data science; it develops sophisticated skills in statistics, mathematical modelling, and the ability to code in support of such analyses. It further grounds students in the disciplinary history and methodology of data science, preparing them for either further study or to work as a practitioner in the field. The program prominently features a major capstone project, requiring students to identify a real-world problem that would benefit from a data-driven approach; to collect and prepare the data to address the problem; and to build visualisations in support of their arguments. The combination of rigorous mathematical training with practical approaches gives learners the ability to autonomously further develop their skills after graduation, turning them into lifelong learners of data science methods.

  • Target Audience

    • Ages 19-30, 31-65, 65+

  • Target Group

    • This course is designed for individuals who wish to enhance their knowledge of computer science and its various applications used in different fields of employment. It is designed for those that will have responsibility for planning, organizing, and directing technological operations. In all cases, the target group should be prepared to pursue substantial academic studies. Students must qualify for the course of study by entrance application. A prior computer science degree is not required; however the course does assume technical aptitude; and it targets students with finance, engineering, or STEM training or professional experience.

  • Mode of attendance

    • Online/Blended Learning

  • Structure of the programme

  • Please note that this structure may be subject to change based on faculty expertise and evolving academic best practices. This flexibility ensures we can provide the most up-to-date and effective learning experience for our students.The Master of Science in Computer Science combines asynchronous components (lecture videos, readings, and assignments) and synchronous meetings attended by students and a teacher during a video call. Asynchronous components support the schedule of students from diverse work-life situations, and synchronous meetings provide accountability and motivation for students. Students have direct access to their teacher and their peers at all times through the use of direct message and group chat; teachers are also able to initiate voice and video calls with students outside the regularly scheduled synchronous sessions. Modules are offered continuously on a publicly advertised schedule consisting of cohort sequences designed to accommodate adult students at different paces. Although there are few formal prerequisites identified throughout the programme, enrollment in courses depends on advisement from Woolf faculty and staff. The degree has 3 tiers. The first tier is required for all students, who must take 15 ECTS. In the second tier, students must select 45 ECTS from elective tiers. Tier Three may be completed in two different ways: a) by completing a 30ECTS Advanced Applied Computer Science capstone project, or b) by completing a 10 ECTS Applied Computer Science project and 20 ECTS of electives from the program.

  • Grading System

    • Scale: 0-100 points

    • Components: 60% of the mark derives from the average of the assignments, and 40% of the mark derives from the cumulative examination

    • Passing requirement: minimum of 60% overall

  • Dates of Next Intake

    • Rolling admission

  • Pass rates

    • 2023 pass rates will be publicised in the next cycle, contingent upon ensuring sufficient student data for anonymization.

  • Identity Malta’s VISA requirement for third country nationals: https://www.identitymalta.com/unit/central-visa-unit/

    • Passing requirement: minimum of 60% overall

  • Dates of Next Intake

    • Rolling admission

  • Pass rates

Supporting your global mobility
Supporting your global mobility

Global Recognition

Woolf degrees align with major international qualification frameworks, ensuring global recognition and comparability. Earn your degree in the most widely recognized accreditation system in the world.

Learn More About Degree Mobility

Our accreditation through the Malta Further and Higher Education Authority (MFHEA) provides a solid foundation for credential recognition worldwide.

Success stories
Success stories

How students have found success through Woolf

"As a working parent, I needed something flexible and manageable. Woolf’s structure fit me perfectly. I was nervous at first, balancing work, parenting, and midnight classes, but the support, resources, and sense of community kept me going."
Andreia Caroll
Clinical Research Nurse
“Woolf and Scaler’s hands-on Master’s program gave me the practical skills and confidence I was missing after my undergraduate degree. Real projects, professional tools, and mentorship transformed how I think, build, and solve problems — leading me to a career as a Software Engineer.”
Bhavya Dhiman
Master’s in Computer Science
"Woolf provided me flexibility, a strong community, and high quality education. It really broadened my perspective and significantly improved my communication skills. I graduated not just more knowledgeable, but also more confident and well-rounded."
Brian Etemesi
Software Engineer
“Woolf’s flexible, accredited program gave me structure, community, and the confidence to grow. From landing my dream internship to winning a hackathon, Woolf opened doors and shaped both my career and mindset.”
Dominion Yusuf
Higher Diploma in Computer Science
"As a working parent, I needed something flexible and manageable. Woolf’s structure fit me perfectly. I was nervous at first, balancing work, parenting, and midnight classes, but the support, resources, and sense of community kept me going."
Andreia Caroll
Clinical Research Nurse
“Woolf and Scaler’s hands-on Master’s program gave me the practical skills and confidence I was missing after my undergraduate degree. Real projects, professional tools, and mentorship transformed how I think, build, and solve problems — leading me to a career as a Software Engineer.”
Bhavya Dhiman
Master’s in Computer Science
"Woolf provided me flexibility, a strong community, and high quality education. It really broadened my perspective and significantly improved my communication skills. I graduated not just more knowledgeable, but also more confident and well-rounded."
Brian Etemesi
Software Engineer
“Woolf’s flexible, accredited program gave me structure, community, and the confidence to grow. From landing my dream internship to winning a hackathon, Woolf opened doors and shaped both my career and mindset.”
Dominion Yusuf
Higher Diploma in Computer Science
• Students will develop advanced, innovative, and multi-disciplinary problem- solving skills, • Students will communicate data science clearly and unambiguously to specialised and non-specialised audiences • Students will develop advanced abilities related to data analytics operational procedures and the ability to implement them in response to changing environments. • Students will critically evaluate alternative approaches to data science on the basis of academic scholarship and case studies, demonstrating reflection on social and ethical responsibilities. • Students will formulate data-driven analytical judgments and plans despite incomplete information by integrating knowledge and approaches from diverse domains including statistical inference, machine learning, big data, computer vision, deep learning, and natural language processing. • Students will produce work driven by research at the forefront of the domain of Data Science. • Students will enquire critically into the theoretical strategies for applying data science and analytics within business and organizational contexts. • Students will gain facility with modern tools for data analysis, from data visualisation tools such as Tableau through platforms for analysing massive amounts of unstructured data, such as Hadoop or MongoDB, and cloud architectures for data analysis. • Students will develop new skills in response to emerging knowledge and techniques and demonstrate leadership skills and innovation in complex and unpredictable contexts. • Students will gain experience in working collaboratively on data science teams, including such skills as peer review, understanding contributor roles, and team dynamics.

Course Structure

Exploratory Data Analysis & Management
150 hours | 6 ECTS

About

Most industry analysis starts with exploratory data analysis and a thorough study of this will help learners to perform data health checks and provide initial business insights.

The module will help the learner to understand and perform descriptive statistics and present the data using appropriate graphs/diagrams and serves as a foundation for advanced analytics.

This module also introduces the basics of programming in R and Python, the most commonly used languages used for data science.

The module culminates in practices related to data management, which is essential for both exploratory data analysis and advanced analytics. In particular, the module focuses on SQL as a highly practical language for data preprocessing, and addresses ways to connect SQL with R and Python tools, as well as learning the skills required to prepare data for machine learning and efficient data modelling.

Core Reading List:

R for Data Science: Import, Tidy, Transform, Visualise, and Model DataPaperback – 25 July 2016

by Garrett Grolemund (Author), Hadley Wickham (Author)

Hands-On Exploratory Data Analysis with Python: Perform EDA techniques to understand, summarise, and investigate your data Paperback – 27 Mar. 2020

by Suresh Kumar Mukhiya (Author), Usman Ahmed (Author)

Supplementary Reading List:

Exploratory Data Analysis with R

Radhika Datar, Harish Garg

Publisher: Packt Publishing (31 May 2019)

ISBN: 178980437X

Teachers

Rachit Agarwal
Rachit Agarwal
Sujata Nitin Suvarnapathaki
Sujata Nitin Suvarnapathaki
Grainne Barry
Grainne Barry
Deshpande Vinayak Bapu
Deshpande Vinayak Bapu
Babita Kaul Kachroo
Babita Kaul Kachroo

Intended learning outcomes

Knowledge
  • Methods of distribution
  • Best practices used to visually display data.
  • Best practices related to data analysis and management, especially for large data sets.
  • Key strategies related to the most appropriate measures of central tendency.
Skills
  • Autonomously gather material, including from large data sets, and organise it into effective visualisations for analysis.
  • Assess symmetry of data using measures of skewness.
  • Accurately visualise and analyse data relationships. Autonomously connect SQL to R and Python to efficiently demonstrate data modelling processes through industry application.
Competencies
  • Import and export datasets and create data frames within R and Python, and connect these to SQL for preprocessing.
  • Troubleshoot problems and be prepared to make leadership decisions related to industry methods and principles of data analysis and management.
  • Independently work in R, Python, and SQL development environments.
  • Manage data sets using a variety of functions, including acting autonomously to identify problems and relevant solutions for data wrangling.
Statistical Inference
150 hours | 6 ECTS

About

This module provides learners with an in-depth understanding of the statistical distribution and hypothesis testing in a practical approach to getting things done.

Statistical distributions include Binomial, Poisson, Normal, Log-Normal, Exponential, t, F, and Chi-Square. Parametric and non-parametric tests used in research problems are covered in this unit.

The module will help learners to formulate research hypotheses, select appropriate tests of hypotheses, write primarily R programs to perform hypothesis testing, and to draw inferences using the output generated. After taking this course, students will understand the broad directions of statistical inference and use this information for making informed choices in analysing data.

Core Reading List:

Statistical Inference For Everyone

Copyright Year: 2017

Brian Blais, Bryant University

Teachers

Rachit Agarwal
Rachit Agarwal
Sujata Nitin Suvarnapathaki
Sujata Nitin Suvarnapathaki
Deshpande Vinayak Bapu
Deshpande Vinayak Bapu
Babita Kaul Kachroo
Babita Kaul Kachroo
Snigdha Pain
Snigdha Pain

Intended learning outcomes

Knowledge
  • The relevance of R to calculate probabilities.
  • Discrete and continuous random variables.
  • Key strategies related to distributions of observed data.
  • Select topics for the advanced management of parametric and non- parametric tests.
Skills
  • Understand and use statistical hypothesis testing concepts and terminology.
  • Autonomously perform tests for normality and common distribution
  • Analyse data relationships using covariance
Competencies
  • Evaluate standard types of distributions.
  • Demonstrate self-direction and industry practices in developing solutions for hypothesis testing.
  • Efficiently analyse the concept of variance through a variety of models.
Fundamentals of Predictive Modelling
150 hours | 6 ECTS

About

This module provides a strong foundation for predictive modelling. Its

objective is to define the entire modelling process with the help of real life

case studies.

Many concepts in predictive modelling methods are common and, therefore,

these concepts will be covered in detail in this module.

Students will learn how to carry out exploratory data analysis to gain

insights and prepare data for predictive modelling, an essential skill valued

in many industries.

The module also builds on information covered in the module Exploratory

Data Analysis to include hands-on applications of the summarization and

visualisation of datasets through plots to present results in compelling and

meaningful ways.

Core Reading List:

Mastering Predictive Analytics with R - Second Edition James D. Miller, Rui Miguel Forte Publisher Packt Publication date: August 2017

Predictive Analytics with Python, 1st Edition Alvaro Fuentes Publisher Packt

Teachers

Rachit Agarwal
Rachit Agarwal
Sujata Nitin Suvarnapathaki
Sujata Nitin Suvarnapathaki
Deshpande Vinayak Bapu
Deshpande Vinayak Bapu
Babita Kaul Kachroo
Babita Kaul Kachroo
Snigdha Pain
Snigdha Pain

Intended learning outcomes

Knowledge
  • Key strategies and best practices related to assessing the goodness of fit of a model.
  • Industry applications of normality tests.
  • The step-by-step construction of regression models.
Skills
  • Test value assumptions using multiple predictors.
  • Autonomously carry out global and individual testing of parameters used in defining predictive models
  • Evaluate machine learning models on a limited data sample.
Competencies
  • Demonstrate self-direction in calculating inflation factors.
  • Efficiently manage troubleshooting issues that arise in connection to data not explained by a model.
  • Solve problems and be prepared to take leadership decisions related to the methods and correlation of variables.
  • Apply a professional and scholarly approach to real-world problems pertaining to the estimation of model parameters.
Business Intelligence
150 hours | 6 ECTS

About

PowerBI and Excel are fundamental parts of the data analytics toolkit. A strong understanding of these also provides a basis for more advanced data analytics with other techniques and technologies. In this unit, learners will gain experience in collecting, processing, analysing, and communicating with data using Excel. In addition, data visualisation is a powerful way to communicate meaning in data and support business decision-making. This unit will cover the main commercial tools used in data visualisation such as Tableau and Power BI, enabling learners to create a wide range of graphs, charts, and dashboards and use them appropriately in context.

Teachers

Snigdha Pain
Snigdha Pain

Intended learning outcomes

Knowledge
  • Theories and contemporary practices in business analytics.
  • Key strategies related to deploying data in business operations and management
  • Select topics related to industry-specific uses of PowerBI
Skills
  • Autonomously solve problems in the domain of visualizing business data.
  • Autonomously identify opportunities for the use of Excel and PowerBI in business contexts
  • Employ Excel–including tools such as pivot tables, and basic visualisations–and PowerBI to surface insights about business operations .
Competencies
  • Apply a professional and scholarly approach to data analytics within a business context.
  • Demonstrate self-direction in research and originality in addressing the availability of data for business operations.
  • Solve problems related to the use of dashboards and visualisations for business management
  • Act autonomously in identifying research problems and solutions related to applications of Excel and PowerBI for analytics.
Data Science In Practice
150 hours | 6 ECTS

About

This unit provides learners with an opportunity to apply key knowledge and skills through project work. They will be able to select a project from a specific domain and will be required to carry out various data management, exploratory data analysis, data visualisation and predictive modelling tasks. The Data Science in Practice work should deepen their engagement with this material, and should prepare students for engaging fully with contemporary research methods in data science.

Reading List

(General):

  • Gao, G., Mishra, B., & Ramazzotti, D. (2018). Causal data science for financial stress testing. J. Comput. Sci., 26, 294-304.

  • Chen, H., Lundberg, S.M., & Lee, S. (2018). Hybrid Gradient Boosting Trees and Neural Networks for Forecasting Operating Room Data. ArXiv, abs/1801.07384. (Machine Learning)

  • Miller, James D. and Rui Miguel Forte. Mastering Predictive Analytics with R: Machine Learning Techniques For Advanced Models. Second Ed. Birmingham: Packt, 2017.

  • Fuentes, Alvaro. Mastering Predictive Analytics with Python. Birmingham: Packt, 2018. (Data Analytics in Business)

  • Hands-On Exploratory Data Analysis with R: Become an Expert in Exploratory Data Analysis Using R Packages, Radhika Datar and Harish Garg, 1st Edition. (Packt Publishing, 2019). 266 pages

  • Hands-On Exploratory Data Analysis with Python: Perform EDA Techniques to Understand, Summarize, and Investigate Your Data, Suresh Kumar Mukhiya and Usman Ahmed, 1st Edition. (Packt Publishing, 2020). 352 pages.

Teachers

Sujata Nitin Suvarnapathaki
Sujata Nitin Suvarnapathaki
Deshpande Vinayak Bapu
Deshpande Vinayak Bapu
Babita Kaul Kachroo
Babita Kaul Kachroo
Snigdha Pain
Snigdha Pain

Intended learning outcomes

Knowledge
  • Theories and contemporary practices in data analytics.
  • Select topics related to industry-specific uses of programming in R, Python, and MySQL, as well as data visualisation tools.
  • Key strategies related to statistical modelling and predictive analytics.
Skills
  • Autonomously solve problems in the domain of data analytics.
  • Autonomously identify opportunities for the use of Python, R, MySQL, and data visualisation tools.
  • Employ statistical modelling and predictive analytics within real-world business contexts
Competencies
  • Act autonomously in identifying research problems and solutions related to data visualisation and analytics.
  • Apply a professional and scholarly approach to data analytics within a real-world context.
  • Demonstrate self-direction in research and originality in addressing statistical analysis and predictive modelling.
  • Solve problems related to the use of programming and data modelling in real-world applications
Advanced Predictive Modelling
150 hours | 6 ECTS

About

This module builds on the concepts introduced in the module Fundamentals of Predictive Modelling.

In this module, learners are introduced to model development for categorical dependent variables. Binary dependent variables are encountered in many domains such as risk management, marketing and clinical research and this unit covers detailed model building processes for binary dependent variables. Additionally, a primary goal of the module is for students to be able to select and successfully apply appropriate advanced regression models in applied settings.

The module will culminate with multinomial models and ordinal scaled variables.

Core Reading List:

Mastering Predictive Analytics with R - Second Edition James D. Miller, Rui Miguel Forte Publisher Packt Publication date: August 2017

Predictive Analytics with Python, 1st Edition Alvaro Fuentes Publisher Packt

Teachers

Rachit Agarwal
Rachit Agarwal
Sujata Nitin Suvarnapathaki
Sujata Nitin Suvarnapathaki
Deshpande Vinayak Bapu
Deshpande Vinayak Bapu
Babita Kaul Kachroo
Babita Kaul Kachroo
Snigdha Pain
Snigdha Pain

Intended learning outcomes

Knowledge
  • Comparing data to a known distribution.
  • Determining if a sample follows a normal distribution.
  • The implementation of binomial regression in real world settings.
Skills
  • Develop applications using more than two categories of dependent, outcome, or explanatory variables.
  • Critically assess the effect of several variables upon the time a specified result takes to occur.
  • Develop models using one or more predictor variables to predict the target variable classes.
Competencies
  • Efficiently estimate model parameters.
  • Act autonomously in developing estimates of unknown population parameters.
  • Demonstrate self-direction in global hypothesis testing.
  • Solve problems related to generalised linear models through link function.
Unsupervised Multivariate Methods
150 hours | 6 ECTS

About

Data reduction is a key process in business analytics projects. In this module, learners will learn data reduction methods such as Principal Component Analysis, Factor Analysis and Multidimensional Scaling.

Students will develop skills related to the formation of segments using cluster analysis methods. Additionally, students will analyse segments, the process of which is a key technique for large groups of data as intrinsic information appears in detail once segmented thoughtfully.

Required Reading Material:

  • Applied Unsupervised Learning with R Publisher: Packt R Copyright Year: March 2019 ISBN 9781789956399 Alok Malik, Bradford Tuckfield

Teachers

Rachit Agarwal
Rachit Agarwal
Sujata Nitin Suvarnapathaki
Sujata Nitin Suvarnapathaki
Grainne Barry
Grainne Barry
Deshpande Vinayak Bapu
Deshpande Vinayak Bapu
Babita Kaul Kachroo
Babita Kaul Kachroo

Intended learning outcomes

Knowledge
  • Estimating the optimum number of clusters using hierarchical clustering.
  • Key strategies related to the concept of data reduction.
  • Select topics for the advanced implementation of Eigenvectors.
  • Algorithms relevant to multivariate methods.
Skills
  • Apply an in-depth domain-specific knowledge and understanding to multivariate analysis.
  • Define Principal Component Analysis (PCA) and its derivations and assess their application.
  • Critically understand and implement hierarchical and non-hierarchical cluster analysis and assess their outputs.
Competencies
  • Solve problems and be prepared to take leadership decisions related to the methods and principles of visualising the level of similarity of individual cases of a dataset.
  • Act autonomously in the estimation of loading matrices and interpreting factor solutions.
  • Demonstrate self-direction in research and originality in developing scoring models.
  • Apply industry best practices for resolving issues pertaining to factor analysis.
Time Series Analysis
150 hours | 6 ECTS

About

In this module, time series forecasting methods are introduced and explored. Students will gain a working knowledge of the nature and processes used in relation to time series data and confidently recognize and understand trends that exist within that data. This information will be used to make predictions or forecasts.

Students will analyse and forecast macroeconomic variables such as GDP and inflation. Additionally, students will work with complex financial models using ARCH and GARCH, ARIMA, time series regression, exponential smoothing, and other models.

Core Reading List:

Hands on Time Series Analysis with R

Rami Krispin

Publisher: Packt

Copyright Year: May 2019

Teachers

Rachit Agarwal
Rachit Agarwal
Sujata Nitin Suvarnapathaki
Sujata Nitin Suvarnapathaki
Deshpande Vinayak Bapu
Deshpande Vinayak Bapu
Babita Kaul Kachroo
Babita Kaul Kachroo
Paul Breton Penman
Paul Breton Penman

Intended learning outcomes

Knowledge
  • Models related to series analysis.
  • Conversion of non-stationary time series data into stationary time series data.
  • Key strategies related to the concept of seasonal decomposition.
Skills
  • Validate Auto Regressive Integrated Moving Average (ARIMA) models and use estimation.
  • Implement panel data regression methods.
  • Assess the concepts and uses of time series analysis and test for stationarity in time series data.
Competencies
  • Create synthetic contextualised discussions of key issues related to components of time series.
  • Efficiently manage industry-level issues in connection to trend analysis.
  • Demonstrate self-direction in developing real-world applications for serial correlation.
  • Solve problems and be prepared to take leadership decisions related to the methods and principles of residual analysis.
Machine Learning I
150 hours | 6 ECTS

About

Machine learning algorithms are new generation algorithms used in conjunction with classical predictive modelling methods.

In this Machine Learning 1 module, learners will understand applications of the Support vector machine, K Nearest Neighbours and Naive Bayes algorithms for classification and regression problems. Additionally, students will develop practical machine learning and data science skills including theoretical basics of a broad range of machine learning concepts and methods with practical applications to sample datasets.

Reading List:

Introduction to Machine Learning with Python: A guide for Data Scientists, Andreas Müller and Sarah Guido, 1st Edition. (O’Reilly Media, 2016).

Teachers

Rachit Agarwal
Rachit Agarwal
Sujata Nitin Suvarnapathaki
Sujata Nitin Suvarnapathaki
Deshpande Vinayak Bapu
Deshpande Vinayak Bapu
Babita Kaul Kachroo
Babita Kaul Kachroo
Snigdha Pain
Snigdha Pain

Intended learning outcomes

Knowledge
  • Decision boundaries that help classify data points.
  • The industry relevance of the apriori algorithm.
  • Models intended to predict the value of a target variable.
  • Regression models with binary target variables.
Skills
  • Appraise classification methods and the support vector machine algorithm.
  • Use algorithims to make predictions and apply neutral networks to classification problems.
  • Apply decision tree and random forest algorithms to classification and regression problems.
Competencies
  • Act autonomously in identifying neutral networks for classification problems.
  • Apply a professional and scholarly approach to Bayes theorem and its applications.
  • Efficiently manage issues in connection to machine algorithms.
  • Demonstrate self-direction in bootstrapping and aggregation.
Machine Learning II
150 hours | 6 ECTS

About

Machine learning algorithms are new generation algorithms used in

conjunction with classical predictive modelling methods.

Machine learning algorithms are new generation algorithms used in

conjunction with classical predictive modelling methods. In this Machine Learning 2 module, students build on the knowledge gained from Machine Learning 1 and will go on to understand applications of decision trees and random forest algorithms, and neural networks for classification and regression problems. Additionally, students will develop practical machine learning and data science skills including theoretical basics of a broad range of machine learning concepts and methods with practical applications to sample datasets.

Reading List:

Introduction to Machine Learning with Python: A guide for Data Scientists, Andreas Müller and Sarah Guido, 1st Edition. (O’Reilly Media, 2016).

Teachers

Rachit Agarwal
Rachit Agarwal
Sujata Nitin Suvarnapathaki
Sujata Nitin Suvarnapathaki
Deshpande Vinayak Bapu
Deshpande Vinayak Bapu
Babita Kaul Kachroo
Babita Kaul Kachroo
Snigdha Pain
Snigdha Pain

Intended learning outcomes

Knowledge
  • Regression models with binary target variables.
  • Models intended to predict the value of a target variable.
  • Decision boundaries that help classify data points.
  • The industry relevance of the apriori algorithm.
Skills
  • Use algorithms to make predictions and apply neutral networks to classification problems.
  • Apply decision tree and random forest algorithms to classification and regression problems.
  • Appraise classification methods and the support vector machine algorithm.
Competencies
  • Demonstrate self-direction in bootstrapping and aggregation.
  • Apply a professional and scholarly approach to Binary Logistic Regression and its applications.
  • Act autonomously in identifying neutral networks for classification problems.
  • Efficiently manage issues in connection to decision tree and random forest machine learning algorithms.
Text Mining and Natural Language Processing
150 hours | 6 ECTS

About

In this module, students will look at analysing unstructured data such as that found on social media, newspaper articles, videos, and more.

Specifically, students will look at text techniques for text mining and natural language processing using R and Python code to produce graphical representations of unstructured data and carry out sentiment analysis.

This module focuses on learning key concepts, tools, and methodologies for natural language processing and emphasises hands-on learning through guided tutorials and real-world examples.

Core Reading List:

Text Mining with R

Julia Silge and David Robinson.

O’Reilly

Natural Language Processing with Python

Steven Bird, Ewan Klein and Edward Loper.

O’Reilly

Teachers

Rachit Agarwal
Rachit Agarwal
Sujata Nitin Suvarnapathaki
Sujata Nitin Suvarnapathaki
Deshpande Vinayak Bapu
Deshpande Vinayak Bapu
Babita Kaul Kachroo
Babita Kaul Kachroo
Snigdha Pain
Snigdha Pain

Intended learning outcomes

Knowledge
  • Industry applications in the domain of language processing.
  • Principles and applications of sentiment analysis.
  • Key strategies related to structured data versus unstructured data and the features of each.
  • Principles and applications of text analysis.
Skills
  • Process text data to generate insights
  • Perform sentiment analysis on unstructured data.
  • Process text data and strings, and perform pattern matching with expressions in R and Python.
Competencies
  • Efficiently manage issues that arise in connection to text mining.
  • Apply a professional and scholarly approach to research problems pertaining to natural language processing.
  • Demonstrate self-direction in applying solutions related to text mining.
Applied Data Science Practicum
750 hours | 30 ECTS

About

The Applied Data Science Practicum requires learners to investigate a real-world problem in the last phase of the MSc Data Science course. Its objective is to help students appropriately apply the concepts, techniques and tools learned from the Postgraduate Certificate and Diploma parts of the course to a real-world scenario.

Students typically choose a problem from a particular business or social domain after discussing it with the course instructor(s). They have the option of working on a real-world problem from their own organisation and working with a mentor in conjunction with their course supervisor. All external expert supervisors and projects need to be approved by the instructor(s) to ensure that the analytic question is appropriately scoped and technically challenging and that the solutions are rigorous and of high quality.

Students are required to solve an analytically complex research problem. Once the problem has been approved by the instructor(s), the student conducts a literature review of prior work in the field. Then, they conduct an exploratory data analysis, hypothesis testing, and research design and use a range of classical and/or modern machine learning modelling methods to predict outcomes and provide actionable insights and recommendations. Depending on the problem, the students may build dashboards or other artifacts as part of this work.

A key part of the project is to communicate the output of the learner’s research to technical and non-technical audiences through written, verbal and visual means.

Teachers

Sujata Nitin Suvarnapathaki
Sujata Nitin Suvarnapathaki
Deshpande Vinayak Bapu
Deshpande Vinayak Bapu
Babita Kaul Kachroo
Babita Kaul Kachroo
Snigdha Pain
Snigdha Pain

Intended learning outcomes

Knowledge
  • Theories for business and organisational applications in the domain of data science.
  • Key strategies related to best practices of modern data analytics.
  • Diverse scholarly views on data modelling, predictive inference, machine learning, and data visualisation.
  • Research methods related to data science.
Skills
  • Critically analyse the performance of various statistical methods and machine learning algorithms with real world data sets.
  • Apply an in-depth, domain-specific knowledge of modern data analytics methods and successfully communicate the outcomes of complex technical processes through verbal, visual, and written means.
  • Creatively apply statistical and scientific methods to evaluate research problems.
  • Autonomously synthesise a wide range of data science skills and methods to real world problems.
Competencies
  • Efficiently manage interdisciplinary issues that arise in connection to statistical methods and data visualisations.
  • Demonstrate self-direction in research and originality in solutions developed for classical and machine learning algorithms and other modelling methods.
  • Apply a professional and scholarly approach to research problems pertaining to data science and machine learning.
  • Create synthetic contextualised discussions of key issues related to real-world problems in data science.

Entry Requirements

Tuition Cost
6,000 USD
Student education requirement
Undergraduate (Bachelor’s)

Application Process

1

Submit initial Application

Complete the online application form with your personal information

2

Documentation Review

Submit required transcripts, certificates, and supporting documents

3

Assessment

Your application will be evaluated against program requirements

4

Interview

Selected candidates may be invited for an interview

5

Decision

Receive an admission decision

6

Enrollment

Complete registration and prepare to begin your studies

Ready to advance your education with a globally recognised degree?

Previous
Overview
Next
What you’ll learn