Description
Covers Sections 1 and 2 of the ICDSP Certification Syllabus
1. Accessing and Manipulating Data
1.1 Preparing data for analysis and modelling
1.2 Developing optimal data structures
1.3 Transforming data into usable datasets
1.4 Exploratory data analysis including identification of anomalies and outliers
1.5 Handling missing data – imputation and other methods
2. Data Mining and Modelling: Supervised and Unsupervised Learning Systems
2.1 Classification methods
2.2 Regression methods – linear, logistic and non-linear regression models
2.3 Time-series forecasting methods
2.4 Ensemble methods, e.g. Boosted Decision Trees and Forests
2.5 Association-based data mining schemes
2.6 Unsupervised learning through clustering and segmentation
Duration: The module features twice-weekly synchronous online sessions (2 hours per session) over a 6-week period, for 24 contact hours in total.