Data Analytics for Practitioner (DAP)
The future belongs to those who can transform data into strategic business decisions and value-driven products. Data Analytics is a field of Big Data which seeks to provide meaningful information from complex data using various tools, statistics and programming principles.
Course Overview
Data analytics incorporates statistics and information technology, providing data analysts with information that optimizes business efficiency. From enabling customized user experiences, to refining business operations in accordance with emerging trends, data analytics potentially elevates businesses among competitors.
This data-driven course offers a comprehensive hands-on experience, exposing trainees to practical analytical perspectives and also equips trainees with an extensive expertise in handling data using the ideal analytical tools.
By the end of this course, trainees should be well-informed of data analytics processes, ultimately providing better judgements on statistically significant decisions crucial for businesses and operations.
Participant Prerequisites
Candidates must have a background in databases and programming. Basic background in statistics, probability, linear algebra, and calculus would be helpful.
Course Objectives
Upon completion of this course, you will be able to:
-
Incorporate text analytics over corresponding areas of commercial and retail production.
-
Enhance the efficiency of customer behavioural modeling.
-
Understand Text Analytics and Data Science.
-
Grow the value-add of operations for higher management.
Course Outline
The following items describe the outline of the course:
Day 1:
The Business Case for Data Analytics
-
Predicting customer behavior with greater accuracy and precision
-
Cost improvements by avoiding manual data processing
-
Innovation improvements
-
Analytics Maturity Curve
Data Analytics
-
Introduction to Data Analysis
-
Data Processing
-
Data Quality
-
Working with SQL vs. NoSQL
Case Study
-
Product recommendation and churn modeling cases will be discussed
Hands-on Excercise
-
Participants will be asked to assess their Analytics Maturity
-
Participants will be asked to define their problem statements for their business segments
-
Participants will be asked to discuss organizational and cultural limitations within their companies to broad analytics adoption
Day 2:
Exploratory Data Analysis
-
Distributions – Binomial, Power Law (Benford’s Law and Zipf’s Law), Normal, Etc.
-
Descriptive statistics
-
Fitting linear models with lm
-
Fitting linear models with multiple predictors
-
Overfitting
Predictive Analytics with Python & R
-
What is a prediction?
-
Sampling, training, and test sets
-
Constructing a decision tree for Purchasing Decision
-
Collaborative Filtering for Recommendation Analysis
-
Association Rule Mining for Recommendation Analysis
-
Logistic Regression for Churn modeling
-
Neural Network for Churn modeling
Customer Segmentation with with Python and R
-
Clustering using k-means and other techniques
-
Classification using naïve Bayesian and random Forest
-
Identifying Potential Customers
Hands-on Exercise
-
Participants will be asked to perform customer segmentation on a customer data set
-
Participants will be asked to create a recommender on a customer/product data set
Day 3:
Data Visualization with Tableau
-
Presenting Data
-
Creating Dashboards and Storyboards
-
Identifying measures
Setting up a Data Architecture
-
Data Ingestion tools – Nifi, Kylo
-
Data Lakes – Hadoop & S3
-
Data Transformation and ELT
-
Data Storage (SQL vs. NoSQL)
-
Message Storage (Redis Pub-Sub)
-
Streaming Data Storage and Processing (Spark)
-
Search Database (ElasticSearch)
-
Graph Database (Neo4J)
Hands-in Exercise
-
Participants will be asked to create a simple dashboard in Tableau
-
Participants will be asked to identify roles and responsibilities that are either lacking or not adequately skilled for Data Analytics
Course Materials
The following materials are included as part of the course:
- iTrain Asia official digital curriculum
Exam Format
Exam duration is 2 hours, consisting of 40 Multiple Choice Questions, with a Passing Score of 70%.






