Select Page

Data Analytics for Practitioner (DAP)

The future belongs to those who can transform data into strategic business decisions and value-driven products. Data Analytics is a field of Big Data which seeks to provide meaningful information from complex data using various tools, statistics and programming principles.

Course Overview

Data analytics incorporates statistics and information technology, providing data analysts with information that optimizes business efficiency. From enabling customized user experiences, to refining business operations in accordance with emerging trends, data analytics potentially elevates businesses among competitors.

This data-driven course offers a comprehensive hands-on experience, exposing trainees to practical analytical perspectives and also equips trainees with an extensive expertise in handling data using the ideal analytical tools.

By the end of this course, trainees should be well-informed of data analytics processes, ultimately providing better judgements on statistically significant decisions crucial for businesses and operations.

Participant Prerequisites

Candidates must have a background in databases and programming. Basic background in statistics, probability, linear algebra, and calculus would be helpful.

Course Objectives

Upon completion of this course, you will be able to:

 

  • Incorporate text analytics over corresponding areas of commercial and retail production.

  • Enhance the efficiency of customer behavioural modeling.

  • Understand Text Analytics and Data Science.

  • Grow the value-add of operations for higher management.

Course Outline

The following items describe the outline of the course:

Day 1:

The Business Case for Data Analytics

  • Predicting customer behavior with greater accuracy and precision

  • Cost improvements by avoiding manual data processing

  • Innovation improvements

  • Analytics Maturity Curve

Data Analytics

  • Introduction to Data Analysis

  • Data Processing

  • Data Quality

  • Working with SQL vs. NoSQL

Case Study

  • Product recommendation and churn modeling cases will be discussed

Hands-on Excercise

  • Participants will be asked to assess their Analytics Maturity

  • Participants will be asked to define their problem statements for their business segments

  • Participants will be asked to discuss organizational and cultural limitations within their companies to broad analytics adoption

Day 2:

Exploratory Data Analysis

  • Distributions – Binomial, Power Law (Benford’s Law and Zipf’s Law), Normal, Etc.

  • Descriptive statistics

  • Fitting linear models with lm

  • Fitting linear models with multiple predictors

  • Overfitting

Predictive Analytics with Python & R

  • What is a prediction?

  • Sampling, training, and test sets

  • Constructing a decision tree for Purchasing Decision

  • Collaborative Filtering for Recommendation Analysis

  • Association Rule Mining for Recommendation Analysis

  • Logistic Regression for Churn modeling

  • Neural Network for Churn modeling

Customer Segmentation with with Python and R

  • Clustering using k-means and other techniques

  • Classification using naïve Bayesian and random Forest

  • Identifying Potential Customers

Hands-on Exercise

  • Participants will be asked to perform customer segmentation on a customer data set

  • Participants will be asked to create a recommender on a customer/product data set

Day 3:

Data Visualization with Tableau  

  • Presenting Data

  • Creating Dashboards and Storyboards

  • Identifying measures

Setting up a Data Architecture

  • Data Ingestion tools – Nifi, Kylo

  • Data Lakes – Hadoop & S3

  • Data Transformation and ELT

  • Data Storage (SQL vs. NoSQL)

  • Message Storage (Redis Pub-Sub)

  • Streaming Data Storage and Processing (Spark)

  • Search Database (ElasticSearch)

  • Graph Database (Neo4J)

Hands-in Exercise

  • Participants will be asked to create a simple dashboard in Tableau

  • Participants will be asked to identify roles and responsibilities that are either lacking or not adequately skilled for Data Analytics

      Course Materials

      The following materials are included as part of the course:

      • iTrain Asia official digital curriculum

      Exam Format

      Exam duration is 2 hours, consisting of 40 Multiple Choice Questions, with a Passing Score of 70%.

      DAP - Data Analytics for Practitioner

      4 + 12 =