Data Analysis Boot Camp - 3 Day

This course, organized into key topic areas, uses straightforward business examples to explain practical techniques for assessing data quality and for translating data into analysis of business problems, so you can begin making informed decisions. It opens with an overview of data quality and data management, followed by foundational analysis and statistical techniques.

 

Throughout the course, you will learn to communicate about data and findings to stakeholders who need to quickly make the decisions that drive your organization forward.

At the end of the class, we provide an overview of the Certified Analytics Professional (CAP) certification. We discuss business applications for certified professionals, the main focus areas of the certification, and test-preparation and test-taking anecdotes.

In-Class Exercises, Demos, and Real-World Case Studies

This data analysis training class is a lively blend of expert instruction and hands-on exercises, so you can practice new skills as you learn them. Leave prepared to start performing practical analysis techniques the moment you return to work. Every Data Analysis Boot Camp instructor is a veteran consultant and data guru who will guide you through effective best practices and easily accessible technologies for working with your data. Through a combination of demonstrations and hands-on practice, you will learn to use data analysis techniques that are typically the domain of expensive consultants:

  • Identify opportunities, manage change and develop deep visibility into your organization

  • Understand the terminology and jargon of analytics, business intelligence and statistics

  • Learn a wealth of practical applications for data analysis

  • Visualize both data and the results of your analysis for straightforward graphical presentation to stakeholders

  • Learn to estimate more accurately while accounting for variance, error, and confidence intervals

  • Practice creating a valuable array of plots and charts to reveal hidden trends and patterns in your data

  • Differentiate between "signal" and "noise" in your data

  • Understand and leverage different distribution models, and how each applies in the real world

  • Form and test hypotheses – use multiple methods to define and interpret useful predictions

  • Learn about statistical inference and drawing conclusions about the population

 

Outline:

1. Data Fundamentals

Course Overview and Level Set

  • Objectives of the class

  • Expectations for the class

Understanding "real-world" data

  • Unstructured vs. structured

  • Relationships

  • Outliers

  • Data growth

Types of Data

  • Flavors of data

  • Sources of data

  • Internal vs. external data

  • Time scope of data (lagging, current, leading)

LAB: Getting started with our classroom data 

Data-related Risk

  • Common identified risks

  • Effect of process on results

  • Effect of usage on results

  • Opportunity costs and tool investment

  • Mitigating common risks

Data Quality

  • Cleansing

  • Duplicates

  • Single source of truth (SSOT)

  • Field standardization

  • Identifying sparsely populated fields

  • How to fix some common issues
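As a rough illustration of the data quality topics above, the following Python sketch standardizes fields, drops duplicates, and flags sparsely populated fields. The records, field names, and the 50% threshold are hypothetical, chosen only for this example; the course labs may use different tools and data.

```python
from collections import Counter

# Hypothetical customer records; None or "" marks a missing value.
records = [
    {"id": 1, "name": "Acme Corp ", "state": "ny", "fax": None},
    {"id": 2, "name": "acme corp", "state": "NY", "fax": None},
    {"id": 3, "name": "Globex", "state": "CA", "fax": "555-0100"},
    {"id": 4, "name": "Initech", "state": "tx", "fax": ""},
]

def standardize(record):
    """Trim whitespace and normalize case so equivalent values compare equal."""
    return {
        **record,
        "name": record["name"].strip().title(),
        "state": record["state"].strip().upper(),
    }

def drop_duplicates(rows, key_fields):
    """Keep the first row seen for each combination of key_fields."""
    seen = set()
    unique = []
    for row in rows:
        key = tuple(row[f] for f in key_fields)
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique

def fill_rates(rows):
    """Return the fraction of rows with a non-empty value for each field."""
    filled = Counter()
    for row in rows:
        for field, value in row.items():
            if value not in (None, ""):
                filled[field] += 1
    return {field: filled[field] / len(rows) for field in rows[0]}

cleaned = drop_duplicates([standardize(r) for r in records], ["name", "state"])
rates = fill_rates(cleaned)
# Assumed threshold: a field filled in fewer than half the rows counts as sparse.
sparse = [field for field, rate in rates.items() if rate < 0.5]

print(len(cleaned), "unique records; sparse fields:", sparse)
```

Standardizing before deduplicating matters here: records 1 and 2 only match once case and trailing whitespace are normalized.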

LAB: Data Quality

Relationships

  • Finding common attributes

  • 1:N, N:N, 1:1

LAB: Relationships in a dataset
 

2. Analysis Foundations

Statistical Practices: Overview

  • Comparing programs and tools

  • Words in English vs. data

  • Concepts specific to data analysis

Domains of data analysis

  • Descriptive statistics

  • Inferential statistics

  • Analytical mindset

  • Describing and solving problems

 

3. Analyzing Data

Averages in data

  • Mean

  • Median

  • Mode

  • Range

Central Tendency

  • Variance

  • Standard deviation

  • Sigma values

  • Percentiles

  • Using these concepts to estimate things
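The measures listed above can be sketched in a few lines of Python using the standard-library `statistics` module. The ticket data is hypothetical, and the "2 sigma" cutoff is only one common rule of thumb for spotting unusual values.

```python
import statistics

# Hypothetical sample: days taken to close ten support tickets.
days = [2, 3, 3, 4, 5, 5, 5, 7, 9, 17]

mean = statistics.mean(days)          # arithmetic average
median = statistics.median(days)      # middle value, less affected by the 17
mode = statistics.mode(days)          # most frequent value
value_range = max(days) - min(days)   # distance between the extremes

variance = statistics.variance(days)  # sample variance (n - 1 denominator)
std_dev = statistics.stdev(days)      # sample standard deviation ("sigma")

# Quartile cut points: the 25th, 50th, and 75th percentiles.
q1, q2, q3 = statistics.quantiles(days, n=4)

# Values more than 2 standard deviations from the mean.
unusual = [d for d in days if abs(d - mean) > 2 * std_dev]

print(f"mean={mean} median={median} mode={mode} range={value_range}")
print(f"std_dev={std_dev:.2f} quartiles=({q1}, {q2}, {q3}) unusual={unusual}")
```

Note how the single slow ticket pulls the mean above the median, which is why an estimate based on the median can be more representative for skewed data.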

LAB: Hands-On – Central Tendency

LAB: Hands-On – Linear Regression

Overview of commonly useful distributions

  • Probability distribution

  • Cumulative distribution

  • Bimodal distributions

  • Skewness of data

  • Pareto distribution

Correlation
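As a minimal sketch of correlation, and of the linear regression covered in the lab above, the following Python computes Pearson's r and a least-squares line by hand. The ad-spend and sales figures are hypothetical.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

def least_squares_line(xs, ys):
    """Slope and intercept of the line minimizing squared vertical error."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    slope = (
        sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
        / sum((x - mean_x) ** 2 for x in xs)
    )
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical data: monthly ad spend and sales, both in $1,000s.
ad_spend = [1, 2, 3, 4, 5]
sales = [2, 4, 5, 4, 5]

r = pearson_r(ad_spend, sales)
slope, intercept = least_squares_line(ad_spend, sales)
predicted_sales = slope * 6 + intercept  # extrapolated estimate for a spend of 6

print(f"r={r:.3f} slope={slope:.2f} intercept={intercept:.2f}")
```

A value of r near 1 or -1 indicates a strong linear relationship; it does not by itself show that one variable causes the other.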

LAB: Distributions

Analytical Graphics for Data

  • Categorical – bar charts

  • Continuous – histograms

  • Time series – line charts

  • Bivariate data – scatter plots

  • Distribution – box plot

 

4. Analytics & Modeling

ROI & Financial Decisions

Common uses of financial data

  • Earned Value

  • Actual Cost (AC), Budget at Completion (BAC) and Estimate at Completion (EAC)

  • Expected Monetary Value

  • Cost Performance Index (CPI) and Schedule Performance Index (SPI)
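The financial measures above follow standard earned value formulas, sketched here in Python. All project figures are hypothetical, and the EAC formula shown (BAC divided by CPI) is only one of several common variants; it assumes current cost performance continues for the rest of the project.

```python
# Hypothetical project figures, in dollars.
bac = 100_000.0  # Budget at Completion: total approved budget
pv = 50_000.0    # Planned Value: budgeted cost of work scheduled to date
ev = 40_000.0    # Earned Value: budgeted cost of work actually completed
ac = 50_000.0    # Actual Cost: money spent to date

cpi = ev / ac    # Cost Performance Index; below 1 means over budget
spi = ev / pv    # Schedule Performance Index; below 1 means behind schedule
eac = bac / cpi  # Estimate at Completion, assuming the current CPI holds

def expected_monetary_value(outcomes):
    """Sum of probability * impact over (probability, impact) pairs."""
    return sum(probability * impact for probability, impact in outcomes)

# Hypothetical risk register: a 25% chance of a $20,000 loss
# and a 50% chance of a $6,000 gain.
emv = expected_monetary_value([(0.25, -20_000.0), (0.5, 6_000.0)])

print(f"CPI={cpi:.2f} SPI={spi:.2f} EAC={eac:,.0f} EMV={emv:,.0f}")
```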

Common uses for random numbers

  • Sampling

  • Simulation

  • Monte Carlo analysis

  • Pseudo-random sequences
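The uses above can be illustrated with a small Monte Carlo sketch in Python (the course demo uses Excel). The task estimates, the triangular distribution, and the seeds are assumptions made for this example only.

```python
import random
import statistics

# Hypothetical three-task project:
# (optimistic, most likely, pessimistic) cost in $1,000s.
tasks = [(8, 10, 15), (4, 5, 9), (10, 12, 20)]

def simulate_totals(tasks, trials, seed):
    """Draw one cost per task from a triangular distribution and sum them,
    repeating the experiment `trials` times."""
    rng = random.Random(seed)  # pseudo-random: the same seed replays the same sequence
    return [
        sum(rng.triangular(low, high, likely) for low, likely, high in tasks)
        for _ in range(trials)
    ]

totals = simulate_totals(tasks, trials=10_000, seed=42)
expected_cost = statistics.mean(totals)

# Budget that covers roughly 80% of the simulated outcomes.
p80_budget = sorted(totals)[int(0.8 * len(totals))]

# Simple random sample: pick 5 of 100 hypothetical invoice numbers for audit.
audit_sample = random.Random(7).sample(range(1, 101), 5)

print(f"expected cost ~ {expected_cost:.1f}, 80% budget ~ {p80_budget:.1f}")
```

Fixing the seed makes a run repeatable, which helps when results need to be audited or compared; changing the seed shows how much the estimates vary from run to run.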

Demo / Lab – Random numbers in Excel

An introduction to Predictive Analytics

  • A discussion about patterns

  • Regression and time series for prediction

  • Machine learning basics

  • Tools for predictive analytics

Demo / Lab – Getting started with R

Understanding Clustering

  • Segmentation

  • Common algorithms

  • K-means

  • PAM (Partitioning Around Medoids)
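To make the clustering idea concrete, here is a minimal sketch of k-means on one-dimensional data in Python. The customer spend values and the deterministic starting centroids are assumptions for illustration; library implementations typically use randomized starts and handle many dimensions. PAM works similarly but uses actual data points (medoids) as cluster centers instead of means.

```python
def k_means(values, k, max_iter=100):
    """Cluster 1-D values into k groups by alternating assignment and update steps."""
    ordered = sorted(values)
    # Deterministic start (an assumption): spread centroids across the sorted data.
    step = len(ordered) / k
    centroids = [ordered[int(step * i + step / 2)] for i in range(k)]
    clusters = []
    for _ in range(max_iter):
        # Assignment step: attach each value to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda i: abs(v - centroids[i]))
            clusters[nearest].append(v)
        # Update step: move each centroid to the mean of its cluster.
        updated = [
            sum(c) / len(c) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
        if updated == centroids:  # converged: no centroid moved
            break
        centroids = updated
    return centroids, clusters

# Hypothetical monthly spend per customer, in dollars.
spend = [12, 15, 14, 90, 95, 88, 13, 92]
centroids, clusters = k_means(spend, k=2)

print("segment centers:", centroids)
print("segments:", clusters)
```

With this data the two segments separate cleanly into low and high spenders; real data rarely splits so neatly, and the choice of k is itself a judgment call.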

Fundamentals of Data Modeling

  • Architecture and analysis

  • Stages of a data model

  • Data warehousing

  • Top-down vs. Bottom-up

Understanding Data Warehousing

  • Context tables

  • Facts

  • Dimensions

  • Star vs. Snowflake Schema

 

5. Visualizing & Presenting Data

 

Goals of Visualization

  • Communication and Narrative

  • Decision enablement

  • Critical characteristics

Visualization Essentials

  • Users and stakeholders

  • Stakeholder cheat sheet

  • Common missteps

Communicating Data-Driven Knowledge

  • Alerting and trending

  • To self-serve or not

  • Formats & presentation tools

  • Design considerations