Get in Touch

Course Outline

Day One: Language Fundamentals

  • Course Introduction
  • Overview of Data Science
    • Defining Data Science
    • The Data Science Workflow.
  • Introduction to the R Language
  • Variables and Data Types
  • Control Structures (Loops and Conditionals)
  • R Scalars, Vectors, and Matrices
    • Creating R Vectors
    • Matrices
  • String and Text Manipulation
    • Character data types
    • File Input and Output
  • Lists
  • Functions
    • Function Basics
    • Closures
    • lapply and sapply functions
  • DataFrames
  • Practical Labs for all sections

Day Two: Intermediate R Programming

  • DataFrames and File I/O
  • Importing data from files
  • Data Preparation
  • Built-in Datasets
  • Visualisation
    • Graphics Package
    • plot() / barplot() / hist() / boxplot() / scatter plot
    • Heat Maps
    • ggplot2 package (qplot(), ggplot())
  • Data Exploration with Dplyr
  • Practical Labs for all sections

Day Three: Advanced Programming With R

  • Statistical Modelling in R
    • Statistical Functions
    • Handling Missing Values (NA)
    • Probability Distributions (Binomial, Poisson, Normal)
  • Regression Analysis
    • Introduction to Linear Regression
  • Recommendation Systems
  • Text Processing (tm package and Word Clouds)
  • Clustering
    • Introduction to Clustering
    • K-Means
  • Classification
    • Introduction to Classification
    • Naive Bayes
    • Decision Trees
    • Model training using the caret package
    • Evaluating Algorithms
  • R and Big Data
    • Connecting R to databases
    • Big Data Ecosystem
  • Practical Labs for all sections

Requirements

  • A foundational background in programming is preferred

Setup

  • A modern laptop
  • The latest version of R Studio and the R environment installed
 21 Hours

Number of participants


Price per participant

Testimonials (7)

Provisional Upcoming Courses (Require 5+ participants)

Related Categories