Course Outline

Basic overview of R and R Studio

  • R overview
  • R Studio Environment Windows
    • Script Editor Window
    • Data Environment
    • Console
    • Plots/Help/Packages

Working with Data

  • Introduction to vectors and matrices (data.frame)
  • Different types of variables
    • Numeric, Integer, factor etc
    • Changing variable types
    • Importing data using R Studio menu functions
    • Removing variables ls() command
  • Creating variables at the console prompt – single, vector, data frame
  • Naming vectors and matrices
  • Head and tail commands
  • Introduction to dim, length and class
  • Command line import (reading .csv and tab delimited .txt files)
  • Attaching and detaching data (advantages vs data.frame$)
  • Merging data using cbind and rbind

Exploratory Data Analysis

  • Summarising data
  • Summary command on both vectors and data frames
  • Sub-setting data using square brackets
    • summarising and creating new variables
  • Table and summary commands
  • Summary statistic commands
    • Mean
    • Median
    • Standard Deviation
    • Variance
    • Count & frequencies
    • Min & Max,
    • Quartiles
    • Percentiles
    • Correlation

Exporting data

    • Write table .txt
    • Write to a .csv file

R Workspace

  • Concept of Working Directories and Projects (menu driven and code – setwd())

Introduction to R scripts

  • Creating R Scripts
  • Saving scripts
  • Workspace images

Concepts of packages

  • Installing packages
  • Loading packages into memory

Plotting data (using standard default R plot command and ggplot2 package)

  • Bar Charts and Histograms
  • Boxplots
  • Line charts / time series
  • Scatter plots
  • Stem and leaf
  • Mosaic
  • Modifying plots
    • Titles
    • Legends
    • Axis
    • Plot Area
  • Exporting a plot to a third party application

Requirements

There are no specific requirements needed to attend this course.

  7 Hours
 

Testimonials

Related Courses

Data Analysis with SQL, Python and Spotfire

 14 hours

In this instructor-led, live training, participants will learn three different approaches for accessing, analyzing and visualizing data. We start with an introduction to RDMS databases; the focus will be on accessing and querying an Oracle database

Advanced Data Analysis with TIBCO Spotfire

 14 hours

TIBCO Spotfire Analyst is a web and Windows based client application for analyzing data and creating analytic dashboards. This instructor-led, live training (online or onsite) is aimed at business analysts who wish to learn advanced Spotfire

Introduction to Spotfire

 14 hours

TIBCO Spotfire Analyst is a web and Windows based client application for analyzing data and creating analytic dashboards. This instructor-led, live training (online or onsite) is aimed at business analysts and data analysts who wish to learn

AI-Driven Data Analysis with TIBCO Spotfire X

 14 hours

TIBCO Spotfire X is a business intelligence and augmented analytics tool that combines natural language, artificial intelligence (AI), and streaming data technologies. This instructor-led, live training (online or onsite) is aimed at business

Introduction to Data Visualization with R

 28 hours

This course is intended for data engineers, decision makers and data analysts and will lead you to create very effective plots using R studio that appeal to decision makers and help them find out hidden information and take the right

Data Visualization

 28 hours

This course is intended for engineers and decision makers working in data mining and knoweldge discovery. You will learn how to create effective plots and ways to present and represent your data in a way that will appeal to the decision makers

ELK: Elasticsearch, Logstash and Kibana for Administrators

 14 hours

This instructor-led, live training is aimed at system administrators who wish to set up an ELK stack (Elasticsearch, Logstash, Kibana). Note that a minimum of 3 delegates is needed for this course to run. The training starts with a discussion of

Kibana: Essentials

 14 hours

This training introduces Kibana to the users of Elasticsearch. Kibana is an open source analytics and visualization platform designed to work with Elasticsearch. You use Kibana to search, view, and interact with data stored in Elasticsearch

Apache Druid for Real-Time Data Analysis

 21 hours

Apache Druid is an open-source, column-oriented, distributed data store written in Java. It was designed to quickly ingest massive quantities of event data and execute low-latency OLAP queries on that data. Druid is commonly used in business

Introduction to Data Visualization with Tidyverse and R

 7 hours

The Tidyverse is a collection of versatile R packages for cleaning, processing, modeling, and visualizing data. Some of the packages included are: ggplot2, dplyr, tidyr, readr, purrr, and tibble. In this instructor-led, live training,

Knowledge Discovery in Databases (KDD)

 21 hours

Knowledge discovery in databases (KDD) is the process of discovering useful knowledge from a collection of data. Real-life applications for this data mining technique include marketing, fraud detection, telecommunication and manufacturing. In

Monitoring with Grafana

 14 hours

Grafana is a visualization and metric analytic suite. Grafana presents a visual representation of real-time data for application analytics. Grafana is most commonly used for home automation, process control, and more. The aim of the course is to

Grafana and Graphite

 14 hours

Grafana is an open source visualization and metric analytic suite for visualizing real-time data of application analytics. Graphite is an open source software for monitoring and graphing computer system data. Together with Grafana and Gaphite, users

TIBCO for Developers

 21 hours

TIBCO is a software suite for integrating, managing, and monitoring enterprise applications and information delivery. Two of its products, BusinessEvents (business rules engine) and ActiveMatrix Service Bus (lightweight enterprise service bus), are

TIBCO Statistica

 14 hours

TIBCO Statistica is an advanced analytics system that provides data analysis, data management, data visualization, data mining, statistics, and machine learning solutions for businesses. This instructor-led, live training (online or onsite) is