Course Outline
Basic overview of R and R Studio
- R overview
- R Studio Environment Windows
- Script Editor Window
- Data Environment
- Console
- Plots/Help/Packages
Working with Data
- Introduction to vectors and matrices (data.frame)
- Different types of variables
- Numeric, Integer, factor etc
- Changing variable types
- Importing data using R Studio menu functions
- Removing variables ls() command
- Creating variables at the console prompt – single, vector, data frame
- Naming vectors and matrices
- Head and tail commands
- Introduction to dim, length and class
- Command line import (reading .csv and tab delimited .txt files)
- Attaching and detaching data (advantages vs data.frame$)
- Merging data using cbind and rbind
Exploratory Data Analysis
- Summarising data
- Summary command on both vectors and data frames
- Sub-setting data using square brackets
- summarising and creating new variables
- Table and summary commands
- Summary statistic commands
- Mean
- Median
- Standard Deviation
- Variance
- Count & frequencies
- Min & Max,
- Quartiles
- Percentiles
- Correlation
Exporting data
-
- Write table .txt
- Write to a .csv file
R Workspace
- Concept of Working Directories and Projects (menu driven and code – setwd())
Introduction to R scripts
- Creating R Scripts
- Saving scripts
- Workspace images
Concepts of packages
- Installing packages
- Loading packages into memory
Plotting data (using standard default R plot command and ggplot2 package)
- Bar Charts and Histograms
- Boxplots
- Line charts / time series
- Scatter plots
- Stem and leaf
- Mosaic
- Modifying plots
- Titles
- Legends
- Axis
- Plot Area
- Exporting a plot to a third party application
Requirements
There are no specific requirements needed to attend this course.
Testimonials
I was benefit from the good examples and opportunity to follow along.
- Environmental and Climate Change Canada
I genuinely enjoyed the hands passed exercises.
Yunfa Zhu - Environmental and Climate Change Canada
The trainer was very concern about individual understanding.
Muhammad Surajo Sanusi - Birmingham City University
Excellent presentation and it gives me confidence to build on knowledge gained.
- Birmingham City University
Background knowledge and 'provenance' of trainer.
Francis McGonigal - Birmingham City University
Resources
Hafiz Rana - Birmingham City University
Good explanations on how we do things
- Birmingham City University
I feel more confident with coding now. I've never done it before but now I understand that it's not rocket science and I can do it when necessary.
Anna Yartseva - Birmingham City University
Related Courses
Data Analysis with SQL, Python and Spotfire
14 hoursIn this instructor-led, live training, participants will learn three different approaches for accessing, analyzing and visualizing data. We start with an introduction to RDMS databases; the focus will be on accessing and querying an Oracle database
Advanced Data Analysis with TIBCO Spotfire
14 hoursTIBCO Spotfire Analyst is a web and Windows based client application for analyzing data and creating analytic dashboards. This instructor-led, live training (online or onsite) is aimed at business analysts who wish to learn advanced Spotfire
Introduction to Spotfire
14 hoursTIBCO Spotfire Analyst is a web and Windows based client application for analyzing data and creating analytic dashboards. This instructor-led, live training (online or onsite) is aimed at business analysts and data analysts who wish to learn
AI-Driven Data Analysis with TIBCO Spotfire X
14 hoursTIBCO Spotfire X is a business intelligence and augmented analytics tool that combines natural language, artificial intelligence (AI), and streaming data technologies. This instructor-led, live training (online or onsite) is aimed at business
Introduction to Data Visualization with R
28 hoursThis course is intended for data engineers, decision makers and data analysts and will lead you to create very effective plots using R studio that appeal to decision makers and help them find out hidden information and take the right
Data Visualization
28 hoursThis course is intended for engineers and decision makers working in data mining and knoweldge discovery. You will learn how to create effective plots and ways to present and represent your data in a way that will appeal to the decision makers
ELK: Elasticsearch, Logstash and Kibana for Administrators
14 hoursThis instructor-led, live training is aimed at system administrators who wish to set up an ELK stack (Elasticsearch, Logstash, Kibana). Note that a minimum of 3 delegates is needed for this course to run. The training starts with a discussion of
Kibana: Essentials
14 hoursThis training introduces Kibana to the users of Elasticsearch. Kibana is an open source analytics and visualization platform designed to work with Elasticsearch. You use Kibana to search, view, and interact with data stored in Elasticsearch
Apache Druid for Real-Time Data Analysis
21 hoursApache Druid is an open-source, column-oriented, distributed data store written in Java. It was designed to quickly ingest massive quantities of event data and execute low-latency OLAP queries on that data. Druid is commonly used in business
Introduction to Data Visualization with Tidyverse and R
7 hoursThe Tidyverse is a collection of versatile R packages for cleaning, processing, modeling, and visualizing data. Some of the packages included are: ggplot2, dplyr, tidyr, readr, purrr, and tibble. In this instructor-led, live training,
Knowledge Discovery in Databases (KDD)
21 hoursKnowledge discovery in databases (KDD) is the process of discovering useful knowledge from a collection of data. Real-life applications for this data mining technique include marketing, fraud detection, telecommunication and manufacturing. In
Monitoring with Grafana
14 hoursGrafana is a visualization and metric analytic suite. Grafana presents a visual representation of real-time data for application analytics. Grafana is most commonly used for home automation, process control, and more. The aim of the course is to
Grafana and Graphite
14 hoursGrafana is an open source visualization and metric analytic suite for visualizing real-time data of application analytics. Graphite is an open source software for monitoring and graphing computer system data. Together with Grafana and Gaphite, users
TIBCO for Developers
21 hoursTIBCO is a software suite for integrating, managing, and monitoring enterprise applications and information delivery. Two of its products, BusinessEvents (business rules engine) and ActiveMatrix Service Bus (lightweight enterprise service bus), are
TIBCO Statistica
14 hoursTIBCO Statistica is an advanced analytics system that provides data analysis, data management, data visualization, data mining, statistics, and machine learning solutions for businesses. This instructor-led, live training (online or onsite) is