Course Outline
Lesson 1 - SQL basics:
- Select statements
- Join types
- Indexes
- Views
- Subqueries
- Union
- Creating tables
- Loading data
- Dumping data
- NoSQL
Lesson 2 - Data Modeling:
- Transaction-based ER systems
- Data warehousing
- Data warehouse models
- Star schema
- Snowflake schema
- Slowly changing dimensions (SCD)
- Structured and non-structured data
- Different table types and storage engines:
- Column-based
- Document-based
- In-memory
Lesson 3 - Indexes in the NoSQL/data science world
- Constraints (Primary)
- Index-based scanning
- Performance tuning
Lesson 4 - NoSQL and non-structured data
- When to use NoSQL
- Eventually consistent data
- Schema on read vs. Schema on write
Lesson 5 - SQL for data analytics
- Window functions
- Lateral joins
- Lead & Lag
Lesson 6 - HiveQL
- SQL Support
- External and Internal Tables
- Joins
- Partitions
- Correlated subqueries
- Nested queries
- When to use Hive
Lesson 7 - Redshift
- Design and structure
- Locks and shared resources
- Postgres differences
- When to use Redshift
Requirements
- An understanding of databases
- Experience with SQL is an asset
Audience
- Business analysts
- Software developers
- Database developers
Testimonials
the scope of material
Maciej Jonczyk
systematizing knowledge in the field of ML
Orange Polska
I really benefited from the willingness of the trainer to share more.
Balaram Chandra Paul
I generally benefited from the presentation of technologies.
Continental AG / Abteilung: CF IT Finance
Overall, the content was good.
Sameer Rohadia
Michael, the trainer, is very knowledgeable and skillful in the subject of Big Data and R. He is very flexible and quickly customizes the training to meet clients' needs. He is also very capable of solving technical and subject-matter problems on the go. Fantastic and professional training!
Xiaoyuan Geng - Ottawa Research and Development Center, Science Technology Branch, Agriculture and Agri-Food Canada
I really enjoyed the introduction of new packages.
Ottawa Research and Development Center, Science Technology Branch, Agriculture and Agri-Food Canada
The tutor, Mr. Michael An, interacted with the audience very well, and the instruction was clear. The tutor also went to the extent of adding more information based on requests from the students during the training.
Ottawa Research and Development Center, Science Technology Branch, Agriculture and Agri-Food Canada
The subject matter and the pace were perfect.
Tim - Ottawa Research and Development Center, Science Technology Branch, Agriculture and Agri-Food Canada
The example and training material were sufficient and made it easy to understand what you are doing.
Teboho Makenete
Richard's training style kept it interesting; the real-world examples helped to drive the concepts home.
Jamie Martin-Royle - NBrown Group
The content: I found it very interesting and think it will help me in my final year at university.
Krishan Mistry - NBrown Group
I generally liked Fernando's knowledge.
Valentin de Dianous - Informatique ProContact INC.
The broad coverage of the subjects
- Roche
Intensity, Training materials and expertise, Clarity, Excellent communication with Alessandra
Marija Hornis Dmitrovic - Marija Hornis
R programming
Osden Jokonya - University of the Western Cape
Practical exercises
JOEL CHIGADA - University of the Western Cape