Course Outline
Greenplum Database Overview?
- What is Greenplum Database?
- Greenplum Database features
- Greenplum Database architecture
Preparing the Development Environment
- Installing and configuring Greenplum Database
Administration
- Creating users
- Creating a group
- Adding users
Database
- Creating a database
- Granting privileges
- Creating a schema
psql Command Line
- Loading and exporting data
- Executing scripts
Queries and Performance
- Analyzing tables
- Changing optimizers
- Partitioning tables
In-Database Analytics
- Setting up Apach Zepellin
- Aggregating data
- Assembling results
- Using Apache Madlib
- Performing linear regression
Summary and Conclusion
Requirements
- An understanding of RDBMS (Relation Database Management Systems)
Audience
- Administrators
Testimonials
the scope of material
Maciej Jonczyk
systematizing knowledge in the field of ML
Orange Polska
I really was benefit from the willingness of the trainer to share more.
Balaram Chandra Paul
I generally was benefit from the presentation of technologies.
Continental AG / Abteilung: CF IT Finance
Overall the Content was good.
Sameer Rohadia
Michael the trainer is very knowledgeable and skillful about the subject of Big Data and R. He is very flexible and quickly customize the training meeting clients' need. He is also very capable to solve technical and subject matter problems on the go. Fantastic and professional training!.
Xiaoyuan Geng - Ottawa Research and Development Center, Science Technology Branch, Agriculture and Agri-Food Canada
I really enjoyed the introduction of new packages.
Ottawa Research and Development Center, Science Technology Branch, Agriculture and Agri-Food Canada
The tutor, Mr. Michael An, interacted with the audience very well, the instruction was clear. The tutor also go extent to add more information based on the requests from the students during the training.
Ottawa Research and Development Center, Science Technology Branch, Agriculture and Agri-Food Canada
The subject matter and the pace were perfect.
Tim - Ottawa Research and Development Center, Science Technology Branch, Agriculture and Agri-Food Canada
The example and training material were sufficient and made it easy to understand what you are doing.
Teboho Makenete
Richard's training style kept it interesting, the real world examples used helped to drive the concepts home.
Jamie Martin-Royle - NBrown Group
The content, as I found it very interesting and think it would help me in my final year at University.
Krishan Mistry - NBrown Group
I generally liked the fernando's knowledge.
Valentin de Dianous - Informatique ProContact INC.
The broad coverage of the subjects
- Roche
Intensity, Training materials and expertise, Clarity, Excellent communication with Alessandra
Marija Hornis Dmitrovic - Marija Hornis
R programming
Osden Jokonya - University of the Western Cape
Practical exercises
JOEL CHIGADA - University of the Western Cape
Related Courses
Amazon Redshift
21 hoursAmazon Redshift is a petabyte-scale cloud-based data warehouse service in AWS. In this instructor-led, live training, participants will learn the fundamentals of Amazon Redshift. By the end of this training, participants will be able
Big Data & Database Systems Fundamentals
14 hoursThe course is part of the Data Scientist skill set (Domain: Data and Technology).
Pivotal Greenplum for Developers
21 hoursPivotal Greenplum is a Massively Parallel Processing (MPP) Data Warehouse platform based on PostgreSQL. This instructor-led, live training (online or onsite) is aimed at developers who wish to set up a multi-node Greenplum database. By the end
MemSQL
28 hoursMemSQL is an in-memory, distributed, SQL database management system for cloud and on-premises. It's a real-time data warehouse that immediately delivers insights from live and historical data. In this instructor-led, live training,
Big Data Business Intelligence for Govt. Agencies
35 hoursAdvances in technologies and the increasing amount of information are transforming how business is conducted in many industries, including government. Government data generation and digital archiving rates are on the rise due to the rapid growth of
Big Data Architect
35 hoursDay 1 - provides a high-level overview of essential Big Data topic areas. The module is divided into a series of sections, each of which is accompanied by a hands-on exercise. Day 2 - explores a range of topics that relate analysis practices and
Vespa: Serving Large-Scale Data in Real-Time
14 hoursVespa is an open-source big data processing and serving engine created by Yahoo. It is used to respond to user queries, make recommendations, and provide personalized content and advertisements in real-time. This instructor-led, live
Programming with Big Data in R
21 hoursBig Data is a term that refers to solutions destined for storing and processing large data sets. Developed by Google initially, these Big Data solutions have evolved and inspired other similar projects, many of which are available as open-source. R
Big Data Storage Solution - NoSQL
14 hoursWhen traditional storage technologies don't handle the amount of data you need to store there are hundereds of alternatives. This course try to guide the participants what are alternatives for storing and analyzing Big Data and what are theirs
A Practical Introduction to Data Analysis and Big Data
35 hoursParticipants who complete this instructor-led, live training will gain a practical, real-world understanding of Big Data and its related technologies, methodologies and tools. Participants will have the opportunity to put this knowledge into
From Data to Decision with Big Data and Predictive Analytics
21 hoursAudience If you try to make sense out of the data you have access to or want to analyse unstructured data available on the net (like Twitter, Linked in, etc...) this course is for you. It is mostly aimed at decision makers and people who need to
Data Vault: Building a Scalable Data Warehouse
28 hoursData Vault Modeling is a database modeling technique that provides long-term historical storage of data that originates from multiple sources. A data vault stores a single version of the facts, or "all the data, all the time". Its
Apache Druid for Real-Time Data Analysis
21 hoursApache Druid is an open-source, column-oriented, distributed data store written in Java. It was designed to quickly ingest massive quantities of event data and execute low-latency OLAP queries on that data. Druid is commonly used in business
Data Science for Big Data Analytics
35 hoursBig data is data sets that are so voluminous and complex that traditional data processing application software are inadequate to deal with them. Big data challenges include capturing data, data storage, data analysis, search, sharing, transfer,
Apache Kylin: From Classic OLAP to Real-Time Data Warehouse
14 hoursApache Kylin is an extreme, distributed analytics engine for big data. In this instructor-led live training, participants will learn how to use Apache Kylin to set up a real-time data warehouse. By the end of this training, participants will