Course Outline
Introduction
- Apache Arrow vs Parquet
Installing and Configuring Apache Arrow
Overview of Apache Arrow Features and Architecture
Exploring Data with Pandas and Apache Arrow
Exploring Data with Spark and Apache Arrow
Exploring Data with R and Apache Arrow
Exploring Data with MapD and Apache Arrow
Other Data Analysis Integrations
- PySpark, Parquet files on S3, Oracle tables, and Elasticsearch indices
Troubleshooting
Summary and Conclusion
Requirements
- A basic understanding of SQL
- Familiarity with Python or R
- Some familiarity with Apache Spark
Testimonials
I got answers to all my questions.
Natalia Gladii
I really benefited from the willingness of the trainer to share more.
Balaram Chandra Paul
Liked very much the interactive way of learning.
Luigi Loiacono
I enjoyed the Excel sheets provided having the exercises with examples. This meant that if Tamil was held up helping other people, I could crack on with the next parts.
Luke Pontin
Learning how to use Excel properly.
Torin Mitchell
The way the trainer made complex subjects easy to understand.
Adam Drewry
It was a very practical training, I liked the hands-on exercises.
Proximus
Detailed and comprehensive instruction given by an experienced and clearly knowledgeable expert on the subject.
Justin Roche
Tamil is a very knowledgeable and nice person; I have learned a lot from him.
Aleksandra Szubert
I liked the first session. Very intensive and quick.
Digital Jersey
I benefited from the good overview and the good balance between theory and exercises.
Proximus
I mostly liked the patience of Tamil.
Laszlo Maros
I enjoyed the dynamic interaction and the “hands-on” approach to the subject, thanks to the Virtual Machine. Very stimulating!
Philippe Job
I really benefited from the real-life practical examples.
Wioleta (Vicky) Celinska-Drozd
I benefited from the competence and knowledge of the trainer.
Jonathan Puvilland
I generally benefited from the presentation of technologies.
Continental AG / Abteilung: CF IT Finance
good interaction with the trainer, dynamic exchange of knowledge
NIIT Limited
Overall the Content was good.
Sameer Rohadia
Exercises on functions!
Wiktor Paprzycki
way of translation
Małgorzata Mieczkowska
exercises
Wiktor Paprzycki
I liked the customized, in-house file processing and data analysis.
Glycom A/S
I enjoyed that we used our own data as examples.
Glycom A/S
I really liked the exercises on time series modeling.
Teleperformance
A new tool, “R”. I found it interesting to learn that such a tool for data analysis exists.
Michael Lopez - Teleperformance
The tool was interesting and I see the use. I would like to learn more about it.
- Teleperformance
The trainer was fantastic and really knew his stuff. I learned a lot about the software I didn't know previously which will help a lot at my job!
Steve McPhail - Alberta Health Services - Information Technology
The high-level principles of Hive and HDFS.
Geert Suys - Proximus Group
The hands-on work. The mix of practice and theory.
- Proximus Group
Fulvio was able to grasp our company's business case and correlate it with the course material almost instantly.
Samuel Peeters - Proximus Group
I thought he did a great job of tailoring the experience to the audience. This class is mostly designed to cover data analysis with HIVE, but me and my co-worker are doing HIVE administration with no real data analytics responsibilities.
ian reif - Franchise Tax Board
Practical knowledge of the trainer
Szymon Skirgajłło - RST Sp. z o. o. Sp. k.
Trainer's professionalism
Michał - RST Sp. z o. o. Sp. k.
The possibility of asking questions, knowledge and experience of the teacher
Marcin - RST Sp. z o. o. Sp. k.
Appropriately selected exercises, thanks to which you could get to know a relatively wide spectrum of Elastica applications
Maks Wojewoda - RST Sp. z o. o. Sp. k.
practical
- Swiss International Air Lines Ltd.
Mr. Bless has a very friendly, pleasant personality and with his experience and enthusiasm always motivates the participants and encourages them to do some tricky exercises! He has always made sure that all participants understand the content and has answered any questions. Good exercises help you to use the theory part quickly in practice. In general, the organization and tailoring of the training to the requirements of Nobleprog participants was very professional. Many Thanks!
Timo Koch - Swiss International Air Lines Ltd.
His deep knowledge about the subject
I thought the training was very thorough and while we covered a lot of material, Martin made ample time for questions and gave good focus to each individual and their different requirements.
Jeán Thysse - Quidco
Marcin knew exactly what he was talking about and had proper hands-on, in-depth experience with the tools. He had answers to all our questions and made some really strong recommendations that we could start working towards with future projects and uses.
Conor Glasman - Quidco
Doing the exercises. I really enjoyed the practicals.
Warren Stephen - Quidco
Relaxed style. Help with the issues we were having with current setup.
- Quidco
The content was relevant and to the point.
Qiniso Mdletshe - Quidco
Trainer was very open minded about questions and tried to answer as many as possible.
- Quidco
Presentation of new tools that I didn't know before.
- Knauf Bełchatów Sp. z o.o.
I liked that we got a general overview of Elastic and learned tons of things on the first day that could be applied in our current project. I also liked that on the second day we went through the current project code in a code review, noting improvements and things to think about or take up for discussion in the project. I like that the training gave me a good base to continue delving into Elasticsearch.
Mattias Hansson - Chalmers Tekniska Högskola AB
The trainer's openness to questions and willingness to help/answer/explain.
Chalmers Tekniska Högskola AB
He is very knowledgeable and could answer all the questions
Chalmers Tekniska Högskola AB
Trainer develops training based on participant's pace
Farris Chua
The notebooks and examples were on point.
The explanation provided is clear.
The fact that he had different Excel and data sheets with exercises for us to do.
Deepakie Singh Sodhi - Queens College, CUNY
Steve was willing to answer every question and worked diligently to address any individual concerns or technical issues as they arose in the class. He also did a great job of presenting the technical details in a way that made it less intimidating to even the least tech-savvy people in the room. Personally, learning about some useful shortcuts in Excel that I didn't know about will certainly improve my overall workflow when using Excel in the future! I am so appreciative of those little details that I was exposed to during the two-day training.
Alan Gonzalez - Queens College, CUNY
R programming
Osden Jokonya - University of the Western Cape
Practical exercises
JOEL CHIGADA - University of the Western Cape
The reminder of the world of statistics :)
Export Credit insurance corporation
1. Clear theoretical explanation of concepts and alternatives to problem solving. 2. Practical examples where concepts are, and can be applied. 3. I learnt skills that I can use in my job, which will make some of my work easier 4. It will definitely bring some innovation into some of the reports I prepare for different Committees.
Sindiso Ndlovu - Export Credit insurance corporation
Related Courses
Automated Monitoring with Zabbix
14 hours. This course focuses on practical implementation and tooling. This course covers the installation, planning and configuration of Zabbix.
Databricks
14 hours. Azure Databricks is a unified data analytics platform that allows users to store and visualize vast amounts of data from different sources. It provides a collaborative environment to build, deploy, and manage data analytics workloads easily. This
Data Cleaning
7 hours. Data Cleaning or Data Cleansing refers to the process of detecting and fixing issues in a data set before analyzing it. This instructor-led, live training (online or onsite) is aimed at data scientists, data analysts, and business analysts who
Datadog Monitoring
7 hours. Datadog is a monitoring platform for cloud-based applications that provides tools for monitoring servers and databases. It helps determine performance metrics and perform event monitoring for infrastructure and cloud-based services. This
Netdata
7 hours. Netdata is an open-source infrastructure performance monitoring and troubleshooting solution that simplifies real-time data collection of system, hardware, and application metrics. Netdata helps users visualize and store data, set performance issue
Zenoss Monitoring for Administrators
21 hours. Zenoss Community Edition is an application, server, and network management platform for monitoring availability, inventory/configuration, performance, and events. It is based on the Zope application server. This instructor-led, live training
Fluentd for Log Data Unification
14 hours. This instructor-led, live training (online or onsite) is aimed at engineers who wish to set up an architecture where everything is logged. By the end of this training, participants will be able to: Install and configure Fluentd. Collect
KNIME Analytics Platform for BI
21 hours. KNIME Analytics Platform is a leading open source option for data-driven innovation, helping you discover the potential hidden in your data, mine for fresh insights, or predict new futures. With more than 1000 modules, hundreds of ready-to-run
Microsoft Power Platform Fundamentals
14 hours. Microsoft Power Platform is a platform made up of three Microsoft products: Power BI, PowerApps, and Power Automate. These products use low-code programming to help users build simple apps, create automated workflows, and generate business
Monitoring Your Resources with Munin
7 hours. Munin is an open-source monitoring tool that helps system administrators monitor resources such as servers, workstations, networks, SANs, applications, network devices, etc. It shows resource trends and provides insights into questions such as
Nagios
35 hours. The 5-day course demonstrates through hands-on practice the fundamentals of Nagios.
Nagios Core
21 hours. This course covers the installation, planning and configuration of Nagios Core. The level of this course is Intermediate.
Nagios XI Administration
21 hours. Nagios XI is enterprise server and network monitoring software. In this instructor-led, live training, participants will learn how to set up and operate Nagios XI as they step through the process of managing Linux and Windows servers in a series
Sensu: Beginner to Advanced
14 hours. Sensu is a telemetry and monitoring service for multi-cloud infrastructures at scale. Sensu is aimed at dynamic infrastructures that require a change in approach to monitoring systems that traditional monitoring systems cannot provide. This
SPSS Modeler
14 hours. IBM SPSS Modeler is software used for data mining and text analytics. It provides a set of data mining tools that can build predictive models and perform data analytic tasks. This instructor-led, live training (online or onsite) is aimed at