Get in Touch

Course Outline

  1. Foundations of Big Data
    • The role of Big Data in the corporate landscape
    • Stages in developing a corporate Big Data strategy
    • The rationale behind adopting a holistic Big Data approach
    • Essential components of a Big Data platform
    • Big data storage solutions
    • Limitations of traditional technologies
    • Overview of various database types
    • The four dimensions of Big Data
  2. Business Impact of Big Data
    • Business significance of Big Data
    • Challenges associated with extracting valuable insights
    • Integrating Big Data with traditional data systems
  3. Big Data Storage Technologies
    • Overview of big data technologies
      • Data storage models
      • Hadoop
      • Hive
      • Cassandra
      • MongoDB
    • Selecting the appropriate big data technology
  4. Processing Big Data
    • Connecting to and extracting data from databases
    • Transforming and preparing data for processing
    • Utilizing Hadoop MapReduce for distributed data processing
    • Monitoring and executing Hadoop MapReduce jobs
    • Hadoop distributed file system building blocks
    • MapReduce and YARN
    • Handling streaming data with Spark
  5. Big Data Analysis Tools and Technologies
    • Programming Hadoop using Pig Latin
    • Querying big data with Hive
    • Data mining with Mahout
    • Visualization and reporting tools
  6. Implementing Big Data in Business
    • Managing and defining Big Data requirements
    • Business importance of Big Data
    • Selecting the right big data tools for specific problems

Data Warehousing Concepts

  • Understanding Data Warehousing
  • Differences between OLTP and Data Warehousing
  • Data Acquisition
  • Data Extraction
  • Data Transformation
  • Data Loading
  • Data Marts
  • Dependent vs Independent Data Marts
  • Database Design

ETL Testing Concepts:

  • Introduction
  • Software Development Life Cycle
  • Testing Methodologies
  • ETL Testing Workflow Process
  • ETL Testing Responsibilities within Data Stage

Big Data Fundamentals

  • The role of Big Data in the corporate landscape
  • Stages in developing a corporate Big Data strategy
  • The rationale behind adopting a holistic Big Data approach
  • Essential components of a Big Data platform
  • Big data storage solutions
  • Limitations of traditional technologies
  • Overview of various database types

NoSQL Databases

Hadoop

Map Reduce

Apache Spark

Requirements

Participants should possess a foundational awareness and some practical experience with storage tools, along with an understanding of how to manage large-scale data sets.

 14 Hours

Testimonials (1)

Upcoming Courses

Related Categories