Course Outline

1. Introduction to Distributed PostgreSQL

  • Scaling challenges associated with single-node PostgreSQL
  • Overview of the Citus extension: purpose, architecture, and core components
  • Key concepts: coordinator node, worker nodes, metadata, and distribution keys

2. Cluster Architecture and Setup

  • Node types: distinguishing between coordinator and worker nodes
  • Table types: distributed, reference (replicated), and local tables
  • Installing and configuring Citus within existing PostgreSQL environments
  • Cluster discovery and node management techniques

3. Data Distribution and Sharding Strategies

  • Sharding methods: comparing hash and append strategies
  • Selecting the appropriate distribution column to optimise performance
  • Managing distributed and reference (replicated) tables effectively
  • Rebalancing shards and scaling out the cluster
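
As a rough illustration of the hash sharding covered in this module, the sketch below splits a 32-bit hash space into equal ranges, maps a distribution column value to a shard, and then places shards on workers in round-robin order. It is a simplified model, not Citus's actual implementation: the hash function (zlib.crc32), the shard count, the worker names, and the helper names shard_for and worker_for are all assumptions chosen for illustration.

```python
import zlib

SHARD_COUNT = 8                      # hypothetical shard count for this sketch
WORKERS = ["worker-1", "worker-2"]   # hypothetical worker node names


def shard_for(key: str, shard_count: int = SHARD_COUNT) -> int:
    """Map a distribution-column value to a shard index.

    The 32-bit hash space is divided into equal ranges, one per shard.
    """
    h = zlib.crc32(key.encode("utf-8"))   # unsigned 32-bit value in Python 3
    range_size = 2**32 // shard_count
    # Clamp so the top of the hash space still lands in the last shard
    # when 2**32 is not evenly divisible by shard_count.
    return min(h // range_size, shard_count - 1)


def worker_for(shard_id: int, workers=WORKERS) -> str:
    """Place shards on workers in round-robin order (an assumed policy)."""
    return workers[shard_id % len(workers)]


if __name__ == "__main__":
    for tenant in ["tenant-1", "tenant-2", "tenant-3"]:
        shard = shard_for(tenant)
        print(tenant, "-> shard", shard, "on", worker_for(shard))
```

Because the same key always hashes to the same shard, rows that share a distribution column value stay together on one worker, which is why the choice of distribution column matters for performance.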

4. Distributed Query Execution and Optimisation

  • Understanding how Citus routes and parallelises queries
  • Interpreting distributed query plans
  • Query pushdown and execution optimisation techniques

5. Consistency, Transactions and Fault Tolerance

  • Two-Phase Commit (2PC) and atomic operations
  • Strategies for handling failures in distributed transactions
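
The general idea behind Two-Phase Commit can be sketched as follows: a coordinator asks every participant to prepare, and commits only if all of them vote yes; otherwise it rolls everyone back. This is a minimal in-memory model of the protocol, not how Citus or PostgreSQL implement it, and the Participant class and two_phase_commit function are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass


@dataclass
class Participant:
    """A toy stand-in for a worker node taking part in a transaction."""
    name: str
    can_commit: bool = True   # simulates whether the prepare step succeeds
    state: str = "idle"

    def prepare(self) -> bool:
        # Phase 1: vote yes (prepared) or no (aborted).
        self.state = "prepared" if self.can_commit else "aborted"
        return self.can_commit

    def commit(self) -> None:
        self.state = "committed"

    def rollback(self) -> None:
        self.state = "aborted"


def two_phase_commit(participants) -> bool:
    """Return True if every participant committed, False if all rolled back."""
    # Phase 1: collect a vote from every participant.
    votes = [p.prepare() for p in participants]

    # Phase 2: commit only on a unanimous yes, otherwise roll back everyone.
    if all(votes):
        for p in participants:
            p.commit()
        return True
    for p in participants:
        p.rollback()
    return False


if __name__ == "__main__":
    nodes = [Participant("w1"), Participant("w2", can_commit=False)]
    print(two_phase_commit(nodes), [p.state for p in nodes])
```

The sketch leaves out the hard part that the module's failure-handling topic addresses: what happens when a participant or the coordinator crashes between the two phases.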

6. Operational Management and Use Cases

  • Monitoring tools and views specific to Citus
  • Maintenance procedures and upgrades in distributed environments

Requirements

  • Completion of Advanced Administration (High Availability & Replication) or equivalent professional experience
  • Robust knowledge of PostgreSQL configuration and performance tuning
  • Familiarity with Linux operating systems and fundamental network concepts

Audience

This course is intended for experienced database administrators, DevOps engineers, and system architects who manage production PostgreSQL environments and need the skills to scale them horizontally.

Duration: 7 hours
