EXO: End-to-End Local AI Cluster Deployment Training Course

EXO is an open-source framework that connects Apple Silicon devices into a distributed AI cluster, enabling local inference of frontier models too large to fit on a single device.

This instructor-led, live training (online or onsite) is aimed at system administrators and DevOps engineers who wish to deploy, configure, and manage EXO clusters for private LLM inference across multiple Apple Silicon or Linux nodes.

By the end of this training, participants will be able to:

Install and configure EXO on macOS and Linux nodes.
Enable automatic device discovery and build multi-node clusters.
Enable and verify RDMA over Thunderbolt 5 for ultra-low-latency inter-device communication.
Deploy frontier models (DeepSeek, Qwen, Llama) across clustered devices.
Monitor cluster health and troubleshoot common deployment issues.

Format of the Course

Interactive lecture and discussion.
Lots of exercises and practice.
Hands-on implementation in a live-lab environment.

Course Customization Options

To request customized training for this course, please contact us.

This course is available as onsite live training in the United Arab Emirates or as online live training.

Course Outline

Introduction to EXO and Local AI Clustering

Overview of the EXO framework and the exo-explore ecosystem
Comparing centralized cloud inference vs distributed local inference
Architecture: libp2p device discovery, MLX backend, dashboard, and API layers
Hardware requirements: Apple Silicon (M3 Ultra, M4 Pro/Max), Thunderbolt 5, shared storage

Installing EXO on macOS

Setting up Xcode, the Metal toolchain, and macOS prerequisites
Installing uv, Node.js, and the Rust nightly toolchain
Installing the pinned macmon fork for Apple Silicon monitoring
Cloning the repository and building the dashboard with npm
Running EXO from source and verifying the localhost:52415 dashboard
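
The last verification step above can be sketched as a small reachability check. This is only an illustration, assuming the dashboard answers plain HTTP on localhost:52415 as the outline states; the helper name dashboard_is_up is hypothetical and not part of EXO.

```python
import urllib.error
import urllib.request


def dashboard_is_up(url: str = "http://localhost:52415", timeout: float = 3.0) -> bool:
    """Return True if anything answers HTTP at `url`, False otherwise.

    Hypothetical helper for illustration; the URL default is taken from the
    course outline, everything else is an assumption.
    """
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server replied, just not with a 2xx status, so it is reachable.
        return True
    except OSError:
        # Covers URLError, refused connections, DNS failures, and timeouts.
        return False


if __name__ == "__main__":
    print("dashboard reachable:", dashboard_is_up())
```

A check like this only confirms that a process is listening; it says nothing about whether the cluster behind the dashboard is healthy.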

Installing EXO on Linux

Installing dependencies via apt or Homebrew on Linux
Configuring uv, Node.js 18+, and Rust nightly
Building the dashboard and running EXO in CPU-only mode
Directory layout: XDG Base Directory paths for config, data, cache, and logs
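
As a rough illustration of the directory layout topic, the sketch below resolves per-application directories according to the XDG Base Directory defaults. The "exo" subdirectory name and the idea that logs sit under the state directory are assumptions made for the example; the actual locations EXO uses should be confirmed against its documentation.

```python
import os
from pathlib import Path


def xdg_dirs(app: str = "exo", env=None) -> dict:
    """Resolve XDG base directories for `app` (hypothetical helper).

    `env` defaults to os.environ; passing a dict makes the function testable.
    """
    env = os.environ if env is None else env
    home = Path(env.get("HOME", str(Path.home())))
    defaults = {
        "config": ("XDG_CONFIG_HOME", home / ".config"),
        "data": ("XDG_DATA_HOME", home / ".local" / "share"),
        "cache": ("XDG_CACHE_HOME", home / ".cache"),
        # Assumption: logs live under the state directory, as the spec suggests.
        "state": ("XDG_STATE_HOME", home / ".local" / "state"),
    }
    resolved = {}
    for kind, (var, fallback) in defaults.items():
        value = env.get(var, "")
        # The spec says empty or relative values are ignored in favor of the default.
        base = Path(value) if value and os.path.isabs(value) else fallback
        resolved[kind] = base / app
    return resolved


if __name__ == "__main__":
    for kind, path in xdg_dirs().items():
        print(f"{kind:>6}: {path}")
```

Setting a variable such as XDG_CACHE_HOME before launch is the usual way to move a large cache onto another volume without touching the other directories.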

Automatic Device Discovery and Cluster Formation

Understanding libp2p-based auto-discovery across local networks
Configuring custom namespaces with EXO_LIBP2P_NAMESPACE for cluster isolation
Verifying node membership in the dashboard cluster view
Handling discovery failures and network segmentation issues
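
The namespace topic above might be sketched as follows. Only the variable name EXO_LIBP2P_NAMESPACE comes from the outline; the helper cluster_env and the namespace value "lab-a" are hypothetical, and the assumption is that nodes only join peers sharing the same value.

```python
import os


def cluster_env(namespace: str, base_env=None) -> dict:
    """Return a copy of the environment with EXO_LIBP2P_NAMESPACE set.

    Hypothetical helper; `base_env` defaults to the current process environment.
    """
    if not namespace.strip():
        raise ValueError("namespace must be a non-empty string")
    env = dict(os.environ if base_env is None else base_env)
    env["EXO_LIBP2P_NAMESPACE"] = namespace
    return env


if __name__ == "__main__":
    env = cluster_env("lab-a")
    # Pass `env` to subprocess.run([...], env=env) together with whatever
    # command starts EXO on your nodes; the launch command is deliberately
    # left out because it depends on the installation.
    print(env["EXO_LIBP2P_NAMESPACE"])
```

Building the environment as a copy keeps the calling shell untouched, which helps when two isolated clusters are started from the same admin host.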

Enabling RDMA over Thunderbolt 5

RDMA architecture and the 99 percent latency reduction claim
Enabling RDMA in macOS Recovery mode with rdma_ctl
Cable requirements and port topology constraints on Mac Studio
Matching macOS versions across all cluster nodes
Troubleshooting RDMA discovery and DHCP configuration

Deploying Frontier Models

Using the dashboard to load and shard DeepSeek v3.1, Qwen3-235B, and Llama family models
Previewing instance placements with the /instance/previews API endpoint
Creating model instances with pipeline or tensor-parallel sharding
Configuring custom model cards from the Hugging Face Hub
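
A minimal sketch of querying the /instance/previews endpoint mentioned above. The endpoint path and port come from the outline; the model_id query parameter, the JSON response, and the example model name are assumptions to verify against the EXO API documentation.

```python
import json
import urllib.parse
import urllib.request


def previews_url(model_id: str, base: str = "http://localhost:52415") -> str:
    """Build the preview URL; `model_id` as a query parameter is an assumption."""
    query = urllib.parse.urlencode({"model_id": model_id})
    return f"{base.rstrip('/')}/instance/previews?{query}"


def fetch_previews(model_id: str, base: str = "http://localhost:52415",
                   timeout: float = 10.0):
    """Fetch and decode the response, assuming the endpoint returns JSON."""
    with urllib.request.urlopen(previews_url(model_id, base), timeout=timeout) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # "llama-3.2-1b" is a placeholder model identifier used for illustration.
    print(previews_url("llama-3.2-1b"))
```

Previewing placements before creating an instance gives a chance to confirm that the chosen sharding fits the available memory on each node.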

Monitoring and Troubleshooting

Reading EXO logs and understanding distributed tracing
Interpreting cluster health in the dashboard cluster view
Diagnosing worker node failures and reconnection behavior
Using EXO_TRACING_ENABLED for performance bottleneck analysis

Cluster Maintenance and Updates

Updating EXO binaries and rebuilding the dashboard
Migrating model caches and managing pre-downloaded models over NFS
Gracefully removing nodes and rebalancing workloads

Requirements

An understanding of networking fundamentals (IP, subnetting, firewalls)
Experience with macOS or Linux command-line administration
Familiarity with Python package management (pip/uv) and Node.js tooling

Audience

System administrators
DevOps engineers
AI infrastructure architects responsible for on-premise LLM deployment

21 Hours

Need help picking the right course?
uae@nobleprog.com or +971 4871 6715
