Course Outline

AI Sovereignty and Local LLM Deployment

  • Risks associated with cloud LLMs: data retention policies, training on user inputs, and foreign jurisdiction implications.
  • Ollama architecture: understanding the model server, registry, and OpenAI-compatible API (see the sketch after this list).
  • Comparing Ollama with vLLM, llama.cpp, and Text Generation Inference.
  • Model licensing terms for Llama, Mistral, Qwen, and Gemma.
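
To ground the architecture bullet above: on a default install, 'ollama serve' runs the model server, which owns the local model store and exposes HTTP APIs on port 11434; the CLI and every other integration are just clients of it. A minimal sketch:

    # Start the model server (binds to 127.0.0.1:11434 by default)
    ollama serve

    # From another shell, list locally stored models via the native API
    curl http://localhost:11434/api/tags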

Installation and Hardware Configuration

  • Installing Ollama on Linux with CUDA and ROCm support.
  • CPU-only fallback options and AVX/AVX2 optimization techniques.
  • Docker deployment strategies and persistent volume mapping (see the commands after this list).
  • Multi-GPU setups and VRAM allocation strategies.
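
A minimal Docker deployment sketch, following the commands documented for the official ollama/ollama image; the named volume preserves downloaded models across container restarts, and the GPU variant assumes the NVIDIA Container Toolkit is installed on the host:

    # NVIDIA GPU deployment with a persistent named volume
    docker run -d --gpus=all -v ollama:/root/.ollama \
      -p 11434:11434 --name ollama ollama/ollama

    # CPU-only fallback: same image, minus the GPU flag
    docker run -d -v ollama:/root/.ollama \
      -p 11434:11434 --name ollama ollama/ollama

For AMD GPUs, the ollama/ollama:rocm image is used instead, with the /dev/kfd and /dev/dri devices passed through rather than --gpus.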

Model Management

  • Retrieving models from the Ollama registry: example using 'ollama pull llama3'.
  • Importing GGUF models from Hugging Face and TheBloke repositories (see the example after this list).
  • Understanding quantization levels: trade-offs between Q4_K_M, Q5_K_M, and Q8_0.
  • Model switching mechanisms and limits on concurrent model loading.
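
A sketch of the day-to-day management commands; the quantization tag and GGUF filename are illustrative placeholders, so check each model's registry page for the tags that actually exist:

    # Pull a specific quantization from the Ollama registry
    ollama pull llama3:8b-instruct-q4_K_M

    # Import a downloaded GGUF file: write a Modelfile containing
    #   FROM ./mistral-7b-instruct.Q5_K_M.gguf
    # then register it locally
    ollama create mistral-7b-local -f Modelfile

    # Inspect, run, and remove local models
    ollama list
    ollama run mistral-7b-local
    ollama rm mistral-7b-local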

Custom Modelfiles

  • Writing Modelfile syntax: using the FROM, PARAMETER, SYSTEM, and TEMPLATE directives (a sample Modelfile follows this list).
  • Tuning parameters such as temperature, top_p, and repeat_penalty.
  • Engineering system prompts for role-specific behavior.
  • Creating and publishing custom models to the local registry.
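
A sample Modelfile combining the directives above; the persona and parameter values are illustrative, and TEMPLATE is left out because the correct prompt template is model-specific (Ollama inherits it from the base model unless overridden):

    FROM llama3

    # Sampling behavior
    PARAMETER temperature 0.2
    PARAMETER top_p 0.9
    PARAMETER repeat_penalty 1.1

    # Role-specific system prompt
    SYSTEM """You are a support engineer for an internal tool.
    Answer concisely and ask for logs before speculating."""

Build and register it in the local registry with 'ollama create support-bot -f Modelfile', then test it interactively with 'ollama run support-bot'.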

API Integration

  • Using the OpenAI-compatible /v1/chat/completions endpoint (see the requests after this list).
  • Implementing streaming responses and JSON mode.
  • Integrating with LangChain, LlamaIndex, and custom applications.
  • Managing authentication and rate limiting via reverse proxy.
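
Two request sketches, assuming a local server with 'llama3' already pulled: the first streams tokens through the OpenAI-compatible endpoint, the second uses the native chat endpoint's "format": "json" option to constrain output to valid JSON:

    # Streaming via the OpenAI-compatible endpoint
    curl http://localhost:11434/v1/chat/completions \
      -H "Content-Type: application/json" \
      -d '{
            "model": "llama3",
            "messages": [{"role": "user", "content": "Why is the sky blue?"}],
            "stream": true
          }'

    # JSON mode via the native endpoint
    curl http://localhost:11434/api/chat -d '{
      "model": "llama3",
      "messages": [{"role": "user", "content": "List three colors as JSON."}],
      "format": "json",
      "stream": false
    }'

Because the /v1 surface mirrors OpenAI's, LangChain and LlamaIndex clients can usually be pointed at it simply by overriding the base URL and supplying a dummy API key.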

Performance Optimization

  • Configuring context window sizing and KV cache management (see the settings after this list).
  • Handling batch inference and parallel requests.
  • Allocating CPU threads and ensuring NUMA awareness.
  • Monitoring GPU utilization and memory pressure metrics.
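
A sketch of the main server-side tuning knobs plus a per-request context override; the values are illustrative and should be sized against available VRAM:

    # Server environment (e.g. in a systemd drop-in for ollama.service)
    export OLLAMA_NUM_PARALLEL=4         # parallel requests per loaded model
    export OLLAMA_MAX_LOADED_MODELS=2    # models resident in memory at once
    export OLLAMA_KEEP_ALIVE=10m         # how long an idle model stays loaded

    # Per-request overrides via the native API
    curl http://localhost:11434/api/generate -d '{
      "model": "llama3",
      "prompt": "Hello",
      "options": { "num_ctx": 8192, "num_thread": 8 }
    }'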

Security and Compliance

  • Establishing network isolation for model serving endpoints (see the proxy sketch after this list).
  • Implementing input filtering and output moderation pipelines.
  • Maintaining audit logs for prompts and completions.
  • Verifying model provenance and hash integrity.
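
One common isolation pattern, sketched with nginx as an assumed front end: bind the server to loopback only (OLLAMA_HOST=127.0.0.1:11434) and let the proxy handle TLS, a shared bearer token, and rate limiting; the hostname, zone name, and token below are placeholders:

    # Inside the http {} block of nginx.conf
    limit_req_zone $binary_remote_addr zone=llm:10m rate=5r/s;

    server {
        listen 443 ssl;
        server_name llm.internal.example;

        location / {
            limit_req zone=llm burst=10 nodelay;

            # Reject requests without the shared bearer token
            if ($http_authorization != "Bearer CHANGE_ME") {
                return 401;
            }
            proxy_pass http://127.0.0.1:11434;
        }
    }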

Requirements

  • Intermediate proficiency in Linux and container administration.
  • A high-level understanding of machine learning concepts and transformer models.
  • Familiarity with REST APIs and JSON data formats.

Target Audience

  • AI engineers and developers seeking to replace cloud LLM APIs with self-hosted alternatives.
  • Organizations bound by data sensitivity constraints that prohibit the use of cloud models.
  • Government and defense teams that require air-gapped language models.

Duration: 14 Hours
