Get in Touch

Course Outline

Module 1: Essential Python for Machine Learning Workflows

• Programme initiation and environment configuration
Align on project objectives and establish a reproducible Python ML workspace

• Python language fundamentals (accelerated pace)
Review syntax, control structures, functions, and patterns prevalent in ML codebases

• Data structures for machine learning
Utilizing lists, dictionaries, sets, and tuples for features, labels, and metadata

• Comprehensions and functional utilities
Implementing transformations through comprehensions and higher-order functions

• Object-oriented Python for ML developers
Mastering classes, methods, composition, and practical design decisions

• Dataclasses and lightweight modelling
Creating typed containers for configuration, sample data, and results

• Decorators and context managers
Implementing patterns for timing, caching, logging, and resource-safe execution

• File and path management
Handling datasets robustly and managing serialization formats

• Exceptions and defensive programming
Writing ML scripts that fail safely and transparently

• Modules, packages, and project architecture
Structuring reusable ML codebases for maintainability

• Typing and code quality standards
Applying type hints, documentation, and lint-friendly structures

Module 2: Numerical Python, SciPy, and Data Handling

• NumPy foundations for vectorised computing
Executing efficient array operations and performance-conscious coding

• Indexing, slicing, broadcasting, and shapes
Ensuring safe tensor manipulation and shape reasoning

• Linear algebra essentials with NumPy and SciPy
Performing stable matrix operations and decompositions used in ML

• Deep dive into SciPy
Exploring statistics, optimization, curve fitting, and sparse matrices

• Pandas for tabular ML data
Cleaning, joining, aggregating, and preparing datasets effectively

• Deep dive into scikit-learn
Leveraging the estimator interface, pipelines, and reproducible workflows

• Visualization essentials
Creating diagnostic plots for data exploration and model behavior analysis

Module 3: Programming Patterns for Building ML Applications

• Transitioning from notebooks to maintainable projects
Refactoring exploratory code into structured, reusable packages

• Configuration management
Managing externalized parameters and implementing startup validation

• Logging, warnings, and observability
Implementing structured logging for debuggable ML systems

• Building reusable components with OOP and composition
Designing extensible transformers and predictors

• Practical design patterns
Applying Pipeline, Factory/Registry, Strategy, and Adapter patterns

• Data validation and schema checks
Preventing silent data issues before they impact models

• Performance optimization and profiling
Identifying bottlenecks and applying effective optimization techniques

• Model I/O and inference interfaces
Ensuring safe data persistence and clean prediction interfaces

• End-to-end mini project
Constructing a production-style ML pipeline with configuration and logging

Module 4: Statistical Learning for Tabular, Text, and Image Data

• Evaluation foundations
Mastering train/validation splits, honest cross-validation, and business-aligned metrics

• Advanced tabular machine learning
Utilizing regularized GLMs, tree ensembles, and leakage-free preprocessing

• Calibration and uncertainty estimation
Applying Platt scaling, isotonic regression, bootstrap, and conformal prediction

• Classical NLP methods
Understanding tokenization trade-offs, TF-IDF, linear models, and Naive Bayes

• Topic modelling
Grasping LDA fundamentals and understanding practical limitations

• Classical computer vision techniques
Implementing HOG, PCA, and feature-based pipelines

• Error analysis
Detecting bias, label noise, and spurious correlations

• Hands-on labs
Building a leakage-proof tabular pipeline
Comparing and interpreting text baselines
Conducting structured failure analysis for classical vision baselines

Module 5: Neural Networks for Tabular, Text, and Image Data

• Mastering the training loop
Implementing clean PyTorch loops with AMP, clipping, and reproducibility measures

• Optimization and regularization techniques
Managing initialization, normalization, optimizers, and schedulers

• Mixed precision and scaling strategies
Implementing gradient accumulation and checkpointing strategies

• Neural networks for tabular data
Utilizing categorical embeddings, feature crosses, and ablation studies

• Neural networks for text data
Working with embeddings, CNNs, BiLSTMs/GRUs, and sequence handling

• Neural networks for vision data
Understanding CNN fundamentals and ResNet-style architectures

• Hands-on labs
Developing a reusable training framework
Comparing Tabular NN vs. boosting models
Experimenting with CNN augmentation and scheduling

Module 6: Advanced Neural Architectures

• Transfer learning strategies
Implementing freeze/unfreeze patterns and discriminative learning rates

• Transformer architectures for text
Understanding self-attention internals and fine-tuning approaches

• Vision backbones and dense prediction
Exploring ResNet, EfficientNet, Vision Transformers, and U-Net concepts

• Advanced architectures for tabular data
Utilizing TabTransformer, FT-Transformer, and Deep & Cross networks

• Time series considerations
Managing temporal splits and detecting covariate shift

• PEFT and efficiency techniques
Evaluating LoRA, distillation, and quantization trade-offs

• Hands-on labs
Fine-tuning pretrained text transformers
Fine-tuning pretrained vision models
Comparing Tabular transformers vs. GBDT models

Module 7: Generative AI Systems

• Fundamentals of prompting
Executing structured prompting and controlled generation techniques

• LLM foundations
Understanding tokenization, instruction tuning, and hallucination mitigation

• Retrieval-Augmented Generation (RAG)
Implementing chunking, embeddings, hybrid search, and evaluation metrics

• Fine-tuning strategies
Applying LoRA and QLoRA with strict data quality controls

• Diffusion models
Grasping latent diffusion intuition and practical adaptation methods

• Synthetic tabular data generation
Using CTGAN and addressing privacy considerations

• Hands-on labs
Building a production-style RAG mini-application
Validating structured output with schema enforcement
Optional diffusion experimentation

Module 8: AI Agents and MCP

• Agent loop design
Implementing observe, plan, act, reflect, and persist cycles

• Agent architectures
Exploring ReAct, plan-and-execute, and multi-agent coordination strategies

• Memory management
Utilizing episodic, semantic, and scratchpad memory approaches

• Tool integration and safety
Defining tool contracts, implementing sandboxing, and defending against prompt injection

• Evaluation frameworks
Using replayable traces, task suites, and regression testing

• MCP and protocol-based interoperability
Designing MCP servers with secure tool exposure mechanisms

• Hands-on labs
Building an agent from scratch
Exposing tools via an MCP-style server
Creating an evaluation harness with safety constraints

Requirements

Participants are expected to possess a practical understanding of Python programming.

This programme is designed for technical professionals with intermediate to advanced skill levels.

 56 Hours

Testimonials (2)

Upcoming Courses

Related Categories