Public Sector

We've had the pleasure of working with central and local government departments in the UK and overseas, including Healthcare (NHS and Foundation Trusts), Defence, Education (universities and colleges), many of the main Civil Service departments, and the Emergency Services. We have also worked with publicly owned corporations including the BBC, Bank of England and Ordnance Survey, and with regulatory bodies such as Ofgem.

We are registered on Crown Commercial Service’s (CCS) Dynamic Purchasing System (RM6219 Training and Learning) and also with numerous tender portals such as Ariba, Coupa and Delta E-Sourcing.

Graduate Training Schemes

Framework Training has a strong track record of giving technical graduates a solid introduction to the working world across many industries. We provide the opportunity to learn and gain valuable hands-on experience in a supportive, friendly and sociable training environment.

Attract & retain the brightest new starters

We know it is vital for our clients to invest in the future of their talented grads; not only to provide them with high-quality, professional training essential for their roles, but to embed them within the organisation’s culture and guide them on the right path to a successful career.

After all, your new hires could well be the next leaders and their creative ideas and unique insights are invaluable to your business.

Learning & Development

Our unique portfolio of high-quality technical courses and training programmes is industry-respected. The courses are carefully designed so that delegates can apply what they’ve learnt as soon as they are back in the workplace. Our domain experts, trainers, and support teams know our field (and all things tech) inside out, and we work hard to keep up to speed with the latest innovations.

We’re proud to develop and deliver innovative learning solutions that work in practice and make a tangible, lasting difference to your people and your business. Our training courses and programmes are human-centred, and everything we do is underpinned by our commitment to continuous improvement and learning.

Corporate & Volume Pricing

Whether you are looking to book multiple places on public scheduled courses (attended remotely or in our training centres in London) or planning private courses for a team within your organisation, we will be happy to discuss preferential pricing that makes the most of your staff education budget.

Enquire today about:

  • Training programme pricing models  

  • Multi-course voucher schemes

Custom Learning Paths

We understand that your team's training needs don't always fit into a "one size fits all" mould, and we're very happy to explore ways in which we can tailor a bespoke learning path to fit them.

Find out how we can customise everything from short overviews and intensive workshops to wider training programmes, giving you coverage of the most relevant topics based on what your staff need to excel in their roles.

Advanced Prometheus & Grafana

Scale, Alert, and Operate: Advanced Prometheus & Grafana Skills.

About the course

Building on your foundational knowledge of Prometheus and Grafana, this 2-day advanced workshop dives deeper into the capabilities needed to implement robust monitoring, alerting, and visualisation for production environments at scale. It is designed for participants who are already familiar with the concepts covered in the introductory workshop, or who have equivalent hands-on experience with Prometheus and Grafana. The focus is on mastering advanced techniques and understanding the operational considerations that matter in real-world deployments.

The workshop begins with a review of core PromQL concepts before exploring advanced querying techniques and optimising query performance with Recording Rules. You will learn how to automate target discovery using various Service Discovery methods, which is crucial for monitoring dynamic infrastructure. A significant focus is placed on comprehensive alerting, covering the definition of complex alert rules in Prometheus and the detailed configuration and management of notifications using Alertmanager, including routing, grouping, and silencing.

Participants will also enhance their Grafana skills by mastering advanced dashboarding techniques, including building dynamic dashboards using variables and templates, applying transformations, and using data linking for deeper analysis. The workshop concludes by covering essential operational aspects like sizing, data retention, backup, and an overview of high availability/scaling strategies. Key security considerations for the monitoring stack and an introduction to integrating with other observability tools (logs, traces) are also included, providing participants with the knowledge to build and maintain production-ready Prometheus and Grafana deployments.

Instructor-led online and in-house face-to-face options are available, either as part of a wider customised training programme or as a standalone workshop, on-site at your offices or at one of many flexible meeting spaces in the UK and around the world.

    • Apply advanced PromQL techniques to perform complex data analysis and troubleshooting.
    • Define and use Recording Rules to optimise query performance and simplify complex expressions.
    • Implement Service Discovery to automatically manage monitoring targets in dynamic environments.
    • Define and manage Prometheus Alerting Rules effectively for different scenarios.
    • Configure and use Alertmanager for advanced alert routing, grouping, and notification management.
    • Build advanced Grafana dashboards using variables, templates, transformations, and linking for enhanced interactivity and reusability.
    • Utilise advanced Grafana features like the Explore view and dashboard import/export.
    • Understand key operational aspects for managing Prometheus and Grafana, including sizing, retention, and backup.
    • Understand basic security considerations for a production Prometheus and Grafana monitoring stack.
    • Understand how Prometheus and Grafana fit into a wider observability strategy with logs and traces (overview).
  • This advanced 2-day workshop is designed for IT professionals, system administrators, DevOps engineers, Site Reliability Engineers (SREs), and architects who are already familiar with the fundamentals of Prometheus and Grafana (equivalent to the introductory workshop) and need to deepen their skills for production deployments, automation, alerting, and operational management. It is ideal for:

    • Professionals who have completed the Introduction to Prometheus & Grafana workshop.

    • Users who are currently working with Prometheus and Grafana but need to learn advanced querying, alerting, and configuration techniques.

    • Teams looking to implement automated service discovery and robust alerting strategies for dynamic environments.

    • Those responsible for the operational management, scaling, and security of Prometheus and Grafana in production.

  • Participants must have:

    • Prior completion of the Introduction to Prometheus & Grafana (2 Day Workshop) or equivalent hands-on experience.

    • Equivalent experience includes being comfortable with basic Prometheus installation, configuration, scraping targets, fundamental PromQL queries, basic Grafana installation, and building simple dashboards.

    • Solid familiarity with Linux command-line environments.

    • Knowledge of Docker is recommended for laboratory exercises.

  • This advanced Prometheus & Grafana course is available for private / custom delivery for your team - as an in-house face-to-face workshop at your location of choice, or as online instructor-led training via MS Teams (or your own preferred platform).

    Get in touch to find out how we can deliver tailored training which focuses on your project requirements and learning goals.

  • Advanced PromQL & Recording Rules

    • Review of PromQL Fundamentals: Quick recap of basic queries, labels, and aggregation.

    • More Advanced PromQL Patterns: Working with rate, irate, delta, increase, histograms, and joining time series.

    • Understanding Query Performance: Writing efficient PromQL queries for scale.

    • Recording Rules: Understanding the purpose of pre-calculating frequently used expressions for performance and simplicity.

    • Defining and Using Recording Rules: Configuring recording rules in Prometheus and querying the resulting new time series.

    • Hands-On Lab: Writing more complex PromQL queries, creating and verifying recording rules.
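As a rough illustration of what functions such as increase and rate do with counter data, the Python sketch below sums the growth of a counter across a window of samples and treats any drop in value as a counter reset. It is a simplification: Prometheus itself also extrapolates to the edges of the query window, so real results will differ slightly. The sample data and function names are made up for the example.

```python
from typing import Sequence, Tuple

# (unix timestamp in seconds, counter value); illustrative type alias
Sample = Tuple[float, float]


def counter_increase(samples: Sequence[Sample]) -> float:
    """Total growth of a counter across the samples, tolerating resets."""
    total = 0.0
    for (_, prev), (_, curr) in zip(samples, samples[1:]):
        if curr >= prev:
            total += curr - prev
        else:
            # A drop means the counter was reset (e.g. a process restart),
            # so everything counted since the reset is new growth.
            total += curr
    return total


def per_second_rate(samples: Sequence[Sample]) -> float:
    """Average per-second growth between the first and last sample."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    elapsed = samples[-1][0] - samples[0][0]
    if elapsed <= 0:
        raise ValueError("timestamps must be increasing")
    return counter_increase(samples) / elapsed


# Hypothetical scrape results, 15 seconds apart, with one reset at t=45.
window = [(0.0, 100.0), (15.0, 130.0), (30.0, 160.0), (45.0, 10.0), (60.0, 40.0)]
print(counter_increase(window))
print(per_second_rate(window))
```

Handling resets this way is why counters are normally queried through rate or increase rather than read directly: the raw value jumps back to zero on restart, while the computed growth stays meaningful.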

    Service Discovery

    • The Challenge of Dynamic Environments: Why manual configuration doesn't scale.

    • Overview of Service Discovery Methods: Introduction to various mechanisms Prometheus uses to automatically find monitoring targets.

    • Configuring Common Service Discovery Methods: Implementing file-based discovery, and an overview or lab on cloud/orchestration-specific discovery (e.g., Kubernetes, EC2) if applicable.

    • Relabelling: Using relabelling rules in scrape configurations to transform or filter discovered targets and their labels.

    • Hands-On Lab: Implementing file-based service discovery. Optionally, configuring discovery for a dynamic environment based on the audience's likely use case.
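File-based service discovery reads target lists from JSON or YAML files that Prometheus watches for changes. The Python sketch below shows one way such a file might be generated from an inventory. The host names, the env label, and the output file name are assumptions for illustration; port 9100 is the usual node_exporter default. The file is written to a temporary name first and then renamed, so Prometheus never reads a half-written file.

```python
import json
import os
import tempfile
from typing import Dict, List


def build_target_groups(hosts_by_env: Dict[str, List[str]], port: int) -> List[dict]:
    """Build one file_sd target group per environment."""
    groups = []
    for env, hosts in sorted(hosts_by_env.items()):
        groups.append({
            "targets": [f"{host}:{port}" for host in hosts],
            "labels": {"env": env},  # attached to every target in the group
        })
    return groups


def write_atomically(path: str, groups: List[dict]) -> None:
    """Write the JSON to a temp file in the same directory, then rename it."""
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    with os.fdopen(fd, "w") as handle:
        json.dump(groups, handle, indent=2)
    os.replace(tmp_path, path)


# Hypothetical inventory; in practice this might come from a CMDB or cloud API.
inventory = {
    "prod": ["app1.example.internal", "app2.example.internal"],
    "staging": ["app1.staging.example.internal"],
}
output_dir = tempfile.mkdtemp()
output_path = os.path.join(output_dir, "node_targets.json")
write_atomically(output_path, build_target_groups(inventory, 9100))
print(open(output_path).read())
```

A scrape job would then point a file_sd_configs entry at the generated file; the path used here is a throwaway temporary directory chosen only so the example runs anywhere.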

    Alerting with Prometheus & Alertmanager

    • Review of Basic Alerting Rules: Quick recap of defining alert conditions in Prometheus.

    • Understanding Alert States and Life Cycle.

    • Introduction to Alertmanager: Overview of its role in managing alerts.

    • Setting up and Configuring Alertmanager: Installation and detailed configuration of alertmanager.yml.

    • Alert Routing: Defining rules to send alerts to different teams or channels.

    • Alert Grouping, Inhibition, and Silences: Strategies for managing alert noise.

    • Templating Alert Notifications: Customising the format and content of messages sent by Alertmanager.

    • Hands-On Lab: Defining advanced alerting rules, setting up and configuring Alertmanager with multiple receivers, testing grouping and inhibition rules.
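The alert life cycle can be easier to reason about with a small model. The Python sketch below simulates how a single alert might move between the inactive, pending, and firing states as its rule is evaluated, given a "for" duration. It is a simplified model rather than Prometheus's implementation: it ignores options such as keep_firing_for, and the evaluation data is invented.

```python
from typing import List, Optional, Tuple


def alert_states(evaluations: List[Tuple[float, bool]], for_seconds: float) -> List[str]:
    """Return the alert state after each (timestamp, condition_true) evaluation."""
    states = []
    active_since: Optional[float] = None
    for timestamp, condition_true in evaluations:
        if not condition_true:
            # The condition cleared, so the alert resets completely.
            active_since = None
            states.append("inactive")
            continue
        if active_since is None:
            active_since = timestamp  # first evaluation where the condition held
        if timestamp - active_since >= for_seconds:
            states.append("firing")
        else:
            states.append("pending")
    return states


# Hypothetical rule evaluated every 60 seconds with a "for" of 2 minutes.
history = [(0, False), (60, True), (120, True), (180, True), (240, False), (300, True)]
print(alert_states(history, for_seconds=120))
```

The brief recovery at t=240 sends the alert back to inactive, so the condition has to hold for the full duration again before it fires. This is the behaviour that makes the "for" clause useful for filtering out short-lived spikes.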

    Advanced Grafana Dashboards

    • Review of Basic Dashboard Building: Quick recap of creating dashboards and adding panels.

    • Using Variables and Templating: Creating dynamic and reusable dashboards with template variables (e.g., for selecting jobs, instances, or environments).

    • Advanced Panel Types & Configuration: Exploring visualisations like Heatmaps, Worldmaps, and using features like thresholds and repeated panels.

    • Transformations: Applying data transformations within panels (e.g., sorting, filtering, calculations across series).

    • Annotations: Adding markers to graphs for events (e.g., deployments).

    • Data Links and Panel Links: Configuring links for drill-down and cross-referencing.

    • Importing and Exporting Dashboards: Sharing and managing dashboards as JSON.

    • Hands-On Lab: Creating a dynamic dashboard using template variables, configuring advanced panels, adding transformations and annotations, exporting a dashboard.
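Because dashboards are stored as JSON, they can also be generated programmatically. The Python sketch below builds a heavily trimmed dashboard with one template variable and one panel whose query refers to that variable. Treat it as an outline of the general shape only: real exported dashboards carry many more fields (such as uid, schemaVersion, and data source references), details vary between Grafana versions, and the metric name and titles here are placeholders.

```python
import json


def build_dashboard(title: str, metric: str) -> dict:
    """Build a minimal dashboard with a 'job' variable used in the panel query."""
    return {
        "title": title,
        "templating": {
            "list": [
                {
                    "name": "job",
                    "type": "query",
                    # Populates the drop-down with the values of the job label.
                    "query": "label_values(up, job)",
                }
            ]
        },
        "panels": [
            {
                "type": "timeseries",
                "title": "Request rate for $job",
                "gridPos": {"h": 8, "w": 24, "x": 0, "y": 0},
                "targets": [
                    {
                        # $job is replaced by the value selected in the drop-down.
                        "expr": f'sum(rate({metric}{{job="$job"}}[5m]))',
                        "refId": "A",
                    }
                ],
            }
        ],
    }


dashboard = build_dashboard("Service overview", "http_requests_total")
print(json.dumps(dashboard, indent=2))
```

Generating dashboards from code like this keeps them reviewable in version control, which complements the manual import and export workflow.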

    Operational Aspects, Security, and Beyond

    • Prometheus Sizing and Capacity Planning Basics: Estimating resource needs.

    • Data Retention Policies: Configuring how long metrics are stored.

    • Basic Troubleshooting: Identifying common issues with Prometheus and Grafana.

    • High Availability & Scaling Concepts: Overview of strategies for resilience and handling large loads (e.g., HA Prometheus, Thanos/Mimir overview).

    • Integrating with other Observability Pillars: Overview of using Grafana with other data sources like Loki (logs) and Tempo (traces) for a unified view.

    • Backup and Restoration: Basic strategies for backing up Prometheus data.

    • Prometheus and Grafana Security Best Practices: Basic steps for securing your monitoring stack (authentication, TLS).

    • Hands-On Lab: Configuring data retention, basic troubleshooting exercise, performing a simple backup/restore simulation.
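For capacity planning, the Prometheus documentation gives a rule of thumb: disk space needed is roughly retention time in seconds, multiplied by ingested samples per second, multiplied by bytes per sample, with an average of about 1 to 2 bytes per sample. The Python sketch below applies that formula. The ingestion rate is a made-up figure, and the result is a rough starting estimate rather than a guarantee.

```python
def estimate_disk_bytes(retention_days: float,
                        samples_per_second: float,
                        bytes_per_sample: float = 2.0) -> float:
    """Rule-of-thumb disk estimate: retention * ingestion rate * sample size."""
    retention_seconds = retention_days * 24 * 60 * 60
    return retention_seconds * samples_per_second * bytes_per_sample


# Hypothetical server: 15 days of retention, 100,000 samples ingested per second.
needed = estimate_disk_bytes(retention_days=15, samples_per_second=100_000)
print(f"{needed / 1e9:.1f} GB")
```

Using the upper end of the bytes-per-sample range gives a conservative figure. It is worth leaving additional headroom, since this formula covers only the stored samples.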

     

    • Prometheus Official Documentation: The comprehensive source for information on installing, configuring, and using Prometheus, including PromQL. https://prometheus.io/docs/

    • Grafana Official Documentation: The main resource for learning how to install, configure, and use Grafana to build dashboards and visualisations. https://grafana.com/docs/

    • Prometheus Community Forum: Get help, ask questions, and connect with other Prometheus users and contributors. https://community.prometheus.io/

    • Grafana Community Forum: Find answers, share knowledge, and interact with the wider Grafana user and development community. https://community.grafana.com/

Trusted by

Crown Commercial Service (CCS) Supplier, CERN, OVO Energy

Public Courses Dates and Rates

Please get in touch for pricing and availability.
