About the course:
Our instructor-led Prometheus and Grafana training course will allow you to explore and learn to create observaibility solutions to monitor and visualise the performance and health status of your distributed systems, apps and containerised clusters.
Practical hands-on exercises will give you the skills to put in place effective metrics, dashboards and alerts so you can ensure the performance and robustness of the platforms you support such as Docker, Kubernetes and more.
This course is available as part of a wider training program or as a customisable standalone workshop - talk to us about how we can tailor your training to fit in with your existing skills, your current tech stack, and your learning goals.
Through custom training we can show you how to install, configure and deploy Prometheus and / or Grafana to measure the performance and health of platforms including Kubernetes, Docker, Puppet, Spring Boot, Node ...the list is huge.
Learning outcomes
- Why Monitoring & Metrics are important
- Prometheus Architecture
- Obtaining & Configuring Prometheus
- Data collection methods
- Service Discovery
- Monitoring with Prometheus
- Setting up alerts
- Visualisation with Grafana
Who should attend
This course is aimed at Systems Administrators, IT Managers, SREs and DevOps / Operations engineers who want to manage infrastructure using Terraform.
Prerequisites
Delegates will ideally have some experience of using Windows or Linux with the Command Line - please let us know if you need help with this as we can organise a short workshop to get you up to speed with the essentials.
Live, instructor-led online and on-site training
We appreciate that you need flexibility to fit in with new working situations - whether you're an individual, part of a distributed team, or simply have projects and deadlines to meet.
Our remote training can take place online in a virtual classroom, with content split into modules to accommodate your scheduling challenges and meet your learning goals. Get in touch today to find out how we can help design a cost-effective, flexible training solution.
As soon as it's safe, we'll return to also offering the on-site custom training courses and programmes upon which we've built our reputation.
Introduction
- Why is Monitoring important?
- What should I be monitoring?
- Which tools will suit my needs best?
- Prometheus overview
- Grafana overview
Getting started with Prometheus
- Pre-requisites
- Installing & configuring Prometheus
Monitoring Fundamentals
- What to monitor?
- Node Exporter
- StatsD Exporter
- Graphite Exporter
- Other popular community exporters
- Push and Pull data collection
- Service Level Objectives (SLOs)
- Service Level Indicators (SLIs)
Setting up your Metrics
- Instrumenting an application
- Endpoints
- Label Naming
- Working with Time Series data
- Querying with PromQL
- Metric types:
- Counters
- Gauges
- Histograms
- Functions and Operators
- Aggregation
- Binary Operators
- Functions
- Monitoring Apps
- Collectors
- Self-destructing apps
- Platform as a Service (PaaS)
- Jobs and Instances
- Service Discovery
Alerts
- Defining Alerting rules
- Templating
- Alert notifications
- Setting up and configuring Alertmanager with Prometheus
- Grouping
- Inhibition
- Silences
- Behaviour
Visualisation with Grafana
- Installing and configuring Grafana to work with Prometheus
- Create a Dashboard
- Adding a metrics endpoint as a datasource
- Securing user permissions and authentication
- Publishing dashboards / snapshots