Skip to content
Hands-On Infrastructure Monitoring with Prometheus, published by Packt
Shell Ruby Python Other
Branch: master
Clone or download
Latest commit f544f51 Jun 16, 2019
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
cache Replace downloads dir with cache Dec 29, 2018
chapter03 Replace grafana download URL Feb 26, 2019
chapter05 Bump software to the latest version May 11, 2019
chapter06 Add blackbox-exporter scrape job May 19, 2019
chapter07/provision/kubernetes Bump software to the latest version May 11, 2019
chapter08 Upgrade golang May 11, 2019
chapter09
chapter10 Bump software to the latest version May 11, 2019
chapter11 Bump software to the latest version May 11, 2019
chapter12 Bump software to the latest version May 11, 2019
chapter13
chapter14
utils Merge pull request #30 from PacktPublishing/bump_all May 19, 2019
.gitignore Replace downloads dir with cache Dec 29, 2018
LICENSE Initial commit Dec 13, 2018
Makefile Add Makefile Dec 29, 2018
README.md

README.md

Hands-On Infrastructure Monitoring with Prometheus

Hands-On Infrastructure Monitoring with Prometheus

This is the code repository for Hands-On Infrastructure Monitoring with Prometheus , published by Packt.

Implement and scale queries, dashboards, and alerting across machines and containers

What is this book about?

Prometheus is an open source monitoring system. It provides a modern time series database, a robust query language, several metric visualization possibilities, and a reliable alerting solution for traditional and cloud-native infrastructure.

This book covers the following exciting features: Grasp monitoring fundamentals and implement them using Prometheus Discover how to extract metrics from common infrastructure services Find out how to take full advantage of PromQL Design a highly available, resilient, and scalable Prometheus stack Explore the power of Kubernetes Prometheus Operator Understand concepts such as federation and cross-shard aggregation Unlock seamless global views and long-term retention in cloud-native apps with Thanos

If you feel this book is for you, get your copy today!

https://www.packtpub.com/

Instructions and Navigations

All of the code is organized into folders. For example, Chapter02.

The code will look like the following:

annotations:
     description: "Node exporter {{ .Labels.instance }} is down."
     link: "https://example.com"

Following is what you need for this book: If you’re a software developer, cloud administrator, site reliability engineer, DevOps enthusiast or system admin looking to set up a fail-safe monitoring and alerting system for sustaining infrastructure security and performance, this book is for you. Basic networking and infrastructure monitoring knowledge will help you understand the concepts covered in this book.

With the following software and hardware list you can run all code files present in the book (Chapter 1-14).

Software and Hardware List

Chapter Software required OS required
3-14 VirtualBox (6.0.4) Ubuntu 18.04 LTS / macOS 10.14.3
3-14 Vagrant (2.2.4) Ubuntu 18.04 LTS / macOS 10.14.3
3-14 Minikube (1.0.1) Ubuntu 18.04 LTS / macOS 10.14.3
3-14 kubectl (1.14.1) Ubuntu 18.04 LTS / macOS 10.14.3

We also provide a PDF file that has color images of the screenshots/diagrams used in this book. Click here to download it.

Related products

Get to Know the Author

Joel Bastos is an open source supporter and contributor, with a background in infrastructure security and automation. He is always striving for the standardization of processes, code maintainability, and code reusability. He has defined, led, and implemented critical, highly available, and fault-tolerant enterprise and web-scale infrastructures in several organizations, with Prometheus as the cornerstone. He has worked at two unicorn companies in Portugal and at one of the largest transaction-oriented gaming companies in the world. Previously, he has supported several governmental entities with projects such as the Public Key Infrastructure for the Portuguese citizen card. You can find his blogs at kintoandar and on Twitter with the handle @kintoandar.

Pedro Araújo is a site reliability and automation engineer and has defined and implemented several standards for monitoring at scale. His contributions have been fundamental in connecting development teams to infrastructure. He is highly knowledgeable about infrastructure, but his passion is in the automation and management of large-scale, highly-transactional systems. Pedro has contributed to several open source projects, such as Riemann, OpenTSDB, Sensu, Prometheus, and Thanos. You can find him on Twitter with the handle @phcrva.

Suggestions and Feedback

Click here if you have any feedback or suggestions.

You can’t perform that action at this time.