We are not on Kubernetes, is the service discovery still relevant?

Yes, Kubernetes discovery is one option among several: the configs cover static targets and file-based discovery too. The scrape, recording-rule, and alerting patterns apply to any infrastructure Prometheus can reach.

How does it keep alerts from becoming the usual on-call noise?

Alert rules follow the USE method (utilization, saturation, errors) per resource, carry severity tiers routed differently to PagerDuty and Slack, and include runbook annotations so each page is actionable. Recording rules pre-compute expensive queries so dashboards stay fast without firing on raw noise.

Does it include Grafana dashboards or long-term metric storage?

No. The scope is Prometheus itself: prometheus.yml, scrape configs, recording rules, alert rules, cardinality control, and promtool validation. Visualization and remote long-term storage are separate concerns you add on top.

By email right after purchase: ready to run, downloaded instantly, no setup wait.

One-time or subscription?

A one-time purchase; no subscription or hidden fees. VAT (20%) is included.

As a digital product, it can’t be refunded once downloaded. That’s why we show exactly what’s inside and who it’s for, right here.

Skill DevOps & Infra →

Prometheus Configuration

Set up Prometheus for comprehensive metric collection, storage, and monitoring of…

Set up Prometheus end to end for metric collection, scraping, recording rules, and alerting across infrastructure and applications. It delivers production-ready scrape configs, service discovery for Kubernetes, pre-computed recording rules, and severity-tuned alert rules backed by the USE method and cardinality control. The goal is meaningful alerts that engineers act on, not dashboards that drown teams in noise.

$15 one-time

Add to a kit →

Prices include 20% VAT. · Forged on real agency work · one-time, no lock-in

Type Skill
Category DevOps & Infra
Delivery Email · instant
License One-time

Run preview

forgehouse, prometheus-configuration

Inside the run · no black box

See the actual work before you buy it.

Monitoring fails in two ways: it misses the incident, or it pages you for nothing. This Prometheus setup controls cardinality at the door, pre-computes expensive queries, and writes alerts humans can live with.

Stand up the server with sizing decided up front: kube-prometheus-stack via Helm or Docker Compose, retention and storage volume set on day one, not after the disk fills.
Write prometheus.yml deliberately: 15s scrape and evaluation intervals, external labels for cluster and region, static targets plus Kubernetes service discovery gated by scrape annotations.
Control cardinality at the door: relabel_configs drop high-cardinality labels like user_id and request_id before scrape, and prometheus_tsdb_head_series is watched for series explosions.
Pre-compute the expensive queries as recording rules: per-job request rates, error percentage, p95 latency and per-node USE metrics, so dashboards never recompute them live.
Write alert rules that respect humans: minimum for: 5m to filter spikes, severity tiers (critical pages, warning goes to chat, info stays on the dashboard) and a runbook link in every annotation.
Gate every change in CI: promtool check on config and rules, all configuration versioned in git, hand-editing on the server forbidden, and a reload-failure metric alerting when config does not apply.

Use cases · what happens when you plug it in

One power source. 6 lines out.

prometheus-configuration · core

core active · 6 lines

Standing up Prometheus monitoring from scratch

✓ standing up prometheus m…
Kubernetes pod and service discovery scraping

✓ kubernetes pod and service
Recording rules for expensive queries

✓ recording rules for expe…
USE-method resource alerting (CPU, memory, disk)

✓ use-method resource aler…
Severity-tiered alert routing to PagerDuty and Slack

✓ severity-tiered alert ro…
Validating config and rules before deploy

✓ validating config and ru…

Benefits · what you walk away with

Yours to keep.

Drag time forward. Watch what stays.

Forever

That's what owning means.

The rented stack

ai writing tool: subscription

expired · access lost

analytics suite: subscription

expired · access lost

design platform: subscription

expired · access lost

(nothing left)

Your forge

Faster diagnosis with USE-based, resource-specific alerts
license: perpetual
Lower memory and query cost through cardinality control
license: perpetual
Reduced on-call fatigue via actionable, tuned severities
license: perpetual
Config drift prevented with Git-managed single source of truth
license: perpetual

subscriptions expire · deeds don't

What's included · the full manifest

Everything in the box.

Pick a piece up. Watch it work.

Complete prometheus.yml with global, alerting, and scrape sections

part 01 of 06 · in the box

6 parts · one working system · ships instantly by email

Who it's for

This wasn't forged for everyone.

Not for you if you'd rather rent a tool than own one.
Not for you if you want someone else to run your stack.
Not for you if you're happy guessing.

Still here? Good.

DevOps and SRE teams building observability infrastructure that surfaces real problems without alert overload.

then this was forged for you.

Works with

Universal by design: these run in any AI. Delivered in the open Agent Skills + MCP format (native in Claude); ChatGPT, Gemini, Cursor and Copilot adapt the same files their own way.

Claude Native format
ChatGPT Adapts via open standards
Gemini Adapts via open standards
Cursor Adapts via open standards
Copilot Adapts via open standards

Questions · still in the air

Catch what's on your mind.

the air is clear. nothing between you and the forge.

catch a spark: the forge will answer

We are not on Kubernetes, is the service discovery still relevant?

Yes, Kubernetes discovery is one option among several: the configs cover static targets and file-based discovery too. The scrape, recording-rule, and alerting patterns apply to any infrastructure Prometheus can reach.
How does it keep alerts from becoming the usual on-call noise?

Alert rules follow the USE method (utilization, saturation, errors) per resource, carry severity tiers routed differently to PagerDuty and Slack, and include runbook annotations so each page is actionable. Recording rules pre-compute expensive queries so dashboards stay fast without firing on raw noise.
Does it include Grafana dashboards or long-term metric storage?

No. The scope is Prometheus itself: prometheus.yml, scrape configs, recording rules, alert rules, cardinality control, and promtool validation. Visualization and remote long-term storage are separate concerns you add on top.
How is it delivered?

By email right after purchase: ready to run, downloaded instantly, no setup wait.
One-time or subscription?

A one-time purchase; no subscription or hidden fees. VAT (20%) is included.
Can I get a refund?

As a digital product, it can’t be refunded once downloaded. That’s why we show exactly what’s inside and who it’s for, right here.

Prometheus Configuration

See the actual work before you buy it.

One power source. 6 lines out.

Yours to keep.

The rented stack

Your forge

Everything in the box.

This wasn't forged for everyone.

Works with

Catch what's on your mind.

We are not on Kubernetes, is the service discovery still relevant?

How does it keep alerts from becoming the usual on-call noise?

Does it include Grafana dashboards or long-term metric storage?

How is it delivered?

One-time or subscription?

Can I get a refund?

Related products

Bash Defensive Patterns

Bazel Build Optimization

Changelog Automation

Cost Optimization