Dashboard Rot: Why Grafana Has 500 Unused Dashboards

Kevin Brown on Mar 2, 2025

5 min read

Library with organized and neglected sections, librarian curating books into archive and discard piles, representing dashboard hygiene and metric pruning

It’s 3 AM and the payments service is down. The on-call engineer opens Grafana, searches for “payments,” and finds 47 results. Some are personal experiments from an engineer who left two years ago. Some are labeled “Copy of Copy of Payments Dashboard.” Some show metrics from a version of the service that was deprecated in 2022. Ten minutes into the incident, they still haven’t found the authoritative payments dashboard.

This scenario plays out constantly. Grafana makes it trivially easy to create dashboards — as it should — but nobody deletes them. Over time, organizations accumulate hundreds of dashboards: unowned, outdated, or unused. Engineers waste time searching through noise, critical dashboards get buried in clutter, and stale dashboards actively mislead. That “Payments Health” dashboard showing all green? It’s pulling from a datasource that was decommissioned six months ago.

The proliferation that seemed harmless has now extended your MTTR. You need a data-driven approach to identify which dashboards to keep — and the cultural changes to make cleanup stick.

Every dashboard you create is a promise to maintain it. Dashboards without owners become misleading artifacts — they show stale data, use deprecated metrics, and waste time during incidents when accuracy matters most.

Why Dashboards Accumulate

Dashboard accumulation isn’t anyone’s fault — it’s a natural consequence of how teams work. Every dashboard enters your inventory through a legitimate path, and almost none ever leave.

New services get dashboards When a team launches a service, they create a dashboard for it. But when the service is deprecated or merged into another, the dashboard stays. Nobody thinks to delete it because nobody's tracking which dashboards map to which services.
Incidents spawn debug dashboards During an outage, someone creates "DEBUG - payment latency investigation" to correlate metrics. The incident gets resolved. The dashboard sits there forever because deleting it feels like destroying evidence. Six months later, there are forty "DEBUG-" dashboards, and nobody remembers what any of them were investigating.
Cloning is easier than customizing An engineer wants the payments dashboard but with their team's specific filters. They clone it. Now there are two dashboards. The original gets updated; the clone diverges. Next quarter, there are seven variants, all slightly different, none clearly authoritative.
Teams reorganize but dashboards don't When Platform Team A splits into Platform Infra and Platform Services, nobody updates dashboard ownership. The dashboards from the old team become orphans. People are reluctant to touch them because "Platform Team A" might still need them — even though that team no longer exists.
Nobody deletes anything Deletion feels risky. What if someone needs it? The safe choice is always to keep it. Multiply this instinct across an organization and you guarantee accumulation.

Each of these patterns has a different expected lifecycle, but they all share the same outcome: permanent residency in your Grafana instance. Understanding the intended lifespan of each type is key to building cleanup policies that make sense — you wouldn’t apply the same retention rules to a service overview dashboard and an incident debug dashboard.

Dashboard types and their lifecycle patterns.
Dashboard Type	Creation Frequency	Typical Lifespan	Deletion Rate
Service overview	Once per service	Years	< 5% ever deleted
Incident debug	Every incident	Should be days	< 1% deleted
Feature launch	Per feature	Should be weeks	< 10% deleted
Personal exploration	Frequently	Should be hours	< 5% deleted
Team clones	Per team	Forever (diverges)	Never

Dashboard types and their lifecycle patterns.

The asymmetry is stark: creating a dashboard takes minutes, deciding to delete one feels like it requires a committee.

newsletter.subscribe

Beyond the clutter, dashboards aren’t free. Every panel issues queries to your backend — Prometheus, InfluxDB, your SQL warehouse. A dashboard with 20 panels viewed by 50 people triggers 1,000 queries every refresh cycle. Grafana-managed alerts linked to dashboard panels run continuously, checking for firing conditions even when nobody’s watching. That DEBUG dashboard from last quarter? Its alert rules are still querying your metrics database every minute.

Measuring Dashboard Health

You can’t make evidence-based decisions about what to keep or delete without usage data. The first step in any hygiene program is instrumenting your Grafana instance to track who’s looking at what.

For each dashboard, you need four categories of metrics:

View metrics: Tell you whether anyone actually looks at this dashboard. Track views in the last 30 days, unique viewers (one person looking 50 times is different from 50 people looking once), and the last viewed date. A dashboard that hasn't been viewed in 90 days is a strong deletion candidate.
Edit metrics: Indicate maintenance activity. Track the last edit date, total edit count (the version number), and who made the last edit. A dashboard that was created two years ago and never edited is likely stale — either it's perfect (unlikely) or it's been abandoned.
Ownership metadata: Identifies who's responsible. Look for owner tags, team assignments, and the original creator. Dashboards without clear ownership are the hardest to make decisions about because nobody feels empowered to delete them.
Quality indicators: Flag technical problems. Count broken panels (those referencing deleted datasources), total panel count (excessive complexity suggests the dashboard should be split), and check for common naming anti-patterns like "Copy of" or "test" in the title.

Grafana Enterprise includes usage statistics out of the box. For Grafana OSS, you can track views through API audit logs, a reverse proxy that logs dashboard access, or custom middleware. The implementation details matter less than having some data to work with.

Calculating Health Scores

Raw metrics are useful but overwhelming when you have hundreds of dashboards. A composite health score (0-100) lets you quickly identify candidates for review.

A reasonable weighting:

Usage score (35%) Based on views and unique viewers in the last 30 days. Zero views scores zero; 100+ views with 10+ unique viewers scores full marks.
Freshness score (25%) Based on time since last edit. Edited in the last 30 days scores 100; over a year without edits scores 0.
Ownership score (25%) Has an assigned owner? Has a team tag? Has any tags at all? Each adds points.
Quality score (15%) Penalize broken panels heavily (they actively mislead), excessive complexity (30+ panels), and suspiciously simple dashboards (fewer than 3 panels might be incomplete).

Beyond the numeric score, flag dashboards with specific problems: “no-recent-views,” “stale,” “no-owner,” “broken-panels,” “unmaintained,” “likely-clone” (title contains “copy”), or “incident-artifact” (title contains “debug”).

Automate usage tracking from day one. Without data, hygiene decisions become political debates instead of evidence-based choices. “Nobody uses this dashboard” is much more persuasive when you can prove it with numbers.

These flags help prioritize review. A dashboard scoring 40 with “broken-panels” needs immediate attention — it’s actively misleading anyone who looks at it. A dashboard scoring 40 with “no-recent-views” can wait for the next cleanup cycle.

Making Cleanup Stick

Process alone won’t fix dashboard rot. People hoard dashboards because deletion feels risky and there’s no reward for tidiness. You need to change the incentives — and have responses ready for the inevitable objections.

"I might need it someday." Archive, don't delete. Keep the JSON export for 90 days. If you need it, you can restore it. In three years of cleanups, most organizations see fewer than 1% of archived dashboards ever requested back.
"It's not hurting anyone." It is. During incidents, engineers waste time finding the right dashboard among dozens of stale results. Document your own incident data to make this argument concrete — if your last outage included 8 minutes searching for the right dashboard, that's 8 minutes of extended downtime.
"I don't have time." The hygiene automation handles 80% of cases. Dashboard owners only need to review dashboards flagged for their attention — usually 5-10 per quarter. Most reviews take under a minute each.
"Someone else might use it." Usage data shows nobody has viewed this in 6 months. If someone needs it, they'll create their own — which they'd do anyway since this one is outdated.
"It's my personal dashboard." Great! Move it to your Personal folder. Personal dashboards are exempt from cleanup but won't clutter team search results.

The cultural shift matters more than the tooling. Celebrate deletion — a monthly “cleanest team” recognition creates positive peer pressure. Track dashboard count as a negative metric (lower is better). Make templates so easy to use that people stop clinging to old dashboards “just in case” they can’t recreate them. When people leave teams, include dashboard ownership in offboarding checklists so their dashboards get flagged for review within a week, not discovered during the next cleanup campaign.

The biggest obstacle to dashboard hygiene isn’t technical — it’s cultural. People hoard dashboards because deletion feels risky and there’s no reward for tidiness. Address the incentives, not just the process.

Getting Started

Dashboard hygiene isn’t a one-time project. It’s ongoing maintenance, like any other operational practice.

If you do nothing else: implement usage tracking and run one cleanup campaign. Those two actions will give you the data to make informed decisions and the momentum to build a sustainable hygiene practice.

Free PDF Guide

Download the Dashboard Hygiene Guide

Get the complete observability cleanup playbook for dashboard scoring, pruning decisions, and sustainable ownership workflows.

What you'll get:

Dashboard health scoring framework
Pruning decision tree guide
Ownership governance policy templates
Cleanup campaign execution plan

Free resource

Instant access

Download Now

Learn More

No credit card required.

The goal isn’t the smallest possible dashboard count. It’s ensuring that every dashboard that exists is accurate, maintained, and serves a clear purpose. When an engineer searches for “payments” during an incident, they should find 3 relevant dashboards, not 30 stale ones.

Enjoyed the read? Share it with your network.

Table of Contents

Download the Dashboard Hygiene Guide

Your Rate Limiter Is Your Biggest Outage Risk

Why Your Traces Are Unreadable: Span Design

Terraform Module Defaults That Won't Break Your Consumers

Why Your E2E Tests Are Flaky (And How to Fix Them)

How We Cut Preview Environment Costs by 60 Percent

Table of Contents

Why Dashboards Accumulate

Measuring Dashboard Health

Calculating Health Scores

Making Cleanup Stick

Getting Started

Download the Dashboard Hygiene Guide

Share this article

Your Rate Limiter Is Your Biggest Outage Risk

Why Your Traces Are Unreadable: Span Design

Terraform Module Defaults That Won't Break Your Consumers

Why Your E2E Tests Are Flaky (And How to Fix Them)

How We Cut Preview Environment Costs by 60 Percent