Optimize Alerting System to Reduce Alert Fatigue

Job Overview

Budget

$60.00

Level

MidLevel

Location

United Arab Emirates

Job Posted

22 Sep, 2025

Category

DevOps

Total Proposals

0

Job Description

Overview:

We are a growing online media company with a complex infrastructure spanning multiple servers and microservices. We use a combination of Prometheus, Grafana, and an alerting tool.

The Challenge:

Our team is suffering from "alert fatigue." We receive an overwhelming number of alerts, many of which are non-critical or false positives. This makes it difficult to distinguish real emergencies from noise, causing us to miss important issues.

Problems Caused:

The constant stream of alerts is causing burnout and a lack of trust in our monitoring system. This leads to slow response times for real incidents, as critical alerts are often ignored amidst the noise.

Proposed Method:

We need a freelancer to audit our existing alerting rules. The work involves reviewing our current configurations, identifying noisy alerts (the non-critical and false-positive alerts described above), and replacing them with more actionable rules. The solution should prioritize critical alerts and suppress irrelevant ones.
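
As a rough illustration of the "identifying noisy alerts" step, the sketch below ranks alerts that fire often but are rarely acted on. It assumes alert history has already been exported into simple records; the `Firing` fields, the thresholds, and the sample alert names are all hypothetical and are not taken from our actual setup.

```python
from collections import defaultdict
from dataclasses import dataclass
from statistics import median


@dataclass
class Firing:
    """One firing of an alert. All fields are hypothetical, for illustration."""
    alert: str         # alert rule name
    duration_s: float  # seconds between firing and resolving
    acted_on: bool     # whether anyone had to take action


def noise_report(firings, min_count=3, max_acted_ratio=0.2):
    """Return (name, count, acted_ratio, median_duration) rows, noisiest first.

    An alert counts as noisy here if it fired at least `min_count` times and
    at most `max_acted_ratio` of those firings led to action. Both thresholds
    are assumptions to tune against real data.
    """
    by_alert = defaultdict(list)
    for f in firings:
        by_alert[f.alert].append(f)

    rows = []
    for name, group in by_alert.items():
        ratio = sum(f.acted_on for f in group) / len(group)
        if len(group) >= min_count and ratio <= max_acted_ratio:
            rows.append((name, len(group), ratio,
                         median(f.duration_s for f in group)))
    # Sort by firing count, descending, so the worst offenders come first.
    return sorted(rows, key=lambda row: row[1], reverse=True)


history = [
    Firing("HighCPU", 40, False),
    Firing("HighCPU", 35, False),
    Firing("HighCPU", 50, False),
    Firing("HighCPU", 45, False),
    Firing("DiskFull", 900, True),
    Firing("DiskFull", 1200, True),
    Firing("DiskFull", 600, True),
    Firing("SlowQuery", 20, False),
]

for name, count, ratio, mid in noise_report(history):
    print(f"{name}: fired {count} times, {ratio:.0%} acted on, "
          f"median duration {mid:.0f}s")
```

A short median duration on a noisy alert may suggest that it resolves on its own, in which case a longer `for:` duration on the corresponding Prometheus rule could be worth considering.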

Required Skills:

  • Expertise in Prometheus and Grafana.

  • Strong understanding of alerting best practices.

  • Experience with PromQL for advanced queries.

  • Proficiency in a scripting language like Python or Bash.

Experience Required:

At least 3 years of experience in a DevOps or SRE role with a focus on monitoring and observability.

Delivery:

A refined set of alerting rules and a brief report on the changes made.
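
One possible way to back the report with data is to compare the rule set before and after the audit. The sketch below assumes each rule set has been loaded into a plain mapping of rule name to PromQL expression; the rule names and the `cpu_usage`, `disk_free_ratio`, and `query_seconds` metrics are hypothetical examples.

```python
def rule_changes(old, new):
    """Compare two {rule_name: expression} mappings and summarize differences."""
    return {
        "removed": sorted(old.keys() - new.keys()),
        "added": sorted(new.keys() - old.keys()),
        "changed": sorted(
            name for name in old.keys() & new.keys() if old[name] != new[name]
        ),
    }


# Hypothetical rule sets before and after the audit.
old_rules = {
    "HighCPU": "cpu_usage > 0.8",
    "DiskFull": "disk_free_ratio < 0.1",
    "SlowQuery": "query_seconds > 1",
}
new_rules = {
    "HighCPU": "avg_over_time(cpu_usage[10m]) > 0.9",
    "DiskFull": "disk_free_ratio < 0.1",
    "ServiceDown": "up == 0",
}

for kind, names in rule_changes(old_rules, new_rules).items():
    print(f"{kind}: {', '.join(names) or 'none'}")
```

The resulting summary could serve as the skeleton of the report, with a one-line justification added per entry.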

Support:

We require 1 week of post-delivery support to address any immediate issues with the new rules.

Skills

  • Monitoring and logging tools and technologies (e.g., Prometheus, Grafana, ELK Stack)

Author Spotlight

Ahmed Khan

Client

Member Since
Oct 26, 2024
Total Created Jobs
10