Role: Grafana Monitoring/Alerting Support
Position Type: Part-Time Contract (20hrs/week)
Contract Duration: 6 months
Work Hours: EST
Location: 100% Remote
We are looking for an experienced Grafana Monitoring & Alerting Engineer with strong expertise in Prometheus, Node Exporters, and monitoring infrastructures. The idóneo candidate will ensure the reliability, accuracy, and performance of monitoring systems across distributed environments.
Key Responsibilities
- Install, configure, and maintain monitoring agents (Node Exporters, Prometheus).
- Perform regular health checks, upgrades, and validation of monitoring agents.
- Support and troubleshoot Grafana dashboard issues, including data source connectivity.
- Ensure operational dashboards remain functional and up to date.
- Deploy monitoring agents using automation tools.
- Validate Prometheus configurations, service availability, and alerting logic.
- Provide detailed troubleshooting, root cause analysis, and documentation.
- Maintain technical documentation and contribute to knowledge transfer.
Required Skills & Experience
- Strong Linux/Windows administration experience.
- Hands-on understanding of Prometheus and Grafana setup and workflows.
- Experience working with Kubernetes (k8s) environments.
- Strong troubleshooting, analytical, and RCA (Root Cause Analysis) skills.
- Ability to resolve installation, connectivity, and configuration issues.
- Experience supporting dashboards and monitoring pipelines in production.
Preferred Skills
- Experience with automation tools for agent deployment.
- Knowledge of monitoring best practices and alerting optimization.