Role: grafana monitoring/alerting support
position type: part-time contract (20hrs/week)
contract duration: 6 months
work hours: est
location : 100% remote
we are looking for an experienced grafana monitoring & alerting engineer with strong expertise in prometheus, node exporters, and monitoring infrastructures. The ideal candidate will ensure the reliability, accuracy, and performance of monitoring systems across distributed environments.
key responsibilities
* install, configure, and maintain monitoring agents (node exporters, prometheus).
* perform regular health checks, upgrades, and validation of monitoring agents.
* support and troubleshoot grafana dashboard issues, including data source connectivity.
* ensure operational dashboards remain functional and up to date.
* deploy monitoring agents using automation tools.
* validate prometheus configurations, service availability, and alerting logic.
* provide detailed troubleshooting, root cause analysis, and documentation.
* maintain technical documentation and contribute to knowledge transfer.
required skills & experience
* strong linux/windows administration experience.
* hands-on understanding of prometheus and grafana setup and workflows.
* experience working with kubernetes (k8s) environments.
* strong troubleshooting, analytical, and rca (root cause analysis) skills.
* ability to resolve installation, connectivity, and configuration issues.
* experience supporting dashboards and monitoring pipelines in production.
preferred skills
* experience with automation tools for agent deployment.
* knowledge of monitoring best practices and alerting optimization.