Job Description
Site Reliability Engineer, Observability
Hybrid work model: two days in-office and remote options on remaining days. Our Technology team is the backbone of the company, constantly creating, testing, learning, and iterating to meet customer needs in a fast‑paced, ideas‑led environment. Responsibilities
Support and evolve end‑to‑end observability solutions for collecting, shipping, storing, and querying OpenTelemetry signals (metrics, logs, and traces) across infrastructure, containers, and Kubernetes environments. Administer and operate core observability platforms (Splunk, New Relic, ClickHouse, Grafana, Lightrun), including service onboarding, access management, configuration, upgrades, and ongoing platform health. Contribute to building and advancing a contemporary OpenTelemetry‑based observability ecosystem that supports multiple telemetry types at scale. Improve and standardize instrumentation practices across services, driving consistent logging, metrics, and dis...
Hybrid work model: two days in-office and remote options on remaining days. Our Technology team is the backbone of the company, constantly creating, testing, learning, and iterating to meet customer needs in a fast‑paced, ideas‑led environment. Responsibilities
Support and evolve end‑to‑end observability solutions for collecting, shipping, storing, and querying OpenTelemetry signals (metrics, logs, and traces) across infrastructure, containers, and Kubernetes environments. Administer and operate core observability platforms (Splunk, New Relic, ClickHouse, Grafana, Lightrun), including service onboarding, access management, configuration, upgrades, and ongoing platform health. Contribute to building and advancing a contemporary OpenTelemetry‑based observability ecosystem that supports multiple telemetry types at scale. Improve and standardize instrumentation practices across services, driving consistent logging, metrics, and dis...