As a Site Reliability Engineer you’ll play a pivotal role in maintaining and enhancing our critical developer-facing tools at one of the biggest companies in the world. We’re seeking a candidate with expertise in Kubernetes, Go, and operations/observability technologies.#LI-DNI#LI-VC5ResponsibilitiesDevelop, monitor, and maintain observability tooling on Kubernetes (e.g., Prometheus, Jaeger, Grafana/Plutono)Develop (Golang) and collaborate closely with other development team, including onsite engagementsProvide occasional third-level support for internal toolsUtilize and create Grafana/Plutono/Prometheus dashboards and queriesAdminister and leverage log aggregation toolingOperate and monitor Kubernetes workloads, adhering to best practicesImplement and manage end-user monitoring toolsUpdate workflows on GitHub ActionsUse and update Terraform modulesEnhance operational efficiency and productivity for 50k engineersRequirements

As a Site Reliability Engineer you’ll play a pivotal role in maintaining and enhancing our critical developer-facing tools at one of the biggest companies in the world. We’re seeking a candidate with expertise in Kubernetes, Go, and operations/observability technologies.#LI-DNI#LI-VC5Responsibilities

Want more jobs like this?GetjobsinSofia, Bulgariadelivered to your inbox every week.

Want more jobs like this?

GetjobsinSofia, Bulgariadelivered to your inbox every week.

Get Jobs

3 years of experience in a similar role and knowledge of Kubernetes and GolangHands-on experience with operations and observability toolingKnowledge of creating and managing dashboards and queries in Grafana/Plutono/PrometheusExperience with log aggregation tools like Splunk, Open Telemetry, fluentbit, and ELK StackProficiency in administering and operating Kubernetes workloadsExperience with end-user monitoring tools (e.g., Dynatrace RUM)Familiarity with Sentry (sentry.io) for error managementExpertise in developing Helm charts and Helm chart librariesExperience updating workflows on GitHub ActionsExperience using and updating Terraform modulesVery good proficiency in English (written and spoken)Willingness to work in a hybrid setup (home office and office in Sofia)We offerOpportunity to Engineer your Future and to drive the world’s digital transformation with top industry clientsPersonal development program that will allow you to be valued for your strengthsWide range of professional trainings and workshopsBeing part of a collaborative, fast-growing, and innovative design teamEstablished and accelerated growth toward different career paths, competencies, and rolesBroad projects variety and possible mobility between projects over the timeCollaboration in a multicultural environment and exchange of best practices with colleagues around the worldVaried social benefits, Sports, Transportation and Health programsWork-life balance and flexible schedule, team buildings and sport opportunitiesModern office/collaboration spaces (incl. new Infinity Tower business center, Sofia)Hybrid By Design - we provide you with the best productivity options from the 2 worlds. Meet, socialize and enjoy F2F time with your colleagues, while working from the modern EPAM’s office for a few days per week and benefit from the EPAM’s virtual working environment - making you able to be productive and work from remote for the rest of the week