Senior Site Reliability Engineer

EPAM Systems

  • Argentina
  • Permanente
  • Tiempo completo
  • Hace 10 horas
We are seeking an experienced Senior Site Reliability Engineer to join our team and contribute to building and maintaining reliable, scalable infrastructure.In this role, you will collaborate closely with software and operations engineers to bridge the gap between infrastructure and software. You will play a key role in ensuring system reliability, scalability, and operational excellence while working with modern technologies and tools.ResponsibilitiesCollaborate with software engineers to integrate infrastructure and software systems seamlesslyApply SRE principles and software engineering practices to build, maintain, monitor, and operate complex infrastructureLeverage infrastructure automation tools to streamline operations and improve reliabilityDesign and maintain scalable web architectures and cloud-based technologiesWrite clean, efficient code in multiple programming languages such as Golang, Python, Ruby, and ScalaTroubleshoot and resolve issues, driving them to completion in high-pressure scenariosMonitor system performance and implement solutions to ensure uptime and reliabilityRequirementsAt least 3 years of experience in building, operating, or supporting large Linux-based web application environmentsProficiency in UNIX systems administration with strong scripting skills in Python, PHP, or BashHands-on experience running Docker with orchestration tools like Nomad, Kubernetes, or Amazon ECSFamiliarity with configuration management systems such as Ansible, Chef, or Puppet (experience with Puppet preferred)Strong communication skills and the ability to collaborate effectively with distributed teamsAbility to write clean, well-documented, and comprehensible systems and scriptsPassion for continuous learning and working with new technologies and programming languagesFluent English communication skills, both written and spoken, at a B2+ level or higherNice to haveExperience with observability and application performance monitoring tools such as ELK, Prometheus, New Relic, Sentry, or LightstepProficiency in Ruby or Scala for scripting and development tasksWe offer/Benefits
  • International projects with top brands
  • Work with global teams of highly skilled, diverse peers
  • Healthcare benefits
  • Employee financial programs
  • Paid time off and sick leave
  • Upskilling, reskilling and certification courses
  • Unlimited access to the LinkedIn Learning library and 22,000+ courses
  • Global career opportunities
  • Volunteer and community involvement opportunities
  • EPAM Employee Groups
  • Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn

EPAM Systems

Empleos similares

  • Site Reliability Engineer - (CUF544)

    Careers at SunDevs

    • Buenos Aires
    **Descripción del puesto**: Como Site Reliability Engineer en SunDevs, colaborarás con otros ingenieros de software senior y Platform Engineers para diseñar y desarrollar sistemas…
    • Hace 12 horas
  • RM-081] Senior Network Engineer

    Netser Group

    • Buenos Aires
    En Netser Group estamos en la búsqueda de un Networking Engineer Sr. con conocimientos en redes. Tendrá a su cargo la configuración y el soporte de las redes WAN/LAN y proyectos …
    • Hace 13 horas
  • AL528 - Site Reliability Engineer Sr Sre

    Ripio

    • Buenos Aires
    ¡Hola, futurx ripionauta! Si hay una palabra que nos define es **ACCESO**: nuestra misión es ser la puerta de acceso al mundo cripto. Cultivamos el trabajo en equipo, la tolera…
    • Hace 12 horas