Senior Site Reliability Engineer | (QNR-561)

Svitla Systems

  • Buenos Aires
  • Permanente
  • Tiempo completo
  • Hace 1 día
Svitla Systems Inc. is looking for a Senior Site Reliability Engineer for a full-time position (40 hours per week) in Argentina. Our client is a leading expert network, providing business and government professionals with opportunities to communicate with industry and subject-matter experts to answer research questions. Their customers consult with these experts over the phone, in person at conferences, teleconferences, custom events, and workshops, or may gather their primary research data through surveys, polls, or web-based data offerings. Experts are categorized into six main industry sectors: healthcare, financial and business services, consumer goods and services, energy, industrials, and basic materials; tech, media, and telecom; and legal and regulatory. Since 2003, the company has provided its customers with primary research services, helping professionals gain a comprehensive understanding of a topic before making significant investments and/or business decisions. Their multinational client list includes nine top 10 consulting firms, hundreds of hedge funds, and many of the largest private equity firms and Fortune-ranked companies.We are seeking a skilled Site Reliability Engineer (SRE) with experience managing production-level SaaS applications hosted on Azure. The perfect candidate will be adept at monitoring, analyzing, and troubleshooting application and infrastructure-related issues on time.Requirements:- Bachelor's degree in Computer Science, a related technical field, or equivalent practical experience.
- 5-7 years of software development experience in one or more programming languages.
- 3+ years of experience in designing, analyzing, and troubleshooting distributed systems.
- Strong expertise with:
- Infrastructure as Code,
- Kubernetes and CI/CD,
- Python scripting,
- Any observability tooling (e.g., Datadog, Prometheus, Grafana, New Relic).Nice to have:- Experience with event-driven architectures, particularly RabbitMQ (RMQ).
- Familiarity with DevOps practices, CI/CD pipelines, and Infrastructure as Code (IaC) principles.
- Experience with Azure DevOps (collaborative tool for software development) or similar platforms for managing builds and releases.
- Ability to thrive in a fast-paced, collaborative environment while handling multiple priorities.
- Knowledge/expertise with DatadogResponsibilities:- Design, implement, and manage highly available and scalable infrastructure on Microsoft Azure, leveraging Terraform and Python for automation.
- Build and operate Kubernetes (AKS) clusters to support containerized microservices, ensuring high reliability and performance.
- Develop and maintain Azure DevOps CI/CD pipelines to facilitate secure, consistent, and repeatable deployments.
- Proactively monitor production systems using Datadog. Triage incidents, perform root cause analysis, and implement post-incident improvements.
- Troubleshoot issues in both production and non-production environments using logs, metrics, traces, and system-level debugging to ensure system health.
- Collaborate with engineering teams to optimize system and application performance, resolving latency or capacity bottlenecks.
- Operate and support Azure-native services, including Azure Functions, SQL databases, storage accounts, and event-driven integrations (e.g., RabbitMQ).
- Define and maintain SLIs (Service Level Indicators), SLOs (Service Level Objectives), and participate in error budget practices to align system reliability with business goals.
- Enhance system observability by improving monitoring, alerting, and logging strategies, and implement automation to reduce manual intervention and operational toil.
- Collaborate cross-functionally with developers, QA, and product stakeholders to ensure application operability, resilience, and seamless deployments.
- Participate in the on-call rotation, ensuring service uptime and reliability during production incidents.We offer- US and EU projects based on advanced technologies.
- Competitive compensation based on skills and experience.
- Regular performance appraisals to support your growth.
- Flexibility in workspace, either remote, our welcoming office or local coworking.
- Bonuses for recommendations of new employees.
- Bonuses for article writing, public talks, other activities.
- 15 vacation days, 10 national holidays, sick leaves.
- Personalized learning program tailored to your interests and skill development.
- Free tech webinars and meetups organized by Svitla.
- Fun corporate online\offline celebrations and activities.
- Awesome team, friendly and supportive community!About SvitlaSvitla Systems is a global trusted IT solutions company headquartered in California, with business and development offices through out the US, Latin America, Europe, and Asia. Svitla is an outspoken advocate of workplace flexibility, best known for its well-established remote culture, individual approach to our teammate’s professional and personal growth, and family-like environment.Since 2003, Svitla has served a wide range of clients, from innovative start-ups in California to mega-large corporations such as Ingenico, Amplience, InvoiceASAP and Global Citizen. At Svitla, developers work with clients’ teams directly, building lasting and successful partnerships, as a result of seamless integration with on-site processes.Svitla Systems’ global mission is to build a business that contributes to the well-being of our partners, personnel and their families, improves our communities, and makes a lasting difference in the world. Join us!If you are interested in our vacancy, just click "Apply".
We will be happy to see you in our friendly team :)#J-18808-Ljbffr

Kit Empleo

Empleos similares

  • Site Reliability Engineer - (CUF544)

    Careers at SunDevs

    • Buenos Aires
    **Descripción del puesto**: Como Site Reliability Engineer en SunDevs, colaborarás con otros ingenieros de software senior y Platform Engineers para diseñar y desarrollar sistemas…
    • Hace 8 horas
  • RM-081] Senior Network Engineer

    Netser Group

    • Buenos Aires
    En Netser Group estamos en la búsqueda de un Networking Engineer Sr. con conocimientos en redes. Tendrá a su cargo la configuración y el soporte de las redes WAN/LAN y proyectos …
    • Hace 8 horas
  • AL528 - Site Reliability Engineer Sr Sre

    Ripio

    • Buenos Aires
    ¡Hola, futurx ripionauta! Si hay una palabra que nos define es **ACCESO**: nuestra misión es ser la puerta de acceso al mundo cripto. Cultivamos el trabajo en equipo, la tolera…
    • Hace 8 horas