Remote LinkedIn

Site Reliability Engineer (SRE)

Jobgether Brazil 55 applications 2 days ago

Estimated salary

R$ 7k - 10k/month

Mid-level CLT
64%

Curation score

Internal indicator from 0 to 100: salary transparency, stack, useful description, and listing quality signals. It is not a match score against your CV.

Job description

Text aggregated for quick reading. Always check the original source when submitting your application.

This position is listed on behalf of a partner company, who manages all applications and next steps. Our partner is looking for a Site Reliability Engineer (SRE) based in Brazil.

This role is at the core of ensuring reliability, scalability, and performance across mission-critical systems in a highly innovative technology environment. You will be responsible for shaping and evolving observability, incident response, and automation practices that directly impact platform stability and customer experience. Acting as a bridge between development, platform, and security teams, you will help define operational excellence standards and drive a “software as operations” mindset. The environment is fast-paced, collaborative, and strongly oriented toward engineering ownership and continuous improvement. You will work on distributed systems running in Kubernetes-based infrastructures, with strong emphasis on resilience and proactive problem-solving. A key part of your mission will be reducing manual operational work through automation and AI-driven approaches (AIOps). This is a high-impact role where your work will directly improve system reliability and engineering efficiency at scale.

Accountabilities

In this role, you will be responsible for building and maintaining highly reliable systems while continuously improving operational maturity across engineering teams. You will define reliability standards, lead incident management practices, and drive automation initiatives that reduce operational toil and increase system resilience.

  • Define and track SLI, SLO, and SLA metrics, operating with error budget principles
  • Design and implement high availability, disaster recovery, and resilience strategies (RTO/RPO)
  • Build and evolve observability platforms (logs, metrics, traces, alerts, dashboards)
  • Lead incident response processes, including on-call coordination and escalation flows
  • Perform root cause analysis (RCA) and post-mortem reviews with preventive actions
  • Optimize system performance through capacity planning, tuning, and infrastructure analysis
  • Drive automation and self-healing solutions to eliminate repetitive operational tasks
  • Apply AI-driven approaches (AIOps) for anomaly detection, log analysis, and troubleshooting
  • Collaborate with development teams to improve system reliability and deployment safety
  • Ensure security, compliance, and operational best practices in production environments

Requirements

We are looking for a strong technical profile with deep infrastructure understanding, solid automation skills, and a proactive mindset focused on reliability and scalability.

  • Experience as an SRE, DevOps, or Backend/Platform Engineer in production environments
  • Strong knowledge of Kubernetes, Docker, and cloud-native architectures
  • Solid experience with observability tools (Grafana, Prometheus, ELK, Datadog, or similar)
  • Strong understanding of Linux systems, networking, HTTP, DNS, and TLS/SSL
  • Proficiency in scripting/automation using Python, Shell, or similar languages
  • Experience with distributed systems, incident management, and troubleshooting
  • Familiarity with CI/CD pipelines, infrastructure automation, and Git workflows
  • Knowledge of reliability engineering concepts (SLI, SLO, error budgets) is highly valued
  • Experience with high-availability systems and production-scale environments
  • Strong analytical thinking, autonomy, and structured problem-solving skills
  • Clear communication skills and ability to collaborate across engineering teams
  • Familiarity with AIOps, OpenTelemetry, or chaos engineering is a plus

Benefits

  • 100% remote work, with flexibility to work from anywhere in Brazil
  • Competitive compensation aligned with senior-level engineering roles
  • Health and dental care plans
  • Life insurance coverage
  • Meal and food allowances (depending on contract model)
  • Home office support and ergonomic assistance
  • Wellness and mental health support programs
  • Access to fitness and wellness platforms and partnerships
  • Learning and development programs to support career growth
  • Performance-based recognition and engagement initiatives
  • Collaborative and innovation-driven engineering culture

How Jobgether Works

We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.

We appreciate your interest and wish you the best!

Why Apply Through Jobgether?

Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, assessing responses, and identifying potential inconsistencies or verification signals in application materials based on available information. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
