LinkedIn

DevOps Engineer

EPAM Systems Brazil, 25 applications, posted 24 days ago

Estimated salary

R$ 7k - 10k/month

Mid-level, CLT
40%

Curation score

Internal indicator from 0 to 100: salary transparency, stack, usefulness of the description and signs of listing quality. It is not a match against your CV.

Job description

Aggregated text for quick reading. Always check the original source before submitting your application.

We are looking for a DevOps Engineer to help maintain production Kubernetes-based systems for a major technology company that specializes in infrastructure supporting AI research.

This position brings together site reliability engineering, observability and SQL production support duties, with a clear focus on monitoring, metrics, dashboards and operational excellence. The right candidate will partner with established engineering and research teams to uphold system reliability, resolve production issues and steadily strengthen visibility into system health and performance across an Azure Stack environment.

 

Responsibilities

  • Design, maintain and progressively improve observability solutions, including dashboards and visual reports built with Grafana or comparable monitoring tools
  • Set up, implement and oversee metrics, SLIs, SLOs and alerting approaches to guarantee reliability and transparency across production systems
  • Deliver business-hours operational support for Kubernetes-based production environments, involving initial troubleshooting, log review and metric-based investigations
  • Assist with SQL-based systems as part of production operations, contributing to issue examination and performance diagnostics
  • Examine incidents and system behavior to pinpoint root causes, take part in post-incident reviews and suggest enhancements for monitoring and reliability practices
  • Work hand in hand with engineering, platform and research teams to raise observability standards, refine operational processes and strengthen overall system stability
  • Add to documentation, knowledge-sharing activities and ongoing improvement initiatives within the team

Requirements

  • At least 2 years of relevant hands-on professional experience
  • Demonstrated track record in Site Reliability Engineering (SRE), DevOps, Production Support or equivalent roles working with production systems
  • Practical exposure to observability and monitoring stacks including Grafana, Prometheus, Elastic Stack, Datadog or similar tools
  • Strong command of Linux systems, supported by solid troubleshooting and log analysis capabilities
  • Working experience supporting Kubernetes-based environments in production settings
  • Background in delivering SQL production support, including query troubleshooting and basic performance diagnostics
  • Confident scripting skills in Python, Bash or similar languages for automation and day-to-day operational activities
  • Capability to investigate incidents, determine underlying causes and drive continuous improvement efforts
  • Effective communication and teamwork skills for working successfully with distributed and cross-functional teams
  • Proficient English communication skills, both spoken and written, at a B2+ level or higher

Nice to have

  • Experience handling APIs and integration patterns to link services together and enable system interoperability
  • Knowledge of databases, covering administration, tuning and production-level support activities
  • Exposure to Infrastructure as Code development and maintenance for automating environment provisioning and configuration
  • Practical experience using Microsoft Azure to manage cloud resources and run production workloads

 

We offer

  • International projects with top brands
  • Work with global teams of highly skilled, diverse peers
  • Healthcare benefits
  • Employee financial programs
  • Paid time off and sick leave
  • Upskilling, reskilling and certification courses
  • Unlimited access to the LinkedIn Learning library and 22,000+ courses
  • Global career opportunities
  • Volunteer and community involvement opportunities
  • EPAM Employee Groups
  • Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn

 

EPAM is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, age, sexual orientation, gender identity or expression, disability, protected veteran status, or any other characteristic protected by applicable law.

 
