T
Remoto LinkedIn

Data Engineer - Data Foundry Engineer

TRACTIAN đ—•đ—„ ‱ SĂŁo Paulo ‱ 171 candidaturas 16 dias atrĂĄs

SalĂĄrio estimado

R$ 9k - 14k/mĂȘs

Pleno CLT
58%

Score de curadoria

Indicador interno 0 a 100: transparĂȘncia salarial, stack, descrição Ăștil e sinais de qualidade do anĂșncio. NĂŁo Ă© match com o seu CV.

Descrição da vaga

Texto agregado para leitura rĂĄpida. Confira sempre a fonte original ao enviar a candidatura.

Why join us

TRACTIAN is transforming the industrial world by empowering frontline maintenance workers to achieve more. We’ve fused cutting-edge hardware with innovative software into one powerful platform, disrupting legacy systems and delivering smarter, faster solutions for our clients.

At TRACTIAN, you'll break boundaries, question convention, and collaborate with top talent to drive real change. As a part of our growth-stage startup, you’ll work alongside the founders, shaping the vision, products, and experiences that will define the future of industrial tech.

Data Science at TRACTIAN

The Data Science team at TRACTIAN focuses on extracting valuable insights from vast amounts of industrial data. Using advanced statistical methods, algorithms, and data visualization techniques, this team transforms raw data into actionable intelligence that drives decision-making across engineering, product development, and operational strategies. The team constantly works on optimizing prediction models, identifying trends, and providing data-driven solutions that directly enhance the company’s operational efficiency and the quality of its products.

What you'll do

We're looking for a Data Engineer with a strong engineering foundation and comfort with AI workflows to join our Data Foundry team. In this role, you'll be the bridge between our model training and data annotation teams, building the pipelines and infrastructure that turn raw, messy data into gold-standard datasets ready for AI consumption.

Responsibilities

  • Design and maintain robust data pipelines to ingest from a wide range of sources, including APIs, documents, websites, and raw sensor data
  • Integrate and optimize ETL/ELT processes developed by MLE colleagues, improving performance, reliability, and long-term maintainability
  • Own the full dataset lifecycle, from raw ingestion through cleaning, validation, and delivery as training-ready data
  • Define and enforce data quality standards and governance practices across the Data Foundry team
  • Build and maintain labeling pipeline infrastructure for ML applications, working closely with the annotation team
  • Participate in architectural decisions, code reviews, and technical mentorship within the team
  • Document data sources, pipeline logic, and processing decisions for reproducibility and team alignment


,Requirements

  • 3+ years of experience in data engineering
  • Degree in Computer Science, Data Engineering, Computer Engineering, Information Systems, or equivalent technical background
  • Solid understanding of the ML training lifecycle and what properties make a dataset suitable for model training
  • Familiarity with layered data architecture patterns such as Medallion Architecture (Bronze/Silver/Gold) or Data Mesh
  • Proficiency in Python, with focus on data manipulation, pipeline development, and automation
  • Workflow orchestration using code-based tools such as Temporal, Airflow, Prefect, Dagster, or equivalent
  • Distributed data processing with Spark, Databricks, or similar
  • REST and gRPC API integration
  • Strong SQL skills, both for data modeling and query optimization
  • Experience with streaming systems and event-driven pipelines (Kafka, Kinesis, or equivalent)


,Soft Skills

  • Comfortable jumping into ongoing codebases and optimizing work built by others, without needing to start from scratch
  • Technology-agnostic: you evaluate tools based on what the project needs, adopt new ones quickly, and don't get attached to a specific stack
  • At ease in fast-moving environments where priorities shift and the right answer isn't always obvious
  • Engineering-first mindset: you think in pipelines, own outcomes, and care about the quality of what you ship
  • Driven by curiosity and innovation, not by comfort with a known toolset


,Nice to Have

  • Experience making architectural decisions and contributing to the technical growth of a team, formally or informally
  • Go, for high-performance pipeline components
  • dbt for transformation layer modeling
  • Open table formats: Delta Lake, Apache Iceberg, or Hudi
  • Data quality frameworks such as Great Expectations or Soda
  • Cloud experience, preferably OCI (our current migration target). AWS, GCP, or Azure background is also valued
  • Rapid prototyping with Streamlit or similar tools. The use of LLMs and GenAI to speed up internal tooling and experimentation is actively encouraged
  • Experience with data annotation workflows or training dataset pipelines


Compensation:

  • Competitive salary and stock options
  • 30 days of paid annual leave
  • Education and courses stipend
  • Earn a trip anywhere in the world every 4 years
  • R$1.035/month for meals allowance
  • Health plan with national coverage and without coparticipation
  • Dental Insurance: we help you with dental treatment for a better quality of life.
  • Wellhub and Sports Incentive: R$300/mo extra if you practice activities


Vagas relacionadas

Seleção por stack em comum com esta oportunidade

S
LinkedIn
Match35%

Especialista P&D - Aprendizado de MĂĄquina

Samsung Brasil ‱ Greater Campinas ‱ 187 candidaturas Hoje

SalĂĄrio estimado

R$ 15k - 25k/mĂȘs

Especialista CLT

Position SummarySOBRE A SAMSUNG E O SRBRÉ missĂŁo da Samsung inspirar o mundo e moldar o futuro com ideias e tecnologias transformadoras!E vocĂȘ sabia que muitas dessas inovaçÔes sĂŁo pensadas e desenvolvidas por talentos brasileiros e impactam a vida de milhĂ”es de pessoas pelo mundo?Nosso Centro de Pe...

Ver Detalhes →
P
Remoto LinkedIn
Match64%

Desenvolvedor(a) Full Stack

Pecege ‱ Brazil ‱ 172 candidaturas Ontem

SalĂĄrio estimado

R$ 6k - 10k/mĂȘs

Pleno CLT

O Pecege é formado por pessoas que acreditam na educação como um poderoso agente de transformação social . Nossa missão é democratizar o conhecimento para promover o desenvolvimento econÎmico, social e cultural .Contamos com cada colaborador(a) (chamado(as) internamente(as) de Pecegers ) para contri...

Ver Detalhes →
Z
LinkedIn
Match49%

Desenvolvedor Full Stack Nodejs - Processo Seletivo Ativo

ZANC Assessoria Nacional de Cobrança ‱ Porto Alegre, Rio Grande Do Sul, Brazil ‱ 25 candidaturas Ontem

SalĂĄrio estimado

R$ 8k - 11k/mĂȘs

Pleno CLT

Sobre a EmpresaZanc Acessoria Nacional de CobrançaLocalização: Porto Alegre-RSDetalhes da VagaÁrea de Atuação: Informåtica / TI / TecnologiaPrincipais ResponsabilidadesIrå desenvolver soluçÔes atuando tanto no frontend e backend de aplicaçÔes, participar de anålise de requisitos e o desenho de soluç...

Ver Detalhes →