Logo Innodata Inc.

[Remote] AI/ML Research Engineer, LLM Post-Training & Evaluation

Innodata Inc.via Jobright
RemotoUsSêniorCLTOntem

Salário Estimado

R$ 12.870,00 - R$ 19.305,00

0de 100

Regular

Score da Vaga

Descrição da Vaga

Note: The job is a remote job and is open to candidates in USA.


Innodata Inc. is a leading data engineering company providing AI technology solutions to major technology firms and industries.


They are seeking an AI/ML Research Engineer to build and optimize technical foundations for model improvement, focusing on large language models and evaluation systems.


Responsibilities • Lead or co-lead technically complex ML engineering projects from initial customer discussions through implementation and delivery

Design, build, and improve LLM training and post-training pipelines, including data ingestion, preprocessing, fine-tuning, evaluation, and experiment tracking • Implement and optimize evaluation systems for LLMs and multimodal models, including offline benchmarks and task-specific test harnesses
Integrate human-in-the-loop and AI-augmented evaluation signals into model development workflows • Build robust infrastructure and tooling for reproducible experimentation, metrics logging, and regression monitoring
Diagnose model behavior and pipeline failures, including data issues, training instability, metric inconsistencies, and evaluation drift • Collaborate with Language Data Scientists and Applied Research Scientists to translate evaluation frameworks into executable systems
Work closely with customer technical stakeholders to understand goals, constraints, and success criteria; propose and implement technically sound solutions • Contribute to internal research and platform development, including benchmark frameworks, evaluation tooling, and post-training workflow improvements
Contribute to best practices and standards for LLM training, evaluation, and quality assurance across projects • Mentor junior engineers and contribute to technical design reviews, documentation, and engineering rigor across the team Skills
BS/MS/PhD in Computer Science, Machine Learning, AI, Applied Mathematics, or a related quantitative technical field (MS/PhD preferred) • 2-3 years of relevant industry or research engineering experience in ML/AI systems
Hands-on experience with LLM training / fine-tuning / post-training, including at least one of: supervised fine-tuning (SFT), preference optimization (e.g., DPO or related methods), RLHF / RLAIF-style workflows, task- or domain-adaptation of foundation models • Strong programming skills in Python and experience building production-quality ML code
Experience with modern ML frameworks (e.g., PyTorch, JAX, TensorFlow) and model libraries/tooling (e.g., Hugging Face ecosystem, vLLM, distributed training stacks) • Experience designing and implementing evaluation pipelines for LLM/ML systems, including metrics computation, dataset handling, and experiment comparisons
Strong understanding of data pipelines and ML systems engineering, including reproducibility, observability, and debugging • Experience with large-scale distributed ML systems and performance optimization for training/evaluation workloads (GPU/accelerator environments preferred)
Experience with large-scale data processing and workflow orchestration in support of model training/evaluation • Ability to collaborate directly with technical stakeholders including research scientists, ML engineers, data engineers, and customer technical leads
Strong written and verbal communication skills, including the ability to explain complex technical tradeoffs to both technical and non-technical audiences • Experience training, fine-tuning, and evaluating transformer-based models
Understanding of post-training workflows and model iteration loops • Familiarity with inference-time considerations (latency, throughput, memory/performance tradeoffs) where relevant to evaluation or deployment
Experience implementing automated evaluation pipelines and test harnesses • Experience with experiment tracking, versioning, and reproducibility practices
Ability to assess metric quality and ensure consistency across model comparisons • Proficiency in Python and strong software engineering fundamentals
Experience with data processing pipelines, storage formats, and scalable dataset workflows • Familiarity with CI/CD, testing, and engineering quality practices for ML systems
Experience with multimodal model training/evaluation (text + image/audio/video) • Experience with long-context evaluation and/or model adaptation for long-context tasks
Experience with agentic or multi-turn evaluation harnesses, tool-use simulation, or interactive environment testing • Experience working in customer-facing technical consulting, solutions engineering, or applied research delivery
Familiarity with LLM safety, alignment, robustness, or red-teaming evaluation approaches • Contributions to open-source ML/LLM tooling or published technical work in relevant areas Company Overview
(NASDAQ: INOD) Innodata is a global data engineering company.

We believe that data and AI are inextricably linked.


It was founded in 1988, and is headquartered in Hackensack, New Jersey, USA, with a workforce of 5001-10000 employees.


Its website is http://www.innodata.com.


Company H1B Sponsorship • Innodata Inc. has a track record of offering H1B sponsorships, with 2 in 2024.


Please note that this does not guarantee sponsorship for this specific role.

Requisitos

  • 2-3 years of relevant industry or research engineering experience in ML/AI systems
  • Hands-on experience with LLM training / fine-tuning / post-training, including at least one of: supervised fine-tuning (SFT), preference optimization (e.g., DPO or related methods), RLHF / RLAIF-style workflows, task- or domain-adaptation of foundation models
  • Strong programming skills in Python and experience building production-quality ML code
  • Experience with modern ML frameworks (e.g., PyTorch, JAX, TensorFlow) and model libraries/tooling (e.g., Hugging Face ecosystem, vLLM, distributed training stacks)
  • Experience designing and implementing evaluation pipelines for LLM/ML systems, including metrics computation, dataset handling, and experiment comparisons
  • Strong understanding of data pipelines and ML systems engineering, including reproducibility, observability, and debugging
  • Experience with large-scale data processing and workflow orchestration in support of model training/evaluation
  • Ability to collaborate directly with technical stakeholders including research scientists, ML engineers, data engineers, and customer technical leads
  • Strong written and verbal communication skills, including the ability to explain complex technical tradeoffs to both technical and non-technical audiences
  • Experience training, fine-tuning, and evaluating transformer-based models
  • Understanding of post-training workflows and model iteration loops
  • Familiarity with inference-time considerations (latency, throughput, memory/performance tradeoffs) where relevant to evaluation or deployment
  • Experience implementing automated evaluation pipelines and test harnesses
  • Experience with experiment tracking, versioning, and reproducibility practices
  • Ability to assess metric quality and ensure consistency across model comparisons
  • Proficiency in Python and strong software engineering fundamentals
  • Experience with data processing pipelines, storage formats, and scalable dataset workflows
  • Familiarity with CI/CD, testing, and engineering quality practices for ML systems
  • Experience with multimodal model training/evaluation (text + image/audio/video)
  • Experience with long-context evaluation and/or model adaptation for long-context tasks
  • Experience with agentic or multi-turn evaluation harnesses, tool-use simulation, or interactive environment testing
  • Experience working in customer-facing technical consulting, solutions engineering, or applied research delivery
  • Familiarity with LLM safety, alignment, robustness, or red-teaming evaluation approaches
  • Contributions to open-source ML/LLM tooling or published technical work in relevant areas

Responsabilidades

  • Lead or co-lead technically complex ML engineering projects from initial customer discussions through implementation and delivery
  • Design, build, and improve LLM training and post-training pipelines, including data ingestion, preprocessing, fine-tuning, evaluation, and experiment tracking
  • Implement and optimize evaluation systems for LLMs and multimodal models, including offline benchmarks and task-specific test harnesses
  • Integrate human-in-the-loop and AI-augmented evaluation signals into model development workflows
  • Build robust infrastructure and tooling for reproducible experimentation, metrics logging, and regression monitoring
  • Diagnose model behavior and pipeline failures, including data issues, training instability, metric inconsistencies, and evaluation drift
  • Collaborate with Language Data Scientists and Applied Research Scientists to translate evaluation frameworks into executable systems
  • Work closely with customer technical stakeholders to understand goals, constraints, and success criteria; propose and implement technically sound solutions
  • Contribute to internal research and platform development, including benchmark frameworks, evaluation tooling, and post-training workflow improvements
  • Contribute to best practices and standards for LLM training, evaluation, and quality assurance across projects
  • Mentor junior engineers and contribute to technical design reviews, documentation, and engineering rigor across the team

Vagas Semelhantes

RemotoRidgefield Park, New Jersey, UsOntem

R$ 15k - 23k/mês

SêniorCLT

Who we are: Innodata (NASDAQ: INOD) is a leading data engineering company. With more than 2,000 customers and operations in 13 cities around the world, we are the AI technology solutions provider-of-choice to 4 out of 5 of the world’s biggest technology companies, as well as leading companies across...

Logo United Software Group Inc

AIML Engineer

United Software Group IncLinkedIn
RemotoMinneapolis, Minnesota, UsOntem

R$ 13k - 19k/mês

SêniorCLT

Job Title: Lead AI/ML Engineer Location : Remote Duration : Fulltime Teams Meeting Video Call Description Required Qualifications: • Proven AI/ML Leadership: 10-15 years of experience in the AI/ML field, with at least 4-5 years in a leadership or management role leading technical teams in the delive...

Logo Walmart Canada

Principal, Data Scientist - Gen AI

Walmart CanadaJobgether
RemotoWashington, District Of Columbia, Us6 dias atrás

R$ 12k - 24k/mês

SêniorCLT

This a Full Remote job, the offer is available from: United States, Canada, California (USA) Position Summary... What you'll do... Principal Data Scientist (Computer Vision) Join Walmart and your work could help over 275 million global customers live better every week. Yes, we are the Fortune #1 com...

At Walmart, we offer competitive pay as well as performance-based bonus awards and other great benefits for a happier mind, body, and walletHealth benefits include medical, vision and dental coverageFinancial benefits include 401(k), stock purchase and company-paid life insurance
RemotoRemoto7 dias atrás

R$ 16k - 25k/mês

SêniorCLT

Note: The job is a remote job and is open to candidates in USA. Apetan Consulting LLC is seeking an experienced AI / ML / GenAI Architect / Senior Engineer to design, develop, and deploy advanced artificial intelligence and machine learning solutions. The role involves leading AI initiatives, buildi...

Interessado nesta vaga?

Candidatar-se

Você será redirecionado para o site original

Informações

NívelSênior
ContratoCLT
LocalUs
RemotoSim
MoedaBRL
PublicadaOntem
FonteJobright

Análise de Vaga com IA

Estimativa salarial, match de tecnologias e análise de requisitos feitos com Inteligência Artificial

Powered by CodeCortex
← Voltar às Vagas