Senior AI Engineer (Evals/Observability Concentration)
Salário Estimado
R$ 12.870,00 - R$ 19.305,00
Descrição da Vaga
Risepoint is an education technology company that provides world-class support and trusted expertise to more than 100 universities and colleges.
We primarily work with regional universities, helping them develop and grow their high-ROI, workforce-focused online degree programs in critical areas such as nursing, teaching, business, and public service.
Risepoint is dedicated to increasing access to affordable education so that more students, especially working adults, can improve their careers and meet employer and community needs.
The Impact You Will Make Risepoint is developing an AI-powered Student Journey Platform and is seeking a Senior AI Engineer with deep expertise in Retrieval-Augmented Generation (RAG), multi-agent architectures, and LLM evaluation frameworks.
This role focuses on designing, implementing, and operationalizing AI systems with a strong emphasis on structured evaluation (including LLM-as-Judge), measurable quality, and production-grade reliability.
The ideal candidate has experience integrating LLMs with enterprise data sources, building testable and observable AI workflows, and improving system performance through rigorous evaluation and iteration.
This role contributes directly to a platform that is central to the organization’s long-term strategy.
How You Will Bring Our Mission to Life What You Will Do • Build and maintain evaluation frameworks (LLM-as-Judge, rubric-based scoring, regression test suites) to measure output quality, reliability, and drift with the responsibility of debugging production level issues as detected.
What Success Looks Like • RAG pipelines return grounded, source-attributed responses with minimal hallucination.
How Impact Will be Measured • AI systems demonstrate measurable improvements in quality using defined evaluation benchmarks.
What You’ll Bring to the Team Experience That Matters Most • 3-5 years of full stack engineering experience with strong fundamentals in object-oriented programming, applicable design patterns, and AI-focused system design.
Langfuse, LangSmith, OpenTelemetry-based tracing, custom evaluation harnesses). • Experience implementing guardrails, policy enforcement, and safety layers in AI driven systems while leveraging LLM-as-Judge for validation and continuous improvement.
Experience in Databricks (model serving endpoints, ML Flow) Risepoint is an equal-opportunity employer and supports a diverse and inclusive workforce.
Requisitos
- Risepoint is developing an AI-powered Student Journey Platform and is seeking a Senior AI Engineer with deep expertise in Retrieval-Augmented Generation (RAG), multi-agent architectures, and LLM evaluation frameworks
- RAG pipelines return grounded, source-attributed responses with minimal hallucination
- Multi-agent workflows are observable, testable, and maintainable as complexity increases
- 3-5 years of full stack engineering experience with strong fundamentals in object-oriented programming, applicable design patterns, and AI-focused system design
- Professional experience in Python, C#, Java, or a similar language used in production systems
- Experience with LLM evaluation and observability tooling (e.g. Langfuse, LangSmith, OpenTelemetry-based tracing, custom evaluation harnesses)
- Experience implementing guardrails, policy enforcement, and safety layers in AI driven systems while leveraging LLM-as-Judge for validation and continuous improvement
- Experience That’s Great to Have
- Familiarity with performance optimization techniques for LLM-based systems (latency, caching, routing, batching)
- Experience building production-grade RAG systems (retrieval pipelines, chunking strategies, embeddings, reranking, context construction)
- Experience contributing to internal AI standards, reusable frameworks, or platform-level tooling
- Experience deploying AI systems in cloud environments (AWS, Azure, GCP)
- Experience in Databricks (model serving endpoints, ML Flow)
Responsabilidades
- This role focuses on designing, implementing, and operationalizing AI systems with a strong emphasis on structured evaluation (including LLM-as-Judge), measurable quality, and production-grade reliability
- The ideal candidate has experience integrating LLMs with enterprise data sources, building testable and observable AI workflows, and improving system performance through rigorous evaluation and iteration
- This role contributes directly to a platform that is central to the organization’s long-term strategy
- Build and maintain evaluation frameworks (LLM-as-Judge, rubric-based scoring, regression test suites) to measure output quality, reliability, and drift with the responsibility of debugging production level issues as detected
- Architect and implement multi-agent workflows with clear coordination, tool usage, and failure handling patterns
- Build structured observability into AI systems (tracing, prompt/version tracking, evaluation logging, cost and latency monitoring)
- Define and enforce quality gates for AI features using automated evals prior to production release
- Optimize inference performance (latency, token usage, caching, batching, routing across models)
- Collaborate with product and engineering teams to translate business requirements into testable AI system designs
- Contribute to code reviews, architectural discussions, and internal standards for AI development
- Design and implement Retrieval-Augmented Generation (RAG) systems and Model Context Protocol (MCP) servers using structured and unstructured enterprise data
- Develop and manage fine-tuning workflows (SFT, preference optimization, or related techniques) including dataset preparation, versioning, and validation
- Evals are automated, reproducible, and integrated into CI/CD or release workflows
- AI systems demonstrate measurable improvements in quality using defined evaluation benchmarks
- Fine-tuned models and/or programmatic solutions show validated performance gains over baseline foundation models
- AI systems meet defined SLAs for latency, reliability, and cost
Vagas Semelhantes
Senior AI Engineer (Evals/Observability Concentration)
R$ 16k - 25k/mês
Risepoint is an education technology company that provides world-class support and trusted expertise to more than 100 universities and colleges. We primarily work with regional universities, helping them develop and grow their high-ROI, workforce-focused online degree programs in critical areas such...
Senior GenAI Engineer
R$ 13k - 19k/mês
Job Title: Senior GenAI Engineer Job Category: Information Technology Time Type: Full time Minimum Clearance Required to Start: Secret Employee Type: Regular Percentage of Travel Required: Up to 10% Type of Travel: Local • * * The Opportunity: Join our team as a Senior GenAI Engineer supporting Depa...
Backend developer (node) - sênior
R$ 11k - 16k/mês
[Backend Developer (Node) | Senior] Remoto | 44h semanais Para você nos conhecer melhor Desenvolvemos pessoas, softwares & negócios +16 anos de experiência desenvolvendo software em 4 continentes com NPS +84, nos colocando em zona de excelência! Atuamos no desenvolvimento de soluções digitais (web, ...
Senior Software Engineer (Go, Kubernetes)-PerfectScale by DoiT (Portugal)
R$ 13k - 19k/mês
This a Full Remote job, the offer is available from: EMEA Senior Backend Engineer - PerfectScale by DoiT Location Our Senior Backend Engineer will be an integral part of our EMEA engineering teams. This full-time remote role is open to employees based in the UK, Ireland, Estonia, Netherlands, Sweden...
Informações
Análise de Vaga com IA
Estimativa salarial, match de tecnologias e análise de requisitos feitos com Inteligência Artificial
Quer se preparar melhor? Pratique entrevistas com IA no Recrutadoria ou melhore suas habilidades no BitMentor