Logo Interon IT Solutions LLC

Gen AI / Agentic Engineer

Interon IT Solutions LLCvia Dice
RemotoChantilly, Virginia, UsPlenoCLT16 dias atrás

Salário Estimado

R$ 7.128,00 - R$ 10.692,00

0de 100

Excelente

Score da Vaga

Descrição da Vaga

#W2 Role Job Title: Gen AI / Agentic Engineer Location: Remote Type: W2 Contract Job Summary We are looking for a GenAI / Agentic Engineer to design, build, and deploy LLM-powered applications on AWS.


This role is focused on real production engineering-APIs, RAG pipelines, agent workflows, evaluation, deployment, monitoring, and performance/cost tuning.


Responsibilities • Build and maintain LLM-powered backend services using Python and FastAPI (chat, search, summarization, Q&A).

Design and implement RAG pipelines end-to-end: ingestion, parsing, chunking, embeddings, indexing, retrieval, reranking, and grounded responses. • Develop agentic workflows for multi-step automation (tool calling, orchestration, state/memory, retries, audit logs).
Deploy and support GenAI workloads on AWS using ECS/Lambda, S3, SQS, DynamoDB/RDS, OpenSearch (or vector store), and related services. • Implement security and governance controls: auth, authorization, secrets, encryption, PII handling, and prompt-injection defenses.
Build evaluation and monitoring for quality, hallucination reduction, latency, and cost (test sets, regression checks, dashboards, alerts). • Work across full SDLC: design docs, estimates, coding, code reviews, CI/CD, testing, release, and production support.
Communicate architecture decisions clearly and explain tradeoffs (accuracy vs latency vs cost) to stakeholders.

Required Skills (Point-Based) • 10+ years overall IT experience with backend/API engineering and cloud deployments

2+ years hands-on GenAI/LLM experience delivering real features (not just demos) • 6+ years strong Python (core Python, clean coding, debugging, packaging)
Experience with asyncio and concurrency (threads/async), plus profiling and performance tuning • Comfortable with stateful/long-running workflows: transaction handling, retries, idempotency, and failure recovery
5+ years building REST APIs / microservices, strong API design and error handling • 5+ years with FastAPI (or similar) including middleware, dependency injection, background tasks
Experience implementing auth/security using JWT/OAuth, RBAC, secure configuration, secrets handling • Strong testing discipline using pytest (unit/integration tests, mocks, API contract testing)
Proven experience building RAG systems end-to-end: chunking strategies, embeddings, retrieval tuning, reranking, grounding/citations • Hands-on with RAG optimization: hybrid retrieval, metadata filters, top-k tuning, chunk tuning, reranking strategies
Experience with agentic patterns: tool calling, orchestration, memory/state, structured outputs, audit trails • Experience implementing guardrails: output schema enforcement (JSON), refusal handling, safety filters, prompt-injection defenses, PII masking
5+ years AWS experience using ECS/Lambda, S3, SQS, DynamoDB/RDS (and related services) • Strong AWS security fundamentals: IAM, KMS, Secrets Manager, CloudWatch logs/metrics/alarms
Experience deploying LLM workloads via Amazon Bedrock (preferred) or SageMaker • Strong system design: scalability, caching, rate limiting, queues, resilience/failure handling
Ability to clearly explain GenAI architecture decisions and tradeoffs across accuracy/latency/cost Nice to Have • LangChain / LangGraph / LlamaIndex (any)
OpenSearch vector search or vector DB experience (Pinecone/Weaviate/FAISS, etc.) • Docker, Terraform/CDK, CI/CD (GitHub Actions/Jenkins)
Experience in regulated environments (finance/healthcare/telecom) with governance controls

Requisitos

  • Required Skills (Point-Based)
  • 10+ years overall IT experience with backend/API engineering and cloud deployments
  • 2+ years hands-on GenAI/LLM experience delivering real features (not just demos)
  • 6+ years strong Python (core Python, clean coding, debugging, packaging)
  • Experience with asyncio and concurrency (threads/async), plus profiling and performance tuning
  • Comfortable with stateful/long-running workflows: transaction handling, retries, idempotency, and failure recovery
  • 5+ years building REST APIs / microservices, strong API design and error handling
  • 5+ years with FastAPI (or similar) including middleware, dependency injection, background tasks
  • Experience implementing auth/security using JWT/OAuth, RBAC, secure configuration, secrets handling
  • Strong testing discipline using pytest (unit/integration tests, mocks, API contract testing)
  • Proven experience building RAG systems end-to-end: chunking strategies, embeddings, retrieval tuning, reranking, grounding/citations
  • Hands-on with RAG optimization: hybrid retrieval, metadata filters, top-k tuning, chunk tuning, reranking strategies
  • Experience with agentic patterns: tool calling, orchestration, memory/state, structured outputs, audit trails
  • Experience implementing guardrails: output schema enforcement (JSON), refusal handling, safety filters, prompt-injection defenses, PII masking
  • 5+ years AWS experience using ECS/Lambda, S3, SQS, DynamoDB/RDS (and related services)
  • Strong AWS security fundamentals: IAM, KMS, Secrets Manager, CloudWatch logs/metrics/alarms
  • Strong system design: scalability, caching, rate limiting, queues, resilience/failure handling
  • Ability to clearly explain GenAI architecture decisions and tradeoffs across accuracy/latency/cost
  • LangChain / LangGraph / LlamaIndex (any)
  • OpenSearch vector search or vector DB experience (Pinecone/Weaviate/FAISS, etc.)
  • Docker, Terraform/CDK, CI/CD (GitHub Actions/Jenkins)
  • Experience in regulated environments (finance/healthcare/telecom) with governance controls

Responsabilidades

  • This role is focused on real production engineering-APIs, RAG pipelines, agent workflows, evaluation, deployment, monitoring, and performance/cost tuning
  • Build and maintain LLM-powered backend services using Python and FastAPI (chat, search, summarization, Q&A)
  • Design and implement RAG pipelines end-to-end: ingestion, parsing, chunking, embeddings, indexing, retrieval, reranking, and grounded responses
  • Develop agentic workflows for multi-step automation (tool calling, orchestration, state/memory, retries, audit logs)
  • Deploy and support GenAI workloads on AWS using ECS/Lambda, S3, SQS, DynamoDB/RDS, OpenSearch (or vector store), and related services
  • Implement security and governance controls: auth, authorization, secrets, encryption, PII handling, and prompt-injection defenses
  • Build evaluation and monitoring for quality, hallucination reduction, latency, and cost (test sets, regression checks, dashboards, alerts)
  • Work across full SDLC: design docs, estimates, coding, code reviews, CI/CD, testing, release, and production support
  • Communicate architecture decisions clearly and explain tradeoffs (accuracy vs latency vs cost) to stakeholders

Vagas Semelhantes

RemotoRemoto9 dias atrás

R$ 9k - 14k/mês

PlenoCLT

Job Description: • Design and build robust backend services and microservices that power the DevX platform ecosystem. • Integrate Large Language Models (LLMs) and custom AI models to enable features like semantic code search, automated refactoring, and natural language infrastructure provisioning. •...

S

Desenvolvedor Python

SIS Innov & TechWhatJobs
RemotoTaguatinga, Tocantins, BrHoje

R$ 6k - 10k/mês

PlenoCLT

Sobre a Empresa Há mais de 20 anos mercado, somos uma consultoria estratégica de Inovação e Transformação Digital. Nossa especialidade é impulsionar as demandas de nossos clientes, integrando processos, pessoas e tecnologia de alta performance. Sobre o Cargo: Desenvolvedor Experiência sólida com Pyt...

RemotoBrOntem

R$ 7k - 11k/mês

PlenoCLT

Requisitos Técnicos • Experiência sólida com Java 8+ • Conhecimento em Spring Boot, JPA/Hibernate • Experiência com Angular 10+ • Domínio de HTML5, CSS3, TypeScript • Experiência com APIs REST • Conhecimento em banco de dados relacionais (PostgreSQL, MySQL, Oracle ou SQL Server) • Versionamento com ...

R$ 9k - 14k/mês

PlenoCLT

Backend Engineer (Java 17 & Angular) Ubicación: Remoto Proyecto de largo plazo en una gran empresa del sector financiero internacional Inglés avanzado (deseable) ¿Te gustaría formar parte de un equipo que diseña soluciones críticas para uno de los grupos financieros más importantes del mercado britá...

Interessado nesta vaga?

Candidatar-se

Você será redirecionado para o site original

Informações

NívelPleno
ContratoCLT
LocalChantilly, Virginia, Us
RemotoSim
MoedaBRL
Publicada16 dias atrás
FonteDice

Análise de Vaga com IA

Estimativa salarial, match de tecnologias e análise de requisitos feitos com Inteligência Artificial

Powered by CodeCortex
← Voltar às Vagas