All jobs

Gen AI Engineer - LLM, RAG

100% Remote Full-time Open now

Hello Everyone, Hope you are doing good!!!! My name is Pavan and I work with SPAR Information System., I have a great opportunity for you, please find the job details below, if you are interested in applying please send me your updated resume and best time for you to discuss about this opportunity in details. Role: Generative AI Engineer Location: Atlanta, GA - Hybrid Duration: Long term contract JD: Generative AI Engineer: Semantic Search & LLM Platforms Job Summary We are looking for a Generative AI Engineer to design, build, and scale an enterprise AI-powered semantic search platform for API discovery and knowledge retrieval. The role focuses on developing LLM-driven search, RAG pipelines, and cloud-native AI services that enable natural language interaction with large-scale technical repositories. The ideal candidate has strong hands-on experience with LLMs, embeddings, vector databases, FastAPI microservices, and multi-cloud AI deployments, and is passionate about building reliable, production-grade GenAI systems.

Key Responsibilities

Design and implement AI-powered semantic search solutions for large-scale API and technical documentation repositories. Develop Retrieval-Augmented Generation (RAG) pipelines using OpenAI embeddings, LangChain, LangGraph, and vector databases (FAISS, pgvector). Build and maintain FastAPI-based microservices for LLM-powered search, summarization, and inference with secure authentication (JWT). Create and manage data ingestion and indexing pipelines, including document chunking, metadata extraction, embedding generation, and vector refresh workflows. Implement multi-cloud LLM integration and routing across Azure OpenAI, AWS Bedrock, and GPT-4 with fault-tolerant fallback mechanisms. Apply grounding techniques and hallucination mitigation strategies to improve response accuracy and reliability. Define and track RAG and LLM evaluation metrics such as precisionk, grounding score, latency, and hallucination rate. Integrate monitoring, logging, and observability using LangSmith and OpenTelemetry for model performance and system health. Deploy and scale AI services using cloud-native architectures (AWS Lambda, ECS Fargate, API Gateway, DynamoDB, S3). Collaborate with UI/UX and platform teams to deliver intuitive interfaces for natural-language API discovery. Contribute to CI/CD pipelines to enable automated testing, deployment, and versioning of AI services. Required Skills & Qualifications Technical Skills Programming: Python APIs & Services: FastAPI, REST APIs, JWT authentication Generative AI & LLMs: GPT-4, OpenAI embeddings, LangChain, LangGraph, RAG architectures Vector Databases: FAISS, pgvector Cloud Platforms: AWS (Lambda, ECS Fargate, API Gateway, DynamoDB, S3), Azure OpenAI, AWS Bedrock AI Evaluation & Observability: LangSmith, OpenTelemetry, RAG evaluation metrics DevOps & CI/CD: Docker, CI/CD pipelines, cloud-native deployments. Experience Hands-on experience building production-grade GenAI or semantic search systems Experience working with large-scale document or API repositories Strong understanding of LLM reliability, grounding, and hallucination control Experience deploying and operating AI systems in cloud environments Preferred Qualifications Experience designing enterprise knowledge search or developer productivity platforms Familiarity with multi-cloud AI architectures Exposure to agent-based LLM workflows Experience mentoring or providing technical guidance to other engineers Thanks & Regards, Pavan Raikhelkar LEAD TALENT ACQUISITION SPECIALIST Direct Number:- Fax : Email: Website: (An E-verify Company) NOTE: We respect your online privacy. This is not an unsolicited mail. Under bill 1618 title III passed by the 105th us congress this mail cannot be considered Spam as long as we include contact information and a method to be removed from our mailing list. If you are not interested in receiving our e-mails, please reply with a "REMOVE" in the subject line. We apologize for any inconvenience caused by this mail. Apply tot his job Apply tot his job Apply To this Job

You might also like

Genetic Counselor Assistant – Invitae – Remote Remote_United States

100% Remote Full-time

Local Markets Strategist - Remote in Southeast

100% Remote Full-time

Industry Partner Go to Market Lead – Insurance

100% Remote Full-time

Senior Business Analyst: Go-To-Market & Territory Management

100% Remote Full-time

Sales Associate (Part-Time), Los Angeles

100% Remote Full-time

Remote Part-Time Focus Group Participants (Up To $750/Week)

100% Remote Full-time

Online Digital Careers | Entry-Level Positions Paying $25-$35/hr

100% Remote Full-time

Senior Director, Strategy & Business Development Executive, Department of War (US Army, US Navy, US Air Force)

100% Remote Full-time

Customer Success Agent - Remote Position - No Degree Needed - $25-$35/hr

100% Remote Full-time

We’re Inviting Retired Risk, Governance & Compliance Professionals to Join Us (Part-Time/Consulting)

100% Remote Full-time

Experienced Phone Customer Service Representative – Wellness and Supplements Industry

100% Remote Full-time

Experienced Customer Service Representative – Remote Opportunity in Texas

100% Remote Full-time

Packaging Strategy Lead – Data Governance & Analytics Specialist | arenaflex Full-Time Position

100% Remote Full-time

Business Development Manager, Korea

100% Remote Full-time

GEN AI Sr Data Scientist/Data Scientist

100% Remote Full-time

Experienced Associate Manager, Customer Experience – Strategic Planning and Leadership for Premium Customer Segments

100% Remote Full-time

Require MAA Professional Music Teacher Store 020 in Charlotte, NC

100% Remote Full-time

Copywriter at Ulta Beauty – Remote Job Position

100% Remote Full-time

Senior Talent Acquisition Manager, APAC

100% Remote Full-time

Experienced Online Data Entry Specialist – Part-Time Remote Opportunity for Fresher to Earn Money Without Investment at blithequark

100% Remote Full-time