🏢 Summary: Opportunity for a Senior AI Solutions Architect to design and lead production-grade LLM systems focused on retrieval-augmented generation and knowledge graph integration. The role involves building end-to-end orchestration layers, semantic search pipelines, and evaluation frameworks to deliver reliable, context-aware AI solutions. You will work with vector and graph databases, advanced embedding strategies, and enterprise AI platforms in a scalable cloud environment.
🗂️ Requirements: Experience designing and implementing LLM-based systems in production, Hands-on experience with retrieval-augmented generation (RAG), Strong knowledge of embeddings, vector search, and chunking strategies, Experience building entity extraction pipelines, Experience working with knowledge graphs, Proficiency in Python, Understanding of prompt engineering, Experience designing agent workflows, Experience defining evaluation frameworks and quality metrics for AI systems, Familiarity with distributed systems and scalable data architectures, Experience implementing observability, logging, and tracing
📃 Skills: Python, SQL, RAG, LLM, Embeddings, VectorDB, GraphDB, GCP, CloudSpanner, VertexAI, Gemini, IAM, Encryption, Observability
🏢 Description: Project overview This project focuses on building a scalable AI platform that transforms expert knowledge into structured graph based intelligence and connects it with enterprise grade language models. The solution emphasizes semantic search, agent workflows, and robust evaluation frameworks to ensure reliable outputs. Position overview We are looking for a Senior AI Solutions Architect to lead the design and implementation of advanced LLM driven solutions with a strong focus on retrieval augmented generation and knowledge graph integration. You will take ownership of the full orchestration layer, shaping how structured and unstructured data is transformed into high quality, context aware AI responses. Technology stack Python, SQL, vector databases, graph databases, Google Cloud Platform, Cloud Spanner, Vertex AI, Gemini, LLM frameworks, embedding models, observability tools, IAM, encryption Responsibilities Design and manage end to end LLM orchestration and retrieval pipelines Define embedding model selection and chunking strategies, including context window management and trade offs affecting retrieval quality and cost Own the entity extraction pipeline to convert unstructured content into graph nodes and relationships Implement entity resolution, relationship normalization, and deduplication processes Design and refine semantic search strategies and retrieval logic across graph and vector layers Develop prompt engineering approaches and agentic workflows for advanced use cases Integrate graph based outputs with enterprise AI platforms such as Gemini Design and maintain evaluation frameworks including ground truth dataset creation Measure and improve retrieval quality using metrics such as recall, precision at K, faithfulness, and answer relevance Establish systematic regression testing practices for AI pipelines Optimize LLM usage costs across the full retrieval and generation lifecycle Implement observability, logging, and tracing to monitor performance and reliability Requirements Experience designing and implementing LLM based systems in production environments Hands on experience with retrieval augmented generation and semantic search Strong understanding of embeddings, vector search, and chunking strategies Experience building entity extraction pipelines and working with knowledge graphs Proficiency in Python and data processing workflows Understanding of prompt engineering and agent workflow design Experience defining evaluation frameworks and quality metrics for AI systems Familiarity with distributed systems and scalable data architectures Experience implementing observability, logging, and tracing in data intensive environments Nice to have Experience with Google Cloud Platform services including Cloud Spanner and Vertex AI Familiarity with enterprise AI platforms such as Gemini Knowledge of cost optimization techniques for large scale LLM systems Experience with graph data models and hybrid architectures combining graph, relational, and vector data Exposure to advanced evaluation techniques for generative AI and ranking systems What We Offer: Vacation days : Up to 26 business days per year. 10 illness/special days off per year (fully paid, no medical papers needed) for all contract types Health and life insurance (Luxmed) MyBenefit platform with Multisport option Internal psychological support service English language classes from the first working day Access to external learning platforms : O’Reilly, LinkedIn Learning, Udemy, and a wide catalog of diverse internal training Flexible workplace : work from the office, from home, or choose a hybrid option Tech Skills Mentoring Program Opportunities to develop as a public speaker, mentor, or technical interviewer Fully paid idle (bench) when not involved in a project Certification reimbursement (AWS, GCP, Microsoft, etc.)