Senior AI Solutions Architect with LLM and RAG, Knowledge Graph Platform
- Almaty
- Astana
- Belgrade
- Cluj-Napoca
- Dnipro
- Kharkiv
- Krakow
- Kyiv
- Larnaca
- Lodz
- Lublin
- Lviv
- Odesa
- Remote.Bulgaria
- Remote.Georgia
- Remote.Kazakhstan
- Remote.Poland
- Riga
- Sofia
- Tbilisi
- Varna
- Warsaw
- Wroclaw
- Yerevan
If you received this vacancy from our recruiters — read our Privacy Notice.
Project overview
Position overview
Technology stack
Responsibilities
- Design and manage end to end LLM orchestration and retrieval pipelines
- Define embedding model selection and chunking strategies, including context window management and trade offs affecting retrieval quality and cost
- Own the entity extraction pipeline to convert unstructured content into graph nodes and relationships
- Implement entity resolution, relationship normalization, and deduplication processes
- Design and refine semantic search strategies and retrieval logic across graph and vector layers
- Develop prompt engineering approaches and agentic workflows for advanced use cases
- Integrate graph based outputs with enterprise AI platforms such as Gemini
- Design and maintain evaluation frameworks including ground truth dataset creation
- Measure and improve retrieval quality using metrics such as recall, precision at K, faithfulness, and answer relevance
- Establish systematic regression testing practices for AI pipelines
- Optimize LLM usage costs across the full retrieval and generation lifecycle
- Implement observability, logging, and tracing to monitor performance and reliability
Requirements
- Experience designing and implementing LLM based systems in production environments
- Hands on experience with retrieval augmented generation and semantic search
- Strong understanding of embeddings, vector search, and chunking strategies
- Experience building entity extraction pipelines and working with knowledge graphs
- Proficiency in Python and data processing workflows
- Understanding of prompt engineering and agent workflow design
- Experience defining evaluation frameworks and quality metrics for AI systems
- Familiarity with distributed systems and scalable data architectures
- Experience implementing observability, logging, and tracing in data intensive environments
Nice to have
- Experience with Google Cloud Platform services including Cloud Spanner and Vertex AI
- Familiarity with enterprise AI platforms such as Gemini
- Knowledge of cost optimization techniques for large scale LLM systems
- Experience with graph data models and hybrid architectures combining graph, relational, and vector data
- Exposure to advanced evaluation techniques for generative AI and ranking systems
Looking for Similar Opportunities?

We offer
Vacation
As per the laws of your country. We do ask you to take a proper rest
Health insurance
We help you to take out an insurance policy for you and your loved ones
Sick pay
10 days without a doctor's note, afterwards - as per the laws of your country
Time off for state holidays
According to the official calendar, regardless of the client’s schedule
Pleasant environment
Two large corporate parties and many small get-togethers for colleagues
Comfort service
Solving technical and everyday problems at work
What if I can’t find it?
FAQ for Candidates
Work on global projects, grow your career in a supportive, flexible, and innovative tech environment. We help cover the cost of IT certifications and provide access to top-tier courses and learning platforms. View current openings and take the next step with us.