Join Nixi AI as
Bachelor/ Master thesis (m/f/d)

Evaluation of Multi-Agent Retrieval-Augmented Generation (RAG) Systems Without Ground Truth

Cartoon illustration of a student working on a Master thesis in AI, featuring Nixi AI and Friedrich Schiller University Jena logos, with a computer screen showing multi-agent evaluation workflow

Bewerbungen geschlossen

Location:
Jena, FUSION group at the Friedrich Schiller University Jena, Work from home is possible

Start: As soon as possible

Language: Fluent English

Type: Full-time Master thesis

Seniority: Student in the fields of computer science, mathematics or any comparable degree

Your Mission

You will:

- Build a multi-agent RAG in Python, orchestrating LLM agent calls.
- Design evaluation rubrics and structured outputs.
- Conduct prompt tuning and calibration experiments (along with creating synthetic GT data).
- Optionally, wrap components in a chatbot interface for live test interactions.

Why This Role Is Unique

Work in a dynamic, international, and interdisciplinary environment in the beautiful city of Jena

Your work directly shapes product, positioning & growth

Work side‑by‑side with the founding team on all aspects—from pitch decks to pilot support

Flexible working hours and a family-friendly working environment

Friedrich Schiller University is a traditional university with a strong research profile rooted in the heart of Germany. As a university covering all disciplines, it offers a wide range of subjects. Its research is focused on the areas Light—Life—Liberty. It is closely networked with non-research institutions, research companies and renowned cultural institutions. With around 18,000 students and more than 8,600 employees, the university plays a major role in shaping Jena’s character as a cosmopolitan and future-oriented city.

The Project

The position is affiliated with the FUSION group of Univ. Prof. Dr Birgitta König-Ries, at Friedrich Schiller University Jena and in collaboration with the Nixi AI. The FUSION group is highly interdisciplinary and diverse, working towards building better biodiversity platforms and tools. More information about FUSION. Nixi AI is a health tech startup which is an all-in-one platform for AI-powered medical documentation, billing, and decision support.The project focuses on building and validating a multi-stage RAG (Retrieval‑Augmented Generation) pipeline without relying on any pre-existing ground truth data. It will explore two complementary evaluation methods: generating synthetic question-answer pairs with LLMs to serve as pseudo-ground truth, and employing an “LLM-as-a-judge” approach, where one or more LLMs assess the relevance, factual accuracy, and fluency of the pipeline’s outputs based on carefully designed prompts. A primary objective is to produce a confidence score for each generated answer, enabling end users to trust the system’s responses. The evaluation framework will be implemented in Python, integrating judge agents and synthetic benchmarks, and will be calibrated and validated through human spot-checks and statistical analyses to ensure the calibration of confidence and reliability of judgments.

Who Thrives Here

- Student in the fields of computer science, mathematics or any comparable degree
- Experience with the Python programming language
- Prior knowledge of working with LLMs, Langchain, Langgraph, and Huggingface is a plus
- Capable of working independently
- Willingness to learn new skills and technologies
- Fluency in English and good communication skills

Ready to Build with Us?

Get in touch and send a short email with your application and your CV to birgitta.koenig-ries@uni-jena.de  or Hello@NixiAi.ai.

Since all application documents will be duly destroyed after the recruitment process, we ask you to submit only copies of your documents.

For further information for applicants, please also refer to https://www.uni-jena.de/stellenmarkt (in German)

Please also note the information on the collection of personal data at https://www.uni-jena.de/stellenmarkt



Together, we’ll give German doctors two hours back every day.Nixi AI is committed to diversity. We welcome applicants of all backgrounds, even if you don’t tick every box.