We are looking for a Senior Automated QA Engineer to lead our testing efforts. You won't just be testing standard web interfaces; you'll be figuring out how to reliably automate testing for non-deterministic AI features, multi-step AI agents, and complex LLM pipelines. If you know Playwright inside and out and have scars from trying to test LLM hallucinations in production, we want to talk to you.
What you’ll be doing

Own the E2E framework: Build, maintain, and scale our automated testing framework using Playwright (TypeScript/Python).
Test the unpredictable: Design strategies to test non-deterministic LLM outputs, AI agents, and RAG pipelines where standard assertions don't always work.
Tackle LLM-specific challenges: Build guardrails and automated checks for prompt drift, hallucinations, latency, and context window limits.
Evaluate Agent behavior: Create scenarios to test how our AI agents handle edge cases, multi-step reasoning, and error recovery in real-world document processing workflows.
Integrate and collaborate: Wire your tests into our CI/CD pipelines to ensure we can ship quickly without breaking the core AI logic. Work closely with AI researchers, backend engineers, and product managers to define what "quality" means for an AI agent.

Dein Profil

What we’re looking for

Experience: 5+ years in QA Automation or Software Engineering in Test (SDET).
Playwright expertise: You have deep, hands-on experience building reliable, scalable test suites in Playwright. You know how to handle flaky tests, parallel execution, and complex DOM structures.
Coding chops: Strong programming skills in TypeScript, JavaScript, or Python.
AI/LLM testing experience: You understand how LLMs work under the hood. You know the challenges of testing them (non-determinism, evaluating accuracy vs. exact match, security/injection risks) and have used tools or frameworks (like LLM-as-a-judge, LangSmith, DeepEval, etc.) to evaluate them.
Systems thinking: You can look at a complex architecture involving a frontend, backend APIs, vector databases, and LLM endpoints, and know exactly where things are likely to break.
Communication: You can clearly explain complex QA issues to both highly technical machine learning engineers and non-technical stakeholders.

Bonus points if you have:

Experience in the financial tech or document automation space.
Familiarity with containerization (Docker, Kubernetes) and advanced CI/CD setups (GitHub Actions, GitLab CI).
Experience testing API performance and LLM endpoint latency.

Unser Versprechen

We trust amazing people to do amazing things and make a long-term impact - we give you Freedom and ownership of meaningful work that directly impacts the business
We're building a positive organizational culture where personal and professional growth are just as important as business growth
We believe different perspectives make Hypatos a better community - that is why we're committed to building a diverse and inclusive environment where you feel you belong
Beyond a top market compensation package including company shares, you will enjoy a personal development budget, meal allowance, sporting activities and free beers :)

Apply for this job

Über uns

Hypatos is redefining enterprise work by deploying LLM-powered AI Agents into the heart of business operations. Backed by leading investors (Elaia, Blackfin Tech, Grazia Equity, UVC Partners, DTFC, Plug & Play), we are expanding rapidly and building the foundation for the next generation of intelligent business systems.
Join us to shape the future of enterprise automation and help improve the way hundreds of millions of people work every day.

This job is no longer accepting applications

See open jobs at Hypatos.See open jobs similar to "QA Tester" UVC Partners.

See more open positions at Hypatos

Powered by Getro.com

Privacy policy Cookie policy