May 19, 2026
Automation QA Engineer
Mid • Remote
Warsaw, Poland
We are looking for an Automation QA Engineer (Fullstack) to join our team, with strong expertise in test automation, AI workflows, and quality assurance for scalable platforms.
Your responsibilities:
Define and execute test strategies across the full stack: backend APIs (FastAPI), frontend UIs, and agentic workflows.
Define testing standards and frameworks that enable developers to own unit and integration test coverage, in line with a shift-left quality model.
Design testing approaches for non-deterministic AI outputs: LLM evaluation benchmarks, regression datasets, and golden-set validation.
Enable and govern the automated testing strategy: owning E2E and system-level validation while supporting developers in writing their own unit and integration tests.
Implement workflow validation, ensuring holistic quality across text, audio, and generated content (not just individual outputs).
Collaborate with AI engineers to define quality gates for content generation pipelines.
Perform API testing, load testing, and performance validation for platform services.
We are looking for you, if you have:
3–5+ years of QA experience with a fullstack focus (backend + frontend).
Strong experience with Python test frameworks (pytest).
Familiarity with frontend testing tools (Playwright, Cypress, or equivalent).
Experience testing APIs (REST, WebSocket) and microservice architectures.
Ability to design test strategies for AI/ML systems with non-deterministic outputs.
Hands-on experience with CI/CD integration for automated testing.
Understanding of Docker, cloud environments (Azure/AWS), and deployment pipelines.
Experience with performance and load testing tools.
We offer:
Participation in interesting and demanding projects
Flexible working hours (regular CET hours, with occasional meetings with the US team)
A great, non-corporate atmosphere
Stable employment conditions (contract of employment or B2B contract)
Opportunities for development and promotion
Attractive package of benefits
Work model: remote or hybrid (2 days per week from the office)
We reserve the right to contact only selected candidates.
Similar jobs you might like
Technology
EPAM Systems
Senior AI Test Automation Engineer
Senior
On-site
Gdansk, Poland
🏢 Summary: Office-based role for a Regular or Senior AI Test Automation Engineer to build and maintain AI-driven test automation for a modern investment platform. The position focuses on automating functional, integration, and non-functional testing using AI agents and integrating quality controls into CI/CD pipelines. The engineer will ensure compliance, traceability, and high test coverage in a fast-paced, full-stack environment. 🗂️ Requirements: 4+ years of test automation experience, Minimum 2 years in AI-assisted or agentic testing environments, Proficiency with Playwright and PyTest, Experience with CI/CD tools such as Jenkins or GitLab, Experience in API testing, Experience with AI testing agents and synthetic data generation, Knowledge of financial platforms and time-series data validation, Understanding of test strategy design and coverage metrics, Experience with compliance-driven testing in agile environments 📃 Skills: Python, PyTest, Playwright, Jenkins, GitLab, CI/CD, API, AI, Automation, Testing, SyntheticData, TimeSeries, Agile 🏢 Description: We are looking for an experienced Regular or Senior AI Test Automation Engineer to join a full-stack team developing a cutting-edge, AI-native investment platform. The person in this role will be responsible for test automation using AI agents and modern testing tools. This is a 100% office-based position. The project utilizes technologies such as Python, PyTest, Playwright, CI/CD tools (Jenkins, GitLab), as well as solutions for API testing and synthetic data generation If you're ready to make an impact in a dynamic environment, we want to hear from you! 
Responsibilities Automate test coverage across functional, integration and non-functional layers using AI agents (e.g., Unit Testing Agent, API Test Agent, Automation QA Agent) Integrate testing into CI/CD pipelines and implement quality gates for code, performance and security validation Leverage AI-generated test strategies and synthetic test data to accelerate validation cycles Ensure traceability and auditability of test results in line with investment platform compliance requirements Collaborate with the team to validate code quality and promote test-driven development practices Monitor and optimize test execution using observability tools and analytics from platform usage Requirements Minimum 4+ years of experience in test automation, including at least 2 years in AI-assisted or agentic testing environments Proficiency with test frameworks (e.g., Playwright, PyTest), CI/CD tools (Jenkins, GitLab) and API testing Experience with AI testing agents, synthetic data generation and test orchestration Familiarity with financial platforms, time-series data validation and compliance-driven testing Strong understanding of test strategy design, coverage metrics and defect triage in agile environments We offer/Benefits We gather like-minded people: Engineering community of industry professionals Friendly team and enjoyable working environment Flexible schedule and opportunity to work remotely within Poland Chance to work abroad for up to 60 days annually Business-driven relocation opportunities We provide growth opportunities: Outstanding career roadmap Leadership development, career advising, soft skills, and well-being programs Certification (GCP, Azure, AWS) Unlimited access to LinkedIn Learning, Get Abstract, Cloud Guru English classes We cover it all: Stable income (Employment Contract or B2B) Participation in the Employee Stock Purchase Plan Benefits package (health insurance, multisport, shopping vouchers) Strategically located offices featuring entertainment and 
relaxation zones, table tennis and football, free snacks, fantastic coffee, and more Referral bonuses Corporate, social and well-being events Please, note: The set of bonuses might vary based on the role you apply for – specifics will be discussed with our recruiter during the general interview. We will reach out to selected candidates exclusively. EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
Technology
iTeamly
Senior QA Engineer (Robot Framework)
Senior
Remote
Krakow, Poland
16,000 - 23,500 PLN
🏢 Summary: The offer is for an experienced QA Engineer to ensure high software quality in a modern microservices environment, taking ownership of test strategy and automation across the full development lifecycle. The role focuses on API and end-to-end test automation, CI/CD integration, and maintaining quality standards in distributed systems. 🗂️ Requirements: 5+ years of experience in software testing or QA, Hands-on experience in test automation, Experience with API testing, Experience with end-to-end testing, Practical knowledge of Python, Practical knowledge of Playwright, Practical knowledge of Robot Framework, Practical knowledge of SQL, Understanding of microservices architecture, Understanding of cloud-based systems, Experience working in Agile environments 📃 Skills: Python, Playwright, Robot, SQL, API, Microservices, CI/CD, Agile, Cloud 🏢 Description: We are looking for an experienced QA Engineer to join a team delivering scalable, high-impact software solutions in a modern microservices environment . In this role, you will be responsible for ensuring high software quality and taking ownership of testing strategy across the full development lifecycle . 
✅ Your responsibilities: Collaborate closely with development and product teams including Product Owners and Architects to define quality standards and testing strategies Prepare and maintain test plans and test cases Develop and maintain automated tests for API and end-to-end layers in distributed systems Perform manual testing for edge cases and complex business scenarios Ensure system quality, performance, and reliability Work with CI/CD pipelines to integrate automated tests and support smooth releases Contribute to improving QA processes and introducing best practices within the team Analyze requirements and system architecture to design effective testing approaches 🧠 Our requirements: Hands-on experience in QA, including test automation 5+ years of experience in software testing or similar role Experience with API and end-to-end testing Practical knowledge of Python, Playwright, Robot Framework, and SQL Understanding of microservices architecture and modern cloud-based systems Understanding of Agile software development processes Strong analytical and teamwork skills with a proactive mindset 🌟 What we offer: Work on challenging and impactful technology projects in a distributed systems environment Collaboration with an experienced and supportive team Opportunities for growth and real influence on product quality Flexible working hours, remote or hybrid setup Training budget and continuous professional development opportunities
Technology
iTeamly
Senior QA Engineer (Robot Framework)
Senior
Remote
Krakow, Poland
16,000 - 23,500 PLN
🏢 Summary: QA Engineer role focused on ensuring high software quality in a modern microservices environment through manual and automated testing across the full development lifecycle. The position involves defining testing strategies, developing API and end-to-end automated tests, and integrating tests into CI/CD pipelines. The role emphasizes ownership of quality standards in distributed systems. 🗂️ Requirements: 5+ years of experience in software testing, Hands-on experience in QA and test automation, Experience with API testing, Experience with end-to-end testing, Practical knowledge of Python, Practical knowledge of Playwright, Practical knowledge of Robot Framework, Practical knowledge of SQL, Understanding of microservices architecture, Understanding of cloud-based systems, Experience working with CI/CD pipelines, Knowledge of Agile methodologies 📃 Skills: Python, Playwright, RobotFramework, SQL, API, CI/CD, Microservices, Agile, Cloud 🏢 Description: We are looking for an experienced QA Engineer to join a team delivering scalable, high-impact software solutions in a modern microservices environment . In this role, you will be responsible for ensuring high software quality and taking ownership of testing strategy across the full development lifecycle . 
✅ Your responsibilities: Collaborate closely with development and product teams including Product Owners and Architects to define quality standards and testing strategies Prepare and maintain test plans and test cases Develop and maintain automated tests for API and end-to-end layers in distributed systems Perform manual testing for edge cases and complex business scenarios Ensure system quality, performance, and reliability Work with CI/CD pipelines to integrate automated tests and support smooth releases Contribute to improving QA processes and introducing best practices within the team Analyze requirements and system architecture to design effective testing approaches 🧠 Our requirements: Hands-on experience in QA, including test automation 5+ years of experience in software testing or similar role Experience with API and end-to-end testing Practical knowledge of Python, Playwright, Robot Framework, and SQL Understanding of microservices architecture and modern cloud-based systems Understanding of Agile software development processes Strong analytical and teamwork skills with a proactive mindset 🌟 What we offer: Work on challenging and impactful technology projects in a distributed systems environment Collaboration with an experienced and supportive team Opportunities for growth and real influence on product quality Flexible working hours, remote or hybrid setup Training budget and continuous professional development opportunities
Technology
Link Group
QA Engineer
Senior
Hybrid
Warsaw, Poland
30,000 - 40,000 PLN
🏢 Summary: The offer is for a QA Engineer responsible for ensuring the quality and reliability of business applications through manual and automated testing across the full software delivery lifecycle. The role involves creating and maintaining automated tests, integrating them into CI/CD pipelines, and collaborating closely with development teams in an AI-driven environment. It combines hands-on testing, quality risk analysis, and continuous improvement of QA practices. 🗂️ Requirements: Minimum 4 years of experience in software quality assurance, Hands-on experience in manual testing, Hands-on experience in test automation, Programming skills in Python, Java, C#, or similar, Experience with version control systems, Knowledge of test automation frameworks, Experience integrating automated tests with CI pipelines, Understanding of software development practices, Ability to analyze requirements and define test coverage, Experience using AI tools in engineering work 📃 Skills: Python, Java, C#, Git, CI/CD, Automation, Testing, AI 🏢 Description: We are looking for a QA Engineer who will help ensure that business applications are reliable, user-friendly, and ready for production use. In this role, you will work closely with developers, project stakeholders, and end users to understand requirements, identify risks early, and support quality throughout the full software delivery lifecycle. The position combines manual testing, test automation, collaboration with technical teams, and continuous improvement of QA practices. It is well suited to someone who enjoys finding issues, improving processes, and using both technical skills and curiosity to deliver better software. The company operates in the financial sector and has a strong AI-oriented culture. Artificial intelligence is used in a practical way to support daily work, automate repetitive tasks, improve efficiency, and speed up delivery. 
As part of the recruitment process, the candidate’s AI mindset will also be assessed, including openness to using modern AI tools, ability to learn with AI support, critical review of AI-generated outputs, responsible usage, and readiness to identify areas where AI can improve QA and engineering work. Responsibilities Prepare and maintain test documentation, including test scenarios, test cases, and testing procedures. Perform different types of testing, including functional, regression, integration, UI, data validation, exploratory, and end-to-end testing. Verify applications from both technical and user perspectives to identify defects and usability improvements. Work with business users to clarify requirements and understand expected system behavior. Cooperate with developers, project managers, and other stakeholders during planning, daily meetings, and delivery activities. Identify quality risks early and suggest practical ways to reduce them. Support improvements in testing standards, development quality, and release readiness. Create and maintain automated tests using programming languages such as Python, Java, or C#. Participate in code reviews related to test automation and quality tooling. Help integrate automated tests into CI/CD processes. Share QA knowledge with the team and contribute to internal documentation or user guidance. Requirements At least 4 years of experience in software quality assurance. Hands-on experience with manual testing and test automation. Programming skills in Python, Java, C#, or a similar language. Understanding of software development practices and ability to work closely with engineering teams. Experience with version control tools such as Git or similar systems. Knowledge of test automation frameworks and interest in connecting automated tests with CI pipelines. Ability to analyze requirements, spot risks, and translate them into effective test coverage. 
Comfortable using AI tools to support learning, testing, automation, and daily engineering tasks. Responsible approach to AI-assisted work, including validation, critical thinking, and awareness of limitations. Strong problem-solving skills and attention to detail. Good communication skills and ability to work with both technical and non-technical stakeholders. Proactive attitude, willingness to learn, and flexibility in taking on different QA-related responsibilities.
Technology
New offer
EPAM Systems
Senior AI QA Engineer (Automation & Manual, AI-based Applications Testing)
Senior
Remote
Gdansk, Poland
🏢 Summary: Fully remote Senior AI QA Engineer role focused on testing AI agents and LLM-based systems through both manual and automated approaches. The position emphasizes building scalable automation frameworks, LLM test harnesses, and robust evaluation processes for Gen AI features. You will collaborate cross-functionally to ensure reliability, accuracy, and quality of AI-driven applications. 🗂️ Requirements: 5+ years of software QA experience, Minimum 1 year testing AI agents or LLM-based systems, Experience in manual and automated testing of AI agents, Strong Python programming skills for test automation, Experience with pytest or equivalent framework, Expertise in prompt engineering and LLM evaluation metrics, Experience evaluating Gen AI/LLM applications (grounding, hallucination, determinism), Knowledge of precision, recall, criteria recall metrics, Experience with Jira, QMetry or TestRail, Experience with CI/CD integration and version control systems, Understanding of AWS cloud environments, Availability to work 13:00–21:00 Polish time 📃 Skills: Python, pytest, LLM, GenAI, LangChain, OpenAI, Jira, QMetry, TestRail, AWS, CI/CD, Git 🏢 Description: We are seeking a skilled Senior AI QA Engineer with strong experience in both manual and automated testing and extensive exposure to AI-based application testing. The ideal candidate will test a variety of applications, including projects involving AI agents and integrations with APIs and databases. You will help ensure our solutions are reliable and accurate and meet business requirements, while also contributing to the development of our automation capabilities. This is a fully remote position with a requirement to work from 13:00 to 21:00 Polish time, due to the client team's location. 
Responsibilities Research and evolve automation frameworks in line with Gen AI tooling and best practices Design and automate evaluation of Gen AI features — grounding, answer accuracy, determinism/reproducibility, precision, recall, and criteria recall Build automated LLM test harnesses that scale evaluation beyond human-in-the-loop Selection and application of Gen AI evaluation frameworks, measuring answer quality and pipeline efficiency Perform manual testing as needed to validate new features, integrations, and user stories Build and maintain test cases from requirements and user stories Test applications that may include AI agents, APIs, databases, and other integrations Collaborate with product, engineering, and operations teams to understand requirements and deployment environments Track and report test results, defects, and quality metrics Assist with troubleshooting production issues and escalate risks as needed Guide and support team members, including onshore and offshore consultants Requirements 5+ years of experience in software QA, with at least 1 year focused on testing AI agents, agentic solutions or LLM-based systems Hands-on experience with both manual and automated testing of AI agents, including prompt/instruction testing and evaluation of agentic workflows Strong programming skills in Python test automation — pytest or equivalent, scripting and AI/ML library integration Expertise in AI agent frameworks, prompt engineering and evaluation metrics for LLM-based systems Demonstrated experience testing and evaluating Gen AI / LLM applications — grounding, answer accuracy and hallucination/determinism checks Applied knowledge of Gen AI / LLM evaluation frameworks and metrics — precision, recall, criteria recall and efficiency Familiarity with issue and test management tools such as Jira, QMetry and TestRail Experience with version control systems and integrating tests into CI/CD pipelines Flexibility to use AI-powered tools for QA such as GitHub 
Copilot and LLM-based test generation Understanding of cloud environments, particularly AWS Excellent communication, collaboration and leadership skills Nice to have Experience with agentic AI platforms such as LangChain, OpenAI Function Calling or similar Experience with AI safety, bias and reliability testing Experience with test data generation for AI/ML systems We offer We gather like-minded people: Engineering community of industry professionals Friendly team and enjoyable working environment Flexible schedule and opportunity to work remotely within Poland Chance to work abroad for up to 60 days annually Business-driven relocation opportunities We provide growth opportunities: Outstanding career roadmap Leadership development, career advising, soft skills, and well-being programs Certification (GCP, Azure, AWS) Unlimited access to LinkedIn Learning, Get Abstract, Cloud Guru English classes We cover it all: Stable income (Employment Contract or B2B) Participation in the Employee Stock Purchase Plan Benefits package (health insurance, multisport, shopping vouchers) Strategically located offices featuring entertainment and relaxation zones, table tennis and football, free snacks, fantastic coffee, and more Referral bonuses Corporate, social and well-being events Please, note: The set of bonuses might vary based on the role you apply for – specifics will be discussed with our recruiter during the general interview. We will reach out to selected candidates exclusively. EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. 
No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
Technology
New offer
EPAM Systems
Senior AI QA Engineer (Automation & Manual, AI-based Applications Testing)
Senior
Remote
Wroclaw, Poland
🏢 Summary: Fully remote Senior AI QA Engineer role focused on testing and automating quality assurance for Gen AI and LLM-based applications, including AI agents and integrated systems. The position combines manual and automated testing, development of LLM test harnesses, and evaluation of model quality metrics such as grounding, precision, and recall. Work is conducted in collaboration with cross-functional teams, with working hours aligned to 13:00–21:00 Polish time. 🗂️ Requirements: 5+ years of software QA experience, Minimum 1 year testing AI agents, agentic solutions or LLM-based systems, Hands-on experience in manual and automated testing of AI agents, Strong Python programming skills for test automation, Experience with pytest or equivalent framework, Experience integrating AI/ML libraries in automated tests, Expertise in AI agent frameworks and prompt engineering, Experience evaluating Gen AI/LLM systems (grounding, accuracy, hallucination, determinism), Knowledge of Gen AI/LLM evaluation metrics (precision, recall, criteria recall, efficiency), Experience with Jira, QMetry or TestRail, Experience with version control systems and CI/CD integration, Understanding of AWS cloud environments, Availability to work 13:00–21:00 Polish time 📃 Skills: Python, pytest, LLM, GenAI, LangChain, OpenAI, AWS, Jira, QMetry, TestRail, CI/CD, Git 🏢 Description: We are seeking a skilled Senior AI QA Engineer with strong experience in both manual and automated testing and extensive exposure to AI-based application testing. The ideal candidate will test a variety of applications, including projects involving AI agents and integrations with APIs and databases. You will help ensure our solutions are reliable and accurate and meet business requirements, while also contributing to the development of our automation capabilities. This is a fully remote position with a requirement to work from 13:00 to 21:00 Polish time, due to the client team's location. 
Responsibilities Research and evolve automation frameworks in line with Gen AI tooling and best practices Design and automate evaluation of Gen AI features — grounding, answer accuracy, determinism/reproducibility, precision, recall, and criteria recall Build automated LLM test harnesses that scale evaluation beyond human-in-the-loop Selection and application of Gen AI evaluation frameworks, measuring answer quality and pipeline efficiency Perform manual testing as needed to validate new features, integrations, and user stories Build and maintain test cases from requirements and user stories Test applications that may include AI agents, APIs, databases, and other integrations Collaborate with product, engineering, and operations teams to understand requirements and deployment environments Track and report test results, defects, and quality metrics Assist with troubleshooting production issues and escalate risks as needed Guide and support team members, including onshore and offshore consultants Requirements 5+ years of experience in software QA, with at least 1 year focused on testing AI agents, agentic solutions or LLM-based systems Hands-on experience with both manual and automated testing of AI agents, including prompt/instruction testing and evaluation of agentic workflows Strong programming skills in Python test automation — pytest or equivalent, scripting and AI/ML library integration Expertise in AI agent frameworks, prompt engineering and evaluation metrics for LLM-based systems Demonstrated experience testing and evaluating Gen AI / LLM applications — grounding, answer accuracy and hallucination/determinism checks Applied knowledge of Gen AI / LLM evaluation frameworks and metrics — precision, recall, criteria recall and efficiency Familiarity with issue and test management tools such as Jira, QMetry and TestRail Experience with version control systems and integrating tests into CI/CD pipelines Flexibility to use AI-powered tools for QA such as GitHub 
Copilot and LLM-based test generation Understanding of cloud environments, particularly AWS Excellent communication, collaboration and leadership skills Nice to have Experience with agentic AI platforms such as LangChain, OpenAI Function Calling or similar Experience with AI safety, bias and reliability testing Experience with test data generation for AI/ML systems We offer We gather like-minded people: Engineering community of industry professionals Friendly team and enjoyable working environment Flexible schedule and opportunity to work remotely within Poland Chance to work abroad for up to 60 days annually Business-driven relocation opportunities We provide growth opportunities: Outstanding career roadmap Leadership development, career advising, soft skills, and well-being programs Certification (GCP, Azure, AWS) Unlimited access to LinkedIn Learning, Get Abstract, Cloud Guru English classes We cover it all: Stable income (Employment Contract or B2B) Participation in the Employee Stock Purchase Plan Benefits package (health insurance, multisport, shopping vouchers) Strategically located offices featuring entertainment and relaxation zones, table tennis and football, free snacks, fantastic coffee, and more Referral bonuses Corporate, social and well-being events Please, note: The set of bonuses might vary based on the role you apply for – specifics will be discussed with our recruiter during the general interview. We will reach out to selected candidates exclusively. EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. 
No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
Technology
New offer
EPAM Systems
Senior AI QA Engineer (Automation & Manual, AI-based Applications Testing)
Senior
Remote
Poznan, Poland
🏢 Summary: Fully remote Senior AI QA Engineer role focused on testing AI agents and LLM-based applications through both manual and automated approaches. The position centers on building scalable LLM test harnesses, designing Gen AI evaluation frameworks, and ensuring reliability, accuracy, and performance of AI-driven systems. The role requires strong Python-based automation skills and experience with AI evaluation metrics, cloud environments, and CI/CD integration. 🗂️ Requirements: 5+ years of software QA experience, Minimum 1 year testing AI agents, agentic solutions or LLM-based systems, Experience with manual and automated testing of AI agents, Strong Python programming skills for test automation, Experience with pytest or equivalent testing frameworks, Expertise in AI agent frameworks and prompt engineering, Experience evaluating Gen AI/LLM systems (grounding, accuracy, hallucination, determinism), Knowledge of evaluation metrics: precision, recall, criteria recall, efficiency, Experience with Jira, QMetry or TestRail, Experience with version control systems, Experience integrating tests into CI/CD pipelines, Understanding of AWS cloud environments, Ability to work 13:00–21:00 Polish time 📃 Skills: Python, pytest, LLM, GenAI, LangChain, OpenAI, AWS, Jira, QMetry, TestRail, CI/CD, Git, Copilot 🏢 Description: We are seeking a skilled Senior AI QA Engineer with strong experience in both manual and automated testing and extensive exposure to AI-based application testing. The ideal candidate will test a variety of applications, including projects involving AI agents and integrations with APIs and databases. You will help ensure our solutions are reliable and accurate and meet business requirements, while also contributing to the development of our automation capabilities. This is a fully remote position with a requirement to work from 13:00 to 21:00 Polish time, due to the client team's location. 
Responsibilities
Research and evolve automation frameworks in line with Gen AI tooling and best practices
Design and automate evaluation of Gen AI features: grounding, answer accuracy, determinism/reproducibility, precision, recall, and criteria recall
Build automated LLM test harnesses that scale evaluation beyond human-in-the-loop
Select and apply Gen AI evaluation frameworks, measuring answer quality and pipeline efficiency
Perform manual testing as needed to validate new features, integrations, and user stories
Build and maintain test cases from requirements and user stories
Test applications that may include AI agents, APIs, databases, and other integrations
Collaborate with product, engineering, and operations teams to understand requirements and deployment environments
Track and report test results, defects, and quality metrics
Assist with troubleshooting production issues and escalate risks as needed
Guide and support team members, including onshore and offshore consultants
Requirements
5+ years of experience in software QA, with at least 1 year focused on testing AI agents, agentic solutions or LLM-based systems
Hands-on experience with both manual and automated testing of AI agents, including prompt/instruction testing and evaluation of agentic workflows
Strong programming skills in Python test automation: pytest or equivalent, scripting and AI/ML library integration
Expertise in AI agent frameworks, prompt engineering and evaluation metrics for LLM-based systems
Demonstrated experience testing and evaluating Gen AI / LLM applications: grounding, answer accuracy and hallucination/determinism checks
Applied knowledge of Gen AI / LLM evaluation frameworks and metrics: precision, recall, criteria recall and efficiency
Familiarity with issue and test management tools such as Jira, QMetry and TestRail
Experience with version control systems and integrating tests into CI/CD pipelines
Flexibility to use AI-powered tools for QA such as GitHub Copilot and LLM-based test generation
Understanding of cloud environments, particularly AWS
Excellent communication, collaboration and leadership skills
Nice to have
Experience with agentic AI platforms such as LangChain, OpenAI Function Calling or similar
Experience with AI safety, bias and reliability testing
Experience with test data generation for AI/ML systems
We offer
We gather like-minded people:
Engineering community of industry professionals
Friendly team and enjoyable working environment
Flexible schedule and opportunity to work remotely within Poland
Chance to work abroad for up to 60 days annually
Business-driven relocation opportunities
We provide growth opportunities:
Outstanding career roadmap
Leadership development, career advising, soft skills, and well-being programs
Certification (GCP, Azure, AWS)
Unlimited access to LinkedIn Learning, Get Abstract, Cloud Guru
English classes
We cover it all:
Stable income (Employment Contract or B2B)
Participation in the Employee Stock Purchase Plan
Benefits package (health insurance, multisport, shopping vouchers)
Strategically located offices featuring entertainment and relaxation zones, table tennis and football, free snacks, fantastic coffee, and more
Referral bonuses
Corporate, social and well-being events
Please note: The set of bonuses might vary based on the role you apply for; specifics will be discussed with our recruiter during the general interview.
We will reach out to selected candidates exclusively.
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
Technology
New offer
EPAM Systems
Senior AI QA Engineer (Automation & Manual, AI-based Applications Testing)
Senior
Remote
Lodz, Poland
🏢 Summary: Fully remote Senior AI QA Engineer role focused on testing AI agents and LLM-based systems through both manual and automated approaches. The position emphasizes building scalable automation frameworks, LLM test harnesses, and evaluating GenAI features using advanced metrics. The role requires strong Python automation skills, experience with AI evaluation frameworks, and collaboration across product and engineering teams.
🗂️ Requirements:
5+ years of software QA experience
At least 1 year testing AI agents or LLM-based systems
Experience with manual and automated testing of AI agents
Strong Python programming for test automation
Experience with pytest or equivalent framework
Expertise in AI agent frameworks and prompt engineering
Experience evaluating Gen AI/LLM systems (grounding, accuracy, hallucination, determinism)
Knowledge of evaluation metrics (precision, recall, criteria recall, efficiency)
Experience with Jira, QMetry, or TestRail
Experience with version control systems
Experience integrating tests into CI/CD pipelines
Understanding of AWS cloud environments
Availability to work 13:00–21:00 Polish time
📃 Skills: Python, pytest, LLM, GenAI, LangChain, OpenAI, AWS, Jira, QMetry, TestRail, CI/CD, Git
🏢 Description: We are seeking a skilled Senior AI QA Engineer with strong experience in both manual and automated testing and extensive exposure to AI-based application testing. The ideal candidate will test a variety of applications, including projects involving AI agents and integrations with APIs and databases. You will help ensure our solutions are reliable and accurate and meet business requirements, while also contributing to the development of our automation capabilities. This is a fully remote position with a requirement to work from 13:00 to 21:00 Polish time, due to the client team's location.
Responsibilities
Research and evolve automation frameworks in line with Gen AI tooling and best practices
Design and automate evaluation of Gen AI features: grounding, answer accuracy, determinism/reproducibility, precision, recall, and criteria recall
Build automated LLM test harnesses that scale evaluation beyond human-in-the-loop
Select and apply Gen AI evaluation frameworks, measuring answer quality and pipeline efficiency
Perform manual testing as needed to validate new features, integrations, and user stories
Build and maintain test cases from requirements and user stories
Test applications that may include AI agents, APIs, databases, and other integrations
Collaborate with product, engineering, and operations teams to understand requirements and deployment environments
Track and report test results, defects, and quality metrics
Assist with troubleshooting production issues and escalate risks as needed
Guide and support team members, including onshore and offshore consultants
Requirements
5+ years of experience in software QA, with at least 1 year focused on testing AI agents, agentic solutions or LLM-based systems
Hands-on experience with both manual and automated testing of AI agents, including prompt/instruction testing and evaluation of agentic workflows
Strong programming skills in Python test automation: pytest or equivalent, scripting and AI/ML library integration
Expertise in AI agent frameworks, prompt engineering and evaluation metrics for LLM-based systems
Demonstrated experience testing and evaluating Gen AI / LLM applications: grounding, answer accuracy and hallucination/determinism checks
Applied knowledge of Gen AI / LLM evaluation frameworks and metrics: precision, recall, criteria recall and efficiency
Familiarity with issue and test management tools such as Jira, QMetry and TestRail
Experience with version control systems and integrating tests into CI/CD pipelines
Flexibility to use AI-powered tools for QA such as GitHub Copilot and LLM-based test generation
Understanding of cloud environments, particularly AWS
Excellent communication, collaboration and leadership skills
Nice to have
Experience with agentic AI platforms such as LangChain, OpenAI Function Calling or similar
Experience with AI safety, bias and reliability testing
Experience with test data generation for AI/ML systems
We offer
We gather like-minded people:
Engineering community of industry professionals
Friendly team and enjoyable working environment
Flexible schedule and opportunity to work remotely within Poland
Chance to work abroad for up to 60 days annually
Business-driven relocation opportunities
We provide growth opportunities:
Outstanding career roadmap
Leadership development, career advising, soft skills, and well-being programs
Certification (GCP, Azure, AWS)
Unlimited access to LinkedIn Learning, Get Abstract, Cloud Guru
English classes
We cover it all:
Stable income (Employment Contract or B2B)
Participation in the Employee Stock Purchase Plan
Benefits package (health insurance, multisport, shopping vouchers)
Strategically located offices featuring entertainment and relaxation zones, table tennis and football, free snacks, fantastic coffee, and more
Referral bonuses
Corporate, social and well-being events
Please note: The set of bonuses might vary based on the role you apply for; specifics will be discussed with our recruiter during the general interview.
We will reach out to selected candidates exclusively.
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
Technology
New offer
EPAM Systems
Senior AI QA Engineer (Automation & Manual, AI-based Applications Testing)
Senior
Remote
Katowice, SL, Poland
🏢 Summary: Fully remote Senior AI QA Engineer role focused on testing and automating evaluation of Gen AI and LLM-based applications, including AI agents and integrated systems. The position combines manual and automated testing, development of LLM test harnesses, and implementation of AI evaluation frameworks to ensure reliability, accuracy, and performance. Work is conducted in collaboration with cross-functional teams, with working hours aligned to 13:00–21:00 Polish time.
🗂️ Requirements:
5+ years of software QA experience
Minimum 1 year testing AI agents, agentic solutions or LLM-based systems
Hands-on experience in manual and automated testing of AI agents
Strong Python programming skills for test automation
Experience with pytest or equivalent testing frameworks
Expertise in AI agent frameworks and prompt engineering
Experience evaluating Gen AI / LLM systems (grounding, accuracy, hallucination, determinism)
Knowledge of Gen AI / LLM evaluation metrics (precision, recall, criteria recall, efficiency)
Experience with Jira, QMetry or TestRail
Experience with version control systems and CI/CD integration
Understanding of AWS cloud environments
Ability to work 13:00–21:00 Polish time
📃 Skills: Python, pytest, LLM, GenAI, LangChain, OpenAI, AWS, Jira, QMetry, TestRail, CI/CD, Git
🏢 Description: We are seeking a skilled Senior AI QA Engineer with strong experience in both manual and automated testing and extensive exposure to AI-based application testing. The ideal candidate will test a variety of applications, including projects involving AI agents and integrations with APIs and databases. You will help ensure our solutions are reliable and accurate and meet business requirements, while also contributing to the development of our automation capabilities. This is a fully remote position with a requirement to work from 13:00 to 21:00 Polish time, due to the client team's location.
Responsibilities
Research and evolve automation frameworks in line with Gen AI tooling and best practices
Design and automate evaluation of Gen AI features: grounding, answer accuracy, determinism/reproducibility, precision, recall, and criteria recall
Build automated LLM test harnesses that scale evaluation beyond human-in-the-loop
Select and apply Gen AI evaluation frameworks, measuring answer quality and pipeline efficiency
Perform manual testing as needed to validate new features, integrations, and user stories
Build and maintain test cases from requirements and user stories
Test applications that may include AI agents, APIs, databases, and other integrations
Collaborate with product, engineering, and operations teams to understand requirements and deployment environments
Track and report test results, defects, and quality metrics
Assist with troubleshooting production issues and escalate risks as needed
Guide and support team members, including onshore and offshore consultants
Requirements
5+ years of experience in software QA, with at least 1 year focused on testing AI agents, agentic solutions or LLM-based systems
Hands-on experience with both manual and automated testing of AI agents, including prompt/instruction testing and evaluation of agentic workflows
Strong programming skills in Python test automation: pytest or equivalent, scripting and AI/ML library integration
Expertise in AI agent frameworks, prompt engineering and evaluation metrics for LLM-based systems
Demonstrated experience testing and evaluating Gen AI / LLM applications: grounding, answer accuracy and hallucination/determinism checks
Applied knowledge of Gen AI / LLM evaluation frameworks and metrics: precision, recall, criteria recall and efficiency
Familiarity with issue and test management tools such as Jira, QMetry and TestRail
Experience with version control systems and integrating tests into CI/CD pipelines
Flexibility to use AI-powered tools for QA such as GitHub Copilot and LLM-based test generation
Understanding of cloud environments, particularly AWS
Excellent communication, collaboration and leadership skills
Nice to have
Experience with agentic AI platforms such as LangChain, OpenAI Function Calling or similar
Experience with AI safety, bias and reliability testing
Experience with test data generation for AI/ML systems
We offer
We gather like-minded people:
Engineering community of industry professionals
Friendly team and enjoyable working environment
Flexible schedule and opportunity to work remotely within Poland
Chance to work abroad for up to 60 days annually
Business-driven relocation opportunities
We provide growth opportunities:
Outstanding career roadmap
Leadership development, career advising, soft skills, and well-being programs
Certification (GCP, Azure, AWS)
Unlimited access to LinkedIn Learning, Get Abstract, Cloud Guru
English classes
We cover it all:
Stable income (Employment Contract or B2B)
Participation in the Employee Stock Purchase Plan
Benefits package (health insurance, multisport, shopping vouchers)
Strategically located offices featuring entertainment and relaxation zones, table tennis and football, free snacks, fantastic coffee, and more
Referral bonuses
Corporate, social and well-being events
Please note: The set of bonuses might vary based on the role you apply for; specifics will be discussed with our recruiter during the general interview.
We will reach out to selected candidates exclusively.
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
Technology
New offer
EPAM Systems
Senior AI QA Engineer (Automation & Manual, AI-based Applications Testing)
Senior
Remote
Warsaw, Poland
🏢 Summary: Fully remote Senior AI QA Engineer role focused on testing AI agents and LLM-based applications through both manual and automated approaches. The position emphasizes building scalable LLM test harnesses, designing GenAI evaluation frameworks, and ensuring quality across AI-driven systems, APIs, and databases. Requires strong Python automation skills and hands-on experience with AI evaluation metrics and cloud environments.
🗂️ Requirements:
5+ years of software QA experience
Minimum 1 year testing AI agents or LLM-based systems
Hands-on experience in manual and automated testing of AI agents
Strong Python programming for test automation
Experience with pytest or equivalent framework
Expertise in AI agent frameworks and prompt engineering
Experience evaluating GenAI/LLM systems (grounding, accuracy, hallucination, determinism)
Knowledge of LLM evaluation metrics (precision, recall, criteria recall, efficiency)
Experience with Jira, QMetry or TestRail
Experience with version control systems
Experience integrating tests into CI/CD pipelines
Understanding of AWS cloud environments
Ability to work 13:00–21:00 Polish time
📃 Skills: Python, pytest, LLM, GenAI, LangChain, OpenAI, AWS, Jira, QMetry, TestRail, Git, CI/CD
🏢 Description: We are seeking a skilled Senior AI QA Engineer with strong experience in both manual and automated testing and extensive exposure to AI-based application testing. The ideal candidate will test a variety of applications, including projects involving AI agents and integrations with APIs and databases. You will help ensure our solutions are reliable and accurate and meet business requirements, while also contributing to the development of our automation capabilities. This is a fully remote position with a requirement to work from 13:00 to 21:00 Polish time, due to the client team's location.
Responsibilities
Research and evolve automation frameworks in line with Gen AI tooling and best practices
Design and automate evaluation of Gen AI features: grounding, answer accuracy, determinism/reproducibility, precision, recall, and criteria recall
Build automated LLM test harnesses that scale evaluation beyond human-in-the-loop
Select and apply Gen AI evaluation frameworks, measuring answer quality and pipeline efficiency
Perform manual testing as needed to validate new features, integrations, and user stories
Build and maintain test cases from requirements and user stories
Test applications that may include AI agents, APIs, databases, and other integrations
Collaborate with product, engineering, and operations teams to understand requirements and deployment environments
Track and report test results, defects, and quality metrics
Assist with troubleshooting production issues and escalate risks as needed
Guide and support team members, including onshore and offshore consultants
Requirements
5+ years of experience in software QA, with at least 1 year focused on testing AI agents, agentic solutions or LLM-based systems
Hands-on experience with both manual and automated testing of AI agents, including prompt/instruction testing and evaluation of agentic workflows
Strong programming skills in Python test automation: pytest or equivalent, scripting and AI/ML library integration
Expertise in AI agent frameworks, prompt engineering and evaluation metrics for LLM-based systems
Demonstrated experience testing and evaluating Gen AI / LLM applications: grounding, answer accuracy and hallucination/determinism checks
Applied knowledge of Gen AI / LLM evaluation frameworks and metrics: precision, recall, criteria recall and efficiency
Familiarity with issue and test management tools such as Jira, QMetry and TestRail
Experience with version control systems and integrating tests into CI/CD pipelines
Flexibility to use AI-powered tools for QA such as GitHub Copilot and LLM-based test generation
Understanding of cloud environments, particularly AWS
Excellent communication, collaboration and leadership skills
Nice to have
Experience with agentic AI platforms such as LangChain, OpenAI Function Calling or similar
Experience with AI safety, bias and reliability testing
Experience with test data generation for AI/ML systems
We offer
We gather like-minded people:
Engineering community of industry professionals
Friendly team and enjoyable working environment
Flexible schedule and opportunity to work remotely within Poland
Chance to work abroad for up to 60 days annually
Business-driven relocation opportunities
We provide growth opportunities:
Outstanding career roadmap
Leadership development, career advising, soft skills, and well-being programs
Certification (GCP, Azure, AWS)
Unlimited access to LinkedIn Learning, Get Abstract, Cloud Guru
English classes
We cover it all:
Stable income (Employment Contract or B2B)
Participation in the Employee Stock Purchase Plan
Benefits package (health insurance, multisport, shopping vouchers)
Strategically located offices featuring entertainment and relaxation zones, table tennis and football, free snacks, fantastic coffee, and more
Referral bonuses
Corporate, social and well-being events
Please note: The set of bonuses might vary based on the role you apply for; specifics will be discussed with our recruiter during the general interview.
We will reach out to selected candidates exclusively.
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.