June 8, 2026

Site Reliability Engineer - Observability

Senior • Remote

Warsaw, Poland

Job description

Company Description

At CluePoints, we’re redefining how clinical trials are run. As the premier provider of Risk-Based Quality Management (RBQM) and Data Quality Oversight software, we harness advanced statistics, artificial intelligence, and machine learning to ensure the quality, accuracy, and integrity of clinical trial data, helping life sciences organizations bring safer, more effective treatments to patients faster.

We’re proud to be an ambitious, fast-growing technology scale-up with a dynamic and diverse international team representing more than 20 nationalities. Collaboration, flexibility, and continuous learning are part of our DNA.

At CluePoints, you’ll find a culture where you can grow, make an impact, and have fun along the way. Guided by our values of Care, Passion, and Smart Disruption, we’re united by a shared mission: to create smarter ways to run efficient clinical trials and deliver AI-powered insights that improve human outcomes worldwide.

Role:
The Site Reliability Engineer, Observability & RUM is responsible for improving end-to-end observability across our platforms and customer-facing applications, with a particular focus on frontend and Real User Monitoring (RUM). This role combines core SRE practices with ownership of monitoring, logging, tracing, alerting, and user-experience telemetry in production.

You will help evolve our observability capabilities across Azure and Kubernetes environments, improve incident detection and diagnosis, and support decisions around managed versus self-managed observability tooling. You will partner closely with Engineering, Support, QA, and Security teams to ensure systems ship with actionable telemetry, dashboards, alerts, and operational runbooks.

Job requirements

5+ years of experience in Site Reliability Engineering, DevOps, Platform Engineering, or Observability Engineering roles.
Strong hands-on experience with observability and monitoring platforms, including several of the following: Elastic, Grafana, Prometheus, OpenTelemetry, Sentry, monitoring agents, and managed APM/observability platforms.
Experience implementing and supporting Real User Monitoring (RUM) and frontend/application observability in production environments.
Ability to work across frontend, backend, and platform teams to improve telemetry, alerting, and incident diagnosis.
Experience evaluating or operating managed observability platforms and understanding the trade-offs versus self-managed stacks.

(Nice to have)

Experience supporting ML, AI, or LLM-backed services in production (RAG, LangSmith, Arize Phoenix, LangChain, LangGraph, Azure OpenAI, OpenAI, or Anthropic APIs).

Job responsibilities

Own and improve Real User Monitoring (RUM) for customer-facing applications, including browser performance, client-side errors, user journeys, and frontend service dependencies.
Partner with frontend, product, and engineering teams to improve visibility into user experience, JavaScript/runtime failures, page performance, and customer-impacting issues.
Establish and maintain end-to-end observability across frontend, backend, infrastructure, and Kubernetes environments using metrics, logs, traces, dashboards, and alerting.
Evaluate, implement, and operate managed and self-managed observability solutions, helping guide the evolution of the observability stack.
Support and improve observability tooling such as Sentry, Elastic, Grafana, Prometheus, OpenTelemetry, monitoring agents, and related APM platforms.
Define and maintain SLIs, SLOs, and alerting strategies that improve service reliability, reduce noise, and enable faster detection of production issues.
Lead or support incident detection, alert triage, live production troubleshooting, and service restoration across outage, latency, batch, file transfer, and degradation scenarios, in partnership with Support and Production teams.

Job benefits

🇵🇱 What We Offer – Poland

Comprehensive Health Insurance (medical, dental, and online consultations, 100% employee coverage)
Life Insurance through UNUM
Cafeteria Plan with flexible monthly credits for wellness, entertainment, and travel
MultiSport Card, co-financed 50/50
Employee Capital Plans (PPK) with 4% employer contribution
A hub-based hybrid model that blends flexibility with purpose — connecting teams through collaboration, learning, and a vibrant social culture.

Equal Opportunities & Data Privacy Statement
CluePoints is an equal opportunity employer committed to diversity and inclusion in the workplace.
Your personal data will be processed by CluePoints for recruitment purposes in accordance with Regulation (EU) 2016/679 (GDPR).
If you wish for your data to be retained for future opportunities, please include the following statement in your CV:
“I consent to the processing of my personal data by CluePoints for the purposes of future recruitment processes.”

Similar jobs you might like

Healthcare

CluePoints

Test Manager

Senior

Remote

Warsaw, Poland

🏢 Summary: Leadership role responsible for defining and driving an automation-first quality strategy across multiple product squads in a clinical trial software environment. Combines hands-on test automation framework design with team leadership, release governance, and AI-driven quality engineering practices. Ensures end-to-end quality outcomes, risk-based testing, and release readiness aligned with product goals. 🗂️ Requirements: Proven experience leading quality engineering across multiple squads, Strong people management and mentoring experience, Hands-on experience with Playwright or similar automation frameworks, Experience in UI, API, integration, and end-to-end testing, Experience integrating automated tests into CI/CD pipelines, Strong knowledge of regression strategy and risk-based testing, Experience leading Release Readiness and Go/No-Go processes, Experience using AI coding assistants in testing workflows, Understanding of validation, traceability, and compliance practices 📃 Skills: Playwright, Automation, Testing, API, UI, E2E, CI/CD, AI, Copilot, Codex, Regression, RBQM, Compliance, Debugging, Frameworks 🏢 Description: About the job Company Description At CluePoints, we’re redefining how clinical trials are run. As the premier provider of Risk-Based Quality Management (RBQM) and Data Quality Oversight software, we harness advanced statistics, artificial intelligence, and machine learning to ensure the quality, accuracy, and integrity of clinical trial data, helping life sciences organizations bring safer, more effective treatments to patients faster. We’re proud to be an ambitious, fast-growing technology scale-up with a dynamic and diverse international team representing more than 20 nationalities. Collaboration, flexibility, and continuous learning are part of our DNA. At CluePoints, you’ll find a culture where you can grow, make an impact, and have fun along the way. 
Guided by our values of Care, Passion, and Smart Disruption , we’re united by a shared mission: to create smarter ways to run efficient clinical trials and deliver AI-powered insights that improve human outcomes worldwide. The Role Reporting directly to the Quality Director, you will own the testing strategy and quality outcomes for 3–5 squads within a product domain. You will define and drive an automation-first testing approach while remaining technically involved in framework design, debugging, and quality oversight. You are the direct manager for the testers in your squad and will collaborate closely with Product and Engineering teams, to ensure risk-based quality practices, test readiness, and delivery support. The role will be a combination of both 'hands on' + Leadership & Strategy. What You’ll Be Doing Quality Strategy & Squad Leadership Own end-to-end quality outcomes across multiple squads. Define and execute a scalable quality strategy aligned with product and release goals. Lead squad-level risk analysis, test planning, and execution oversight. Mentor and grow 5–8 testers (manual & automation) with goal to bring team towards hybrid testers. Automation & Technical Leadership Drive automation-first testing practices across squads. Design, evolve, and review scalable UI/API/E2E automation frameworks (Playwright or equivalent). Optimize regression and end-to-end strategies to improve release confidence. Stay hands-on in automation design, debugging, and framework improvements. AI-First Quality Engineering Champion AI-driven approaches in test design, automation development, and defect analysis. Enable teams to effectively use AI tools (e.g., Copilot, Codex) to accelerate quality engineering. Ensure AI-generated artifacts are reviewed, reliable, and maintainable. Release Readiness & Governance Lead Release Readiness and Go/No-Go meetings with structured quality metrics and risk insights. 
Collaborate with Product Owners, Engineering Managers, Product Managers, and Release Management. Ensure validation documentation, traceability, and compliance requirements are met. Identify risks early and implement mitigation strategies proactively. What You’ll Bring Proven experience leading quality engineering across multiple cross-functional squads. Strong people management and mentoring experience. Hands-on automation experience (Playwright or modern frameworks). Full-stack testing expertise (API, UI, integration, end-to-end). Experience integrating automation into CI/CD pipelines. Strong understanding of regression strategy, risk-based testing, and release governance. Experience driving Release Readiness / Go-NoGo meetings. Demonstrated use of AI coding assistants and AI-first testing approaches. Excellent communication, stakeholder management, and critical thinking skills.

Technology

CluePoints

Senior DevOps Engineer

Senior

Remote

Warsaw, Poland

🏢 Summary: Full-time DevOps/SRE role focused on automating and optimizing the Software Development Lifecycle, including CI/CD pipelines, build and release processes, and developer infrastructure. The position aims to improve integration speed, reliability, and collaboration by advancing trunk-based development and modern DevOps practices. You will work closely with development and SRE teams to enhance automation, infrastructure, and software delivery performance. 🗂️ Requirements: 6+ years in DevOps or SRE in Software or SaaS environment, Strong Linux systems administration experience, Proficiency in Bash, Python or Perl scripting, Experience with Git and branching strategies, Hands-on experience with CI/CD systems, Experience with trunk-based development practices, Strong knowledge of Docker and Kubernetes, Experience with Terraform and Ansible, Experience automating build and release processes, Understanding of DevOps and SRE principles, Experience managing CI infrastructure and developer tooling 📃 Skills: Linux, Bash, Python, Perl, Git, GitLab, GitHub, Jenkins, ArgoCD, Docker, Kubernetes, Terraform, Ansible, CI/CD, SonarQube, Ontrack 🏢 Description: Job description Company Description At CluePoints, we’re redefining how clinical trials are run. As the premier provider of Risk-Based Quality Management (RBQM) and Data Quality Oversight software, we harness advanced statistics, artificial intelligence, and machine learning to ensure the quality, accuracy, and integrity of clinical trial data, helping life sciences organizations bring safer, more effective treatments to patients faster. We’re proud to be an ambitious, fast-growing technology scale-up with a dynamic and diverse international team representing more than 20 nationalities. Collaboration, flexibility, and continuous learning are part of our DNA. 
At CluePoints, you’ll find a culture where you can grow, make an impact, and have fun along the way.Guided by our values of Care, Passion, and Smart Disruption , we’re united by a shared mission: to create smarter ways to run efficient clinical trials and deliver AI-powered insights that improve human outcomes worldwide. The Role In this role, you’ll work at the heart of our Software Development Lifecycle (SDLC) automation efforts. You’ll be responsible for improving how we integrate, build, and release code — ensuring that developers can deliver value quickly and safely. This role is ideal for someone passionate about DevOps practices, automation, and enabling developer productivity at scale. (We cannot consider B2B Contractors for this Position) - Full time Permanent Employee applications will be considered for this vacancy. Job requirements What You’ll Bring 6+ years of experience in a DevOps or SRE role in a Software or SaaS environment Strong experience with Linux systems administration Proficiency in scripting languages (e.g., Bash, Python, or Perl) Solid experience with Git and branching strategies , with a focus on improving collaboration and enabling high-frequency integration Strong experience with CI/CD systems (e.g., GitLab CI, GitHub Actions, Jenkins, ArgoCD) and building reliable, fast feedback pipelines Good understanding of modern CI practices , including frequent integration, maintaining a stable main branch, and reducing merge complexity Experience supporting or working in environments moving toward trunk-based development (e.g., short-lived branches, incremental changes) Deep understanding of containerization tools (Docker) and orchestration (Kubernetes) Experience with infrastructure-as-code and automation tools (Terraform, Ansible) Strong understanding of DevOps principles: automation, feedback loops, and shift-left testing Experience automating software build and release processes A strong grasp of software integration workflows, dependency 
management, and production readiness Understanding of SRE principles and how reliability and infrastructure practices support software delivery Ability to work cross-functionally and guide teams toward improved engineering practices Job responsibilities What You’ll Be Doing Design, build, and maintain shared CI/CD pipeline templates and automation tools with a focus on fast feedback and reliable integration Develop and support internal tools for build, test, and release automation Collaborate with development teams to improve how code is integrated, tested, and prepared for production Help define and evolve best practices for source control, branching strategies, and code collaboration , supporting a move toward trunk-based development Guide teams in adopting incremental, high-quality changes that reduce risk and improve delivery flow Partner with SRE teams to align deployment strategies, observability, and infrastructure practices with application delivery Manage, optimize, and monitor developer infrastructure (CI runners, SonarQube, Ontrack, artifact repositories, etc.) Drive improvements in release readiness, code quality, and testing practices across teams Identify and remove bottlenecks in the software delivery lifecycle , improving speed without compromising reliability Continuously evaluate new tools and technologies to improve our platform and developer experience Job benefits 🇬🇧 What We Offer – United Kingdom Private Medical Insurance through Vitality Health (full hospital cover, 24/7 GP, and therapy sessions) Group Critical Illness Cover with Aviva Life Insurance (death-in-service lump sum) Pension Scheme with 9% employer contribution via Scottish Widows Opportunities for professional development and sponsored certifications A hub-based hybrid model that blends flexibility with purpose — connecting teams through collaboration, learning, and a vibrant social culture. 
🇬🇧 Equal Opportunities & Data Protection Statement CluePoints is an equal opportunities employer. We celebrate diversity and are committed to creating an inclusive environment for all employees and applicants. We welcome applications from all individuals regardless of age, disability, gender identity or expression, marital or civil partnership status, pregnancy or maternity, race, religion or belief, sex, or sexual orientation. Any personal data you share during your application will be processed in accordance with the UK GDPR and the Data Protection Act 2018 and will be used solely for recruitment purposes. By submitting your application, you consent to the processing of your data for recruitment and employment purposes.

Technology

SmartRecruiters Inc.

Senior Site Reliability Engineer II

Senior

Remote

Warsaw, Poland

25,000 - 35,000 PLN

🏢 Summary: Senior Site Reliability Engineer role focused on strengthening reliability and observability of a large-scale cloud platform. The position involves driving SRE best practices, improving monitoring and automation, and partnering with product teams to design resilient services. The engineer will lead incident response, enhance tooling, and support scalable, distributed systems in a cloud-native environment. 🗂️ Requirements: 7+ years professional engineering experience, Strong knowledge of SRE practices (SLIs, SLOs, error budgets, incident management, on-call), Experience with JVM-based systems, Hands-on experience with AWS or other cloud providers, Experience with Kubernetes and distributed systems, Experience with Infrastructure as Code, Deep knowledge of Linux administration and troubleshooting, Strong scripting skills (Bash, Golang or Python), Experience with monitoring and observability tools, Ability to manage and troubleshoot production incidents 📃 Skills: AWS, Kubernetes, Linux, Bash, Golang, Python, JVM, Java, Node.js, IaC, TCP/IP, DNS, VPN, SQL, NoSQL, Observability, Monitoring, Docker 🏢 Description: Company Description 🚀 SmartRecruiters transforms hiring for the world’s leading enterprises. We deliver an AI-powered hiring platform built for global scale, automating and optimizing the entire talent acquisition process. More than 4,000 companies, including LinkedIn, McDonald's, VISA, CD Projekt Red, Allegro rely on SmartRecruiters to build winning teams. 🚀 In 2025, SmartRecruiters joined SAP, the global leader in enterprise applications. Together, we are accelerating the reinvention of hiring by combining AI innovation with the scale and resources of SAP’s ecosystem. We designed our R&D structure based on the empowered product teams model. It means our teams are responsible for business outcomes and have autonomy in solving problems in the way that “customers love yet work for the business” (yes, we are heavily influenced by this and that ). 
Job Description The SmartRecruiters Internal Engineering Team is looking for a Senior Site Reliability Engineer to our reliability initiatives and help us strengthen the reliability and observability of our platform at scale. If you are passionate about cloud, networking, observability, and partnering with product teams to curate reliability practices, we have a spot with your name on it! Important: the position is available only under a standard contract of employment with 80% of tax deductible cost. You may be located anywhere in Poland and work remotely or out of our Cracow office. What you’ll deliver: Cooperate closely with other Platform and Engineering teams on strategic reliability and observability initiatives across SmartRecruiters Improve, automate and grow SmartRecruiters observability and reliability tooling (metrics, logs, traces, alerting) Respond to production incidents and client threats, lead remediation, and drive follow‑up improvements Partner with product engineers working in Java, Node.js, and Python to design, instrument, and operate services for failure, owning SLIs/SLOs and error budgets together Create reusable building blocks (dashboards, alerts, libraries and IaC modules) that can be rolled out company‑wide Mentor members of the engineering team and act as an advocate for modern SRE and observability practices Document standards, best practices, and policies for monitoring, alerting, incident response, and reliability Conduct capacity planning and performance testing of platform We want you to: Make a difference Have a positive, can-do attitude Do the right thing, not the "easy" thing Give and receive support from our awesome engineering team Qualifications: While not strictly required, we see most of our Senior Engineers have 7+ years of professional experience Working knowledge of SRE and observability industry standards and best practices (SLIs/SLOs, error budgets, incident management, on‑call) Engineering experience in JVM stack 
Experience with AWS (or other cloud provider), Kubernetes, and IaC tools and practices, including running and troubleshooting distributed applications Proven track record of delivering solutions for reliability, monitoring, and container management Deep knowledge of the Linux operating system, with a focus on system hardening and troubleshooting performance issues Very good scripting skills (Bash, Golang or Python) Experience managing and troubleshooting database systems, both SQL and NoSQL is a plus Solid understanding of networking standards, including TCP/IP, DNS, VPN and load balancing is a plus Comfortable partnering with teams to design resilient data access and use database observability to prevent and resolve incidents Strong communication skills, with a good understanding of English, both verbal and written, and the ability to coach and influence other engineers Benefits: We support 100% remote work with Wi-Fi reimbursement and an additional stipend for the equipment (the MacBook laptop is provided by us) Unlimited vacation days (yes - it's really unlimited) Private Medical Care for you and your dependents (Luxmed) Wellness Programme (Multisport Card and even more) Company-wide shutdowns in August and around Christmas Additional information SmartRecruiters is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based on race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.

Technology

Grid Dynamics Poland

Site Reliability Engineer

Senior

Hybrid

Warsaw, Poland

🏢 Summary: Site Reliability Engineer role focused on leading the cloud platform layer of a large-scale enterprise migration to GCP, with full ownership of observability and FinOps capabilities. The position involves architecting cost attribution, distributed tracing, monitoring, and performance engineering solutions in a production-grade Kubernetes environment. You will work on complex distributed systems, extending multi-language codebases and managing infrastructure as code in a regulated enterprise setting. 🗂️ Requirements: 4–6 years software or DevOps engineering experience, 2–3 years hands-on cloud infrastructure management in production, Strong GCP expertise including GKE and Cloud Run, Proven experience building observability solutions with OpenTelemetry, Experience with distributed tracing and profiling in distributed systems, Advanced Python scripting for automation and tooling, Strong Terraform proficiency with multi-environment setups, Ability to read and modify Kotlin and Java codebases, Experience implementing monitoring, alerting, and SLOs for containerized/serverless services, Experience with infrastructure cost attribution and cloud billing APIs 📃 Skills: GCP, GKE, CloudRun, Kubernetes, OpenTelemetry, Terraform, Python, Kotlin, Java, FinOps, PubSub, Bigtable, Docker, SLO, Tracing 🏢 Description: We are looking for a Site Reliability Engineer to join a high-stakes global tech ecosystem and drive the delivery of a critical enterprise platform migration to the cloud. Your core mission will be to architect, build, and productionalize the observability and cost intelligence (FinOps) layer for a massive, multi-year financial platform transformation. You will take end-to-end ownership of the cloud platform layer, giving internal stakeholders full visibility into platform behavior, performance, and infrastructure spend. 
Working alongside a nearshore team of senior engineers, you will solve highly complex architectural challenges in a production-grade, distributed system. Responsibilities: End-to-End Infrastructure & FinOps Ownership: Architect and implement a cloud usage and cost attribution dashboard, providing detailed per-pod and per-service cost breakdown using cloud billing APIs and internal FinOps hubs. Advanced Observability & Tracing: Instrument end-to-end distributed tracing using OpenTelemetry, configuring collectors within Kubernetes environments and exporting traces to cloud monitoring systems utilizing RED metrics. Performance Engineering & Stress Testing: Write custom tooling from scratch to deliver database performance monitoring, load testing, and trend analysis for critical underlying storage layers. Monitoring & Alerting Automation: Build and deploy scalable production monitoring, custom alerting policies, and SLO tracking for containerized and serverless services. Infrastructure as Code: Independently manage, write, and apply infrastructure modifications using Terraform, working within established enterprise repository standards, modules, and environment state management. Cross-Language Codebase Extension: Read, debug, and extend existing platform code across a diverse stack including Kotlin, Java, and Python to seamlessly integrate technical metrics without disrupting business logic. Quality & Release Assurance: Implement rigorous unit testing with high code coverage for all newly developed monitoring tools to comply with strict enterprise quality gates and sign-offs. Min requirements: Experience: 4 to 6 years of professional software or DevOps engineering experience, with at least 2 to 3 years of hands-on cloud infrastructure management in production. Advanced Cloud Infrastructure: Deep operational proficiency with Google Cloud Platform (GCP), specifically with managing and configuring workload-level alerting on Google Kubernetes Engine (GKE) and Cloud Run. 
Observability & OpenTelemetry: Proven track record of building observability solutions in distributed systems, using OpenTelemetry (both auto and manual instrumentation) alongside distributed tracing and profiling tools. Strong Automation Scripting: Intermediate-to-advanced fluency in Python for writing custom test tooling, metrics integration scripts, and backend automation from scratch. Solid Infrastructure as Code: Strong proficiency in Terraform, including experience with multi-environment setups, workspaces, and corporate module standards. Polyglot & JVM Familiarity: Practical ability to read, understand, and modify existing backend codebases written in Kotlin and Java. Crucial Non-Technical Skills: Extreme technical autonomy to resolve blockers independently, rapid onboarding skills into large unfamiliar codebases, and fluent written English for async alignment and pull requests. Process Alignment: Ability to thrive in a highly regulated enterprise environment with strict peer reviews, robust documentation requirements, and formal deployment procedures. Would be a plus: Domain Knowledge: Previous experience working within financial services, fintech, investment banking, or other highly regulated industries. Enterprise Streaming Tools: Working knowledge of cloud messaging systems (such as Cloud Pub/Sub) utilized for inter-service communication. Advanced Storage Engines: Familiarity with high-throughput distributed database architectures, such as Google Cloud Bigtable. Systems Languages Awareness: Ability to read or debug foundational code written in low-level systems languages like Rust or C++ during multi-stack production deployments. 
We offer: Opportunity to work on bleeding-edge projects Work with a highly motivated and dedicated team Competitive salary Flexible schedule Benefits package - medical insurance, sports Corporate social events Professional development opportunities Well-equipped office About us: Grid Dynamics (NASDAQ: GDYN) is a leading provider of technology consulting, platform and product engineering, AI, and advanced analytics services. Fusing technical vision with business acumen, we solve the most pressing technical challenges and enable positive business outcomes for enterprise companies undergoing business transformation. A key differentiator for Grid Dynamics is our 8 years of experience and leadership in enterprise AI , supported by profound expertise and ongoing investment in data , analytics , cloud & DevOps , application modernization and customer experience . Founded in 2006, Grid Dynamics is headquartered in Silicon Valley with offices across the Americas, Europe, and India.

Technology

EPAM Systems

Senior C++ Engineer with Observability

Senior

Remote

Katowice, Poland

🏢 Summary: Senior C++ Engineer with strong observability expertise to lead the design and implementation of monitoring and telemetry solutions across the software development lifecycle. The role focuses on low-latency, high-performance distributed systems, ensuring measurable customer experience, reliability, and operational insight. Combines hands-on engineering with technical leadership in large-scale production environments.

🗂️ Requirements:
5+ years of observability engineering experience (metrics, tracing, logging)
Strong C++ engineering skills
Experience with profiling and telemetry pipelines
Experience building large-scale monitoring or observability platforms
Knowledge of latency-sensitive market data systems
Expertise in OpenTelemetry, eBPF, and GitOps
Experience with API-driven automation and CI/CD-integrated observability
Knowledge of cloud-native, Kubernetes, and distributed systems architectures
Ability to define standards and guide engineering teams
English proficiency at B2 level or above

📃 Skills: C++, OpenTelemetry, eBPF, GitOps, Kubernetes, CI/CD, APIs, Cloud-native, Distributed-systems, Metrics, Tracing, Logging, Profiling, Telemetry

🏢 Description: We are looking for a Senior C++ Engineer with Observability expertise to spearhead our observability implementation. This position blends practical engineering with technical leadership, guaranteeing that customer experience, system reliability, and operational insight remain measurable and integrated across the entire software development lifecycle. The perfect candidate will offer substantial experience in low-latency trading, market data, or similar high-performance distributed systems, paired with a solid C++ foundation and a proven history of delivering production monitoring, telemetry, and operational tooling at scale.

Responsibilities
Spearhead observability implementation throughout the software development lifecycle
Guarantee that customer experience, system reliability, and operational insight stay measurable and integrated
Construct and maintain large-scale monitoring, observability, or control platforms
Set standards and promote the adoption of observability best practices
Collaborate effectively with development teams to deliver telemetry and operational tooling
Deploy metrics, tracing, logging, profiling, and telemetry pipelines
Advance customer-experience measurement for latency-sensitive market data systems
Guide engineering teams toward shared observability goals

Requirements
More than 5 years of experience in observability engineering, covering metrics, tracing, and logging
Strong skills in profiling and telemetry pipelines
Knowledge of customer-experience measurement for latency-sensitive market data systems
Experience building and maintaining large-scale monitoring, observability, or control platforms
Strong C++ engineering skills along with the capacity to collaborate effectively with development teams
Expertise in OpenTelemetry, eBPF, and GitOps
Capability in API-driven automation and CI/CD-integrated observability practices
Knowledge of cloud-native, Kubernetes, and distributed systems architectures
Demonstrated ability to guide engineering teams and define standards
English proficiency at B2 level or above

Nice to have
Experience in trading or market-making
Knowledge of exchange connectivity
Understanding of market data environments

We offer

We gather like-minded people:
Engineering community of industry professionals
Friendly team and enjoyable working environment
Flexible schedule and opportunity to work remotely within Poland
Chance to work abroad for up to 60 days annually
Business-driven relocation opportunities

We provide growth opportunities:
Outstanding career roadmap
Leadership development, career advising, soft skills, and well-being programs
Certification (GCP, Azure, AWS)
Unlimited access to LinkedIn Learning, Get Abstract, Cloud Guru
English classes

We cover it all:
Stable income (Employment Contract or B2B)
Participation in the Employee Stock Purchase Plan
Benefits package (health insurance, multisport, shopping vouchers)
Strategically located offices featuring entertainment and relaxation zones, table tennis and football, free snacks, fantastic coffee, and more
Referral bonuses
Corporate, social and well-being events

Please note: The set of bonuses might vary based on the role you apply for – specifics will be discussed with our recruiter during the general interview. We will reach out to selected candidates exclusively.

EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.

Technology

EPAM Systems

Senior C++ Engineer with Observability

Senior

Remote

Lodz, LD, Poland

🏢 Summary: Senior C++ Engineer role focused on leading observability implementation across the software development lifecycle in low-latency, high-performance distributed systems. The position combines hands-on C++ engineering with technical leadership to deliver scalable monitoring, telemetry, and operational tooling. The role emphasizes measurable customer experience, system reliability, and operational insight in market data or trading environments.

🗂️ Requirements:
5+ years of observability engineering experience (metrics, tracing, logging)
Strong C++ engineering skills
Experience with profiling and telemetry pipelines
Experience building and maintaining large-scale monitoring or observability platforms
Expertise in OpenTelemetry, eBPF, and GitOps
Experience with API-driven automation and CI/CD-integrated observability
Knowledge of cloud-native, Kubernetes, and distributed systems architectures
Knowledge of customer-experience measurement for latency-sensitive systems
Ability to define standards and guide engineering teams
English proficiency at B2 level or higher

📃 Skills: C++, OpenTelemetry, eBPF, GitOps, Kubernetes, CI/CD, APIs, Telemetry, Tracing, Logging, Profiling, Cloud-native

🏢 Description: We are looking for a Senior C++ Engineer with Observability expertise to spearhead our observability implementation. This position blends practical engineering with technical leadership, guaranteeing that customer experience, system reliability, and operational insight remain measurable and integrated across the entire software development lifecycle. The perfect candidate will offer substantial experience in low-latency trading, market data, or similar high-performance distributed systems, paired with a solid C++ foundation and a proven history of delivering production monitoring, telemetry, and operational tooling at scale.

Technology

EPAM Systems

Senior C++ Engineer with Observability

Senior

Remote

Krakow, Poland

🏢 Summary: Senior C++ Engineer role focused on leading observability implementation across the software development lifecycle for low-latency, high-performance distributed systems. The position combines hands-on engineering with technical leadership to ensure measurable customer experience, system reliability, and operational insight. It involves building large-scale monitoring platforms and advancing telemetry practices in latency-sensitive market data environments.

🗂️ Requirements:
5+ years of experience in observability engineering (metrics, tracing, logging)
Strong proficiency in C++
Experience with profiling and telemetry pipelines
Experience building and maintaining large-scale monitoring or observability platforms
Expertise in OpenTelemetry, eBPF, and GitOps
Experience with API-driven automation and CI/CD-integrated observability
Knowledge of cloud-native, Kubernetes, and distributed systems architectures
Experience with customer-experience measurement in latency-sensitive systems
Ability to define standards and guide engineering teams
English proficiency at B2 level or higher

📃 Skills: C++, OpenTelemetry, eBPF, GitOps, Kubernetes, CI/CD, APIs, Telemetry, Tracing, Logging, Profiling, Cloud-native, DistributedSystems

🏢 Description: We are looking for a Senior C++ Engineer with Observability expertise to spearhead our observability implementation. This position blends practical engineering with technical leadership, guaranteeing that customer experience, system reliability, and operational insight remain measurable and integrated across the entire software development lifecycle. The perfect candidate will offer substantial experience in low-latency trading, market data, or similar high-performance distributed systems, paired with a solid C++ foundation and a proven history of delivering production monitoring, telemetry, and operational tooling at scale.

Technology

EPAM Systems

Senior C++ Engineer with Observability

Senior

Remote

Warsaw, Poland

🏢 Summary: Senior C++ Engineer role focused on leading observability implementation across the software development lifecycle in high-performance, low-latency distributed systems. The position combines hands-on engineering with technical leadership to build and scale monitoring, telemetry, and operational tooling for market data environments. It emphasizes reliability, customer-experience measurement, and cloud-native observability practices.

🗂️ Requirements:
5+ years of experience in observability engineering (metrics, tracing, logging)
Strong C++ engineering skills
Experience with profiling and telemetry pipelines
Experience building large-scale monitoring or observability platforms
Knowledge of latency-sensitive market data systems
Expertise in OpenTelemetry
Expertise in eBPF
Experience with GitOps practices
Experience with API-driven automation
Experience integrating observability with CI/CD
Knowledge of Kubernetes
Knowledge of cloud-native architectures
Knowledge of distributed systems architectures
Ability to define standards and guide engineering teams
English proficiency at B2 level or higher

📃 Skills: C++, OpenTelemetry, eBPF, GitOps, Kubernetes, CI/CD, APIs, Telemetry, Tracing, Logging, Profiling, Cloud-native, DistributedSystems

🏢 Description: We are looking for a Senior C++ Engineer with Observability expertise to spearhead our observability implementation. This position blends practical engineering with technical leadership, guaranteeing that customer experience, system reliability, and operational insight remain measurable and integrated across the entire software development lifecycle. The perfect candidate will offer substantial experience in low-latency trading, market data, or similar high-performance distributed systems, paired with a solid C++ foundation and a proven history of delivering production monitoring, telemetry, and operational tooling at scale.
Responsibilities
Spearhead observability implementation throughout the software development lifecycle
Guarantee that customer experience, system reliability, and operational insight stay measurable and integrated
Construct and maintain large-scale monitoring, observability, or control platforms
Set standards and promote the adoption of observability best practices
Collaborate effectively with development teams to deliver telemetry and operational tooling
Deploy metrics, tracing, logging, profiling, and telemetry pipelines
Advance customer-experience measurement for latency-sensitive market data systems
Guide engineering teams toward shared observability goals

Requirements
More than 5 years of experience in observability engineering, covering metrics, tracing, and logging
Strong skills in profiling and telemetry pipelines
Knowledge of customer-experience measurement for latency-sensitive market data systems
Experience building and maintaining large-scale monitoring, observability, or control platforms
Strong C++ engineering skills along with the capacity to collaborate effectively with development teams
Expertise in OpenTelemetry, eBPF, and GitOps
Capability in API-driven automation and CI/CD-integrated observability practices
Knowledge of cloud-native, Kubernetes, and distributed systems architectures
Demonstrated ability to guide engineering teams and define standards
English proficiency at B2 level or above

Nice to have
Experience in trading or market-making
Knowledge of exchange connectivity
Understanding of market data environments

We offer

We gather like-minded people:
Top tech minds driving innovation in AI, cloud and digital platform modernization
Supportive team and agile, startup-like culture
Hybrid by design mode and opportunity to work remotely within Poland
Chance to work abroad for up to 60 days annually
Business-driven relocation opportunities

We provide growth opportunities:
Career development programs
Thought leadership, mentoring, soft skills and well-being programs
Certification (Anthropic, Gemini, GCP, Azure, AWS)
English classes

We cover it all:
Stable pay
Participation in the Employee Stock Purchase Plan with a 15% discount
Benefits package (health insurance, multisport, shopping vouchers)
Referral bonuses up to $2,000
Offices featuring entertainment and relaxation zones, table tennis and football, free snacks, coffee and more
Corporate, social and well-being events

Please note:
Benefits listed above are available to employees only
We are open to working with contractors. Terms of B2B cooperation agreements are agreed individually
We will reach out to selected candidates exclusively

EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.

Technology

EPAM Systems

Senior C++ Engineer with Observability

Senior

Remote

Poznan, WP, Poland

🏢 Summary: Senior C++ Engineer role focused on leading observability implementation across the full software development lifecycle for low-latency, high-performance distributed systems. The position combines hands-on engineering with technical leadership to build and scale monitoring, telemetry, and operational tooling for latency-sensitive market data environments. It requires strong C++ expertise and deep experience in production-grade observability practices.

🗂️ Requirements:
5+ years of experience in observability engineering (metrics, tracing, logging)
Strong C++ engineering skills
Experience with profiling and telemetry pipelines
Experience building and maintaining large-scale monitoring or observability platforms
Knowledge of latency-sensitive market data systems
Expertise in OpenTelemetry, eBPF, and GitOps
Experience with API-driven automation and CI/CD-integrated observability
Knowledge of cloud-native, Kubernetes, and distributed systems architectures
Ability to define standards and guide engineering teams
English proficiency at B2 level or higher

📃 Skills: C++, OpenTelemetry, eBPF, GitOps, Kubernetes, CI/CD, APIs, Telemetry, Tracing, Logging, Profiling, Cloud-native, DistributedSystems

🏢 Description: We are looking for a Senior C++ Engineer with Observability expertise to spearhead our observability implementation. This position blends practical engineering with technical leadership, guaranteeing that customer experience, system reliability, and operational insight remain measurable and integrated across the entire software development lifecycle. The perfect candidate will offer substantial experience in low-latency trading, market data, or similar high-performance distributed systems, paired with a solid C++ foundation and a proven history of delivering production monitoring, telemetry, and operational tooling at scale.

Technology

EPAM Systems

Senior C++ Engineer with Observability

Senior

Remote

Gdansk, Poland

🏢 Summary: Senior C++ Engineer with Observability expertise to lead the implementation of monitoring and telemetry across high-performance, low-latency distributed systems. The role combines hands-on C++ engineering with technical leadership to ensure measurable customer experience, system reliability, and operational insight. Focused on large-scale observability platforms for latency-sensitive market data environments.

🗂️ Requirements:
5+ years in observability engineering (metrics, tracing, logging)
Strong C++ engineering skills
Experience with profiling and telemetry pipelines
Experience building and maintaining large-scale monitoring or observability platforms
Knowledge of customer-experience measurement for latency-sensitive market data systems
Expertise in OpenTelemetry, eBPF, GitOps
Experience with API-driven automation and CI/CD-integrated observability
Knowledge of cloud-native, Kubernetes, distributed systems architectures
Ability to define standards and guide engineering teams
English proficiency B2+

📃 Skills: C++, OpenTelemetry, eBPF, GitOps, Kubernetes, CI/CD, APIs, Cloud, Metrics, Tracing, Logging, Profiling, Telemetry

🏢 Description: We are looking for a Senior C++ Engineer with Observability expertise to spearhead our observability implementation. This position blends practical engineering with technical leadership, guaranteeing that customer experience, system reliability, and operational insight remain measurable and integrated across the entire software development lifecycle. The perfect candidate will offer substantial experience in low-latency trading, market data, or similar high-performance distributed systems, paired with a solid C++ foundation and a proven history of delivering production monitoring, telemetry, and operational tooling at scale.

CluePoints

CluePoints is a fast-growing international technology scale-up operating in the life sciences and clinical research industry. The company specializes in Risk-Based Quality Management (RBQM) and Data Quality Oversight software, leveraging advanced statistics, artificial intelligence, and machine learning to enhance the quality, accuracy, and integrity of clinical trial data. Its mission is to create smarter, more efficient ways to run clinical trials and deliver AI-powered insights that improve patient outcomes worldwide. With a diverse team representing over 20 nationalities, CluePoints fosters a collaborative, flexible, and learning-oriented culture. Guided by its core values of Care, Passion, and Smart Disruption, the company emphasizes innovation, impact, and continuous growth.
