June 8, 2026

Senior Platform Optimization & Observability Engineer

Senior • Remote

150 - 200 PLN

Wroclaw, Poland

Tech stack:

ELK stack (observability, APM, security)
VMware, Hyper‑V, KVM / Proxmox
Log Analytics (migration source)
Monitoring and alerting platforms

Requirements:

Hands‑on experience optimizing virtualization platforms
Strong storage performance and capacity optimization skills
Experience with platform security hardening
Deep operational experience with ELK stack
Experience migrating dashboards, queries, and reports from Log Analytics
Strong understanding of DR optimization and recovery metrics

Nice to have:

Compliance scanning tools (CIS)
SOC 1 / SOC 2 / C5 familiarity
Sentinel rule migration experience
Experienced in using AI tools in day-to-day workflow

Project description:

You will own platform health, optimization, and the observability stack in a complex enterprise environment. The project focuses on improving platform performance, security posture, DR effectiveness, and migrating monitoring and security capabilities to a new observability platform.

Main responsibilities:

Optimize virtualization and storage platforms
Expand observability with APM and security capabilities
Migrate monitoring and security assets from Azure tooling
Optimize logging, alerting, and retention strategies
Review and improve DR and firewall configurations
Collaborate with network and security engineers

Similar jobs you might like

Technology

emagine Polska

Observability Specialist

Senior

Hybrid

Warsaw, Poland

🏢 Summary: The offer is for an Observability Specialist responsible for designing, implementing, and maintaining a scalable telemetry and monitoring infrastructure in cloud-native environments. The role focuses on Kubernetes observability, Elastic Stack management, and performance optimization using modern telemetry standards. It involves driving SRE practices and ensuring high system reliability through advanced monitoring and AIOps solutions. 🗂️ Requirements: Experience monitoring Kubernetes (OpenShift) environments, Hands-on implementation of OpenTelemetry for logs, traces, and metrics, Strong expertise in ELK stack deployment and maintenance, Proficiency in automating Elastic environments using Ansible, Experience with Application Performance Monitoring for code-level analysis, Knowledge of shard optimization, mapping, and Index Lifecycle Management, Experience defining and monitoring SLOs and managing Error Budgets, Integration of observability solutions with major cloud providers 📃 Skills: Kubernetes, OpenShift, OpenTelemetry, Elasticsearch, Logstash, Kibana, Ansible, ElasticAPM, AIOps, SRE, ILM, Sharding, Mapping, Cloud 🏢 Description: Introduction & Summary We are seeking an experienced Observability Specialist dedicated to ensuring the reliability and performance of our systems. This role involves collaborating with enterprise architects and IT professionals to design, implement, and oversee a scalable telemetry infrastructure. The ideal candidate will possess deep expertise in ELK or similiar technologies and modern telemetry standards. Main Responsibilities As our Observability Engineer, your core duties will include: Architectural Collaboration: Partner with system architects and local engineering teams in Denmark to design resilient monitoring solutions. Monitor Kubernetes environments with OpenTelemetry (OTel) standards for logs, traces, and metrics. Manage centralized data collection and automate Elastic deployments using Ansible. Utilize Elastic APM for identifying code-level bottlenecks and resolving latency issues. Implement AIOps configurations for proactive anomaly detection and automated root-cause analysis. Drive Site Reliability Engineering (SRE) methodologies across teams. Elastic Stack Management: Deploy, scale, and maintain Elasticsearch, Logstash, and Kibana (ELK) environments. Key Requirements Cloud-Native Observability: Strong skills in monitoring Kubernetes (Openshift) environments and integrating with major cloud providers. APM & Distributed Tracing: Expertise in Application Performance Monitoring (APM) to identify code-level bottlenecks and latency issues. OpenTelemetry (OTel): Hands-on experience implementing OpenTelemetry (or similiar) standards for logs, traces, and metrics to ensure vendor-neutral telemetry. Infrastructure as Code (IaC): Proficiency in automating Elastic environments with Ansible. Performance Engineering: Expert-level knowledge of shard optimization, mapping, and Index Lifecycle Management (ILM) to balance high performance with cost control. SRE Methodology: Experience defining and monitoring Service Level Objectives (SLOs) and managing Error Budgets. Strong communication skills for collaboration with IT teams. NIce to Have: Elastic Stack Mastery: Deep expertise in architecting and managing Elasticsearch, Logstash, and Kibana (ELK) at scale. Data Ingestion & Fleet: Proven experience deploying Elastic Agent and Fleet for centralized agent management and data collection. AIOps & Machine Learning: Ability to configure Elastic ML models for proactive anomaly detection and automated root cause analysis. Other Details This is position based in Warsaw, flexible Hybrid model, focused on leading-edge observability solutions in a dynamic and collaborative environment.

Technology

BlueSoft

AWS Senior DevOps Engineer

Senior

Remote

Warsaw, Poland

🏢 Summary: Technical role focused on leading AWS multi-account migrations and modernization initiatives, including landing zone design, VDI transitions, and central platform enhancements. The position involves rebuilding environments with Terraform and CI/CD, implementing governance, networking, identity, and observability across enterprise-scale AWS environments. It combines hands-on cloud engineering with migration planning and execution in complex infrastructures. 🗂️ Requirements: Strong hands-on experience with AWS multi-account environments, Experience with AWS migrations and landing zone architecture, Practical experience with AWS Organizations and IAM, Experience with Route 53, VPC, Transit Gateway, CloudWatch, CloudTrail, Experience migrating VDI platforms such as Amazon WorkSpaces or AppStream 2.0, Strong Terraform and Infrastructure as Code experience, Experience building and maintaining CI/CD pipelines, Knowledge of DevOps or SRE practices, Experience implementing DNS, routing, and identity federation in AWS 📃 Skills: AWS, Terraform, IAM, Organizations, ControlTower, Route53, VPC, TransitGateway, CloudWatch, CloudTrail, WorkSpaces, AppStream, CI/CD, EKS, Kubernetes, Lambda, EventBridge, S3, Athena, Glue, Jenkins, GitHub, GitLab, Bedrock 🏢 Description: If Terraform state files don’t scare you and account migrations sound more exciting than stressful — keep reading 😄 Project Overview Development of a central AWS platform supporting governance, observability, and shared services in a multi-account model. Modernization of the central account data pipeline for logs, telemetry, and operational data. Delivery of migration initiatives covering AWS account transitions and VDI platform migrations . Execution of discovery assessments , dependency mapping, target landing zone design, and migration runbooks. Onboarding infrastructure services: EKS logging , Route 53 Global Resolver , VPC Flow Logs , Bedrock invocation . Collaboration with Security, Networking, EUC, and Platform teams in complex enterprise environments. Daily Responsibilities Perform discovery of existing AWS accounts: IAM models, workloads, networking, logging, cost footprint, dependencies. Plan and execute migrations into governed AWS multi-account structures using Organizations / Control Tower patterns. Migrate VDI workloads using Amazon WorkSpaces / AppStream 2.0 or equivalent solutions. Rebuild environments using Terraform / IaC , CI/CD pipelines, and immutable infrastructure practices. Implement DNS, routing, identity federation, and access controls across migrated environments. Support cutovers, hypercare, rollback readiness, monitoring, and post-migration optimization. Requirements Strong hands-on experience with AWS multi-account environments , migrations, and landing zone architecture. Experience with AWS Organizations, IAM, Route 53, VPC, Transit Gateway, CloudWatch, CloudTrail . Practical knowledge of VDI platforms such as Amazon WorkSpaces or similar enterprise desktop solutions. Strong Terraform / IaC background with CI/CD automation skills. Experience in DevOps / SRE practices: HA, resilience, RCA, observability, incident response. Ability to manage discovery workshops, migration planning, stakeholder communication, and execution streams. Nice to Have EKS / Kubernetes Lambda / EventBridge S3 / Athena / Glue GitHub Actions / GitLab CI / Jenkins FinOps / cost optimization Bedrock / GenAI platform exposure

Technology

emagine Polska

Site Reliability Engineer

Senior

Remote

Lisbon, Portugal

🏢 Summary: Hands-on Observability Engineer role focused on building and automating enterprise-grade monitoring and observability solutions across AWS-based cloud and distributed systems. The position centers on developing infrastructure as code, CI/CD pipelines, and monitoring ecosystems to improve reliability, performance, and incident response. Approximately 90% of the role involves coding in Python and Terraform. 🗂️ Requirements: Strong hands-on experience with AWS, Strong Python development and scripting experience, Strong experience with Terraform, Experience building and maintaining CI/CD pipelines using Jenkins, Experience with Elasticsearch and ELK Stack, Experience with Linux systems, Shell scripting skills, Understanding of monitoring, logging, and alerting concepts, Experience working in Agile or DevOps environments 📃 Skills: AWS, Python, Terraform, Jenkins, Elasticsearch, ELK, Linux, Bash, CI/CD, Kubernetes, Grafana, Prometheus, Datadog, NewRelic, Snowflake, Databricks, dbt, Matillion 🏢 Description: Role Overview We are looking for a skilled and proactive Observability Engineer to implement, automate, and support enterprise-grade observability and monitoring solutions across cloud and application platforms. The ideal candidate should have strong AWS infrastructure knowledge, hands-on automation skills, and experience building reliable monitoring and alerting ecosystems for modern distributed applications. The role involves working closely with Platform Engineering, Data Engineering, and Application teams to develop observability solutions and bring operational visibility, reliability, incident detection, and platform performance. Main Responsibilities · Design, implement, and maintain observability solutions for cloud-native and distributed systems. · Build monitoring, logging, alerting, and dashboarding solutions across infrastructure and applications. · Develop automation scripts and tooling using Python. · Implement and maintain Infrastructure as Code (IaC) using Terraform. · Build and support CI/CD pipelines using Jenkins and Git-based workflows. · Configure and optimize monitoring for AWS services, Kubernetes workloads, APIs, databases, and applications. · Create actionable alerts and operational dashboards to improve incident response and system reliability. · Work with engineering teams to onboard applications into observability platforms. · Support troubleshooting, root cause analysis, and performance optimization initiatives. · Ensure observability standards, governance, and best practices are followed across projects. Key Requirements · Strong hands-on experience with Amazon Web Services (AWS). · Solid Python development/scripting experience. · Strong experience with Terraform. · Experience building and maintaining CI/CD pipelines using Jenkins. · Elasticsearch / ELK Stack experience and building queries. · Worked with Data Platforms monitoring is preferred. · Experience with Linux systems and shell scripting. · Understanding of monitoring, logging, and alerting concepts. · Experience working in Agile/DevOps environments. Nice to Have Skills Experience with any of the following is highly desirable: · Snowflake · Databricks · dbt · Matillion · Grafana · New Relic · Datadog · Prometheus · Elasticsearch / ELK Stack experience NOTES: We are looking for an Engineer who loves to build. This is a highly technical role—90% of the job is hands-on coding in python and terraform.

Technology

emagine Polska

Senior Virtualisierungsspezialist (m/w/d)

Senior

Hybrid

Gilching, BY, Germany

🏢 Summary: Senior Virtualisierungsspezialist zur Verwaltung, Optimierung und Weiterentwicklung hochverfügbarer virtualisierter Plattformen. Verantwortung für den Betrieb von VMware (vSphere, ESXi) und/oder Proxmox sowie für Fehleranalyse, Performance-Optimierung und Integration in bestehende Infrastruktur- und Sicherheitsarchitekturen. Enge Zusammenarbeit mit Infrastruktur- und Netzwerkteams in sicherheitskritischen Umgebungen. 🗂️ Requirements: Mehrjährige praktische Erfahrung in Virtualisierung, Fundierte Betriebserfahrung mit VMware vSphere und ESXi oder Proxmox, Erfahrung in Fehleranalyse und Performance-Optimierung, Verständnis von Hochverfügbarkeitskonzepten, Kenntnisse in Plattform-Sicherheitsarchitekturen, Sehr gute Deutschkenntnisse (C1) 📃 Skills: Virtualisierung, VMware, vSphere, ESXi, Proxmox, Hochverfügbarkeit, Sicherheitsarchitektur, Performanceanalyse, Troubleshooting 🏢 Description: Einführung & Zusammenfassung: Für die effiziente Verwaltung und Integration hochverfügbarer Plattformen suchen wir einen erfahrenen Senior Virtualisierungsspezialisten. Die ideale Kandidatin oder der ideale Kandidat bringt umfangreiche praktische Erfahrung in der Virtualisierung mit, insbesondere mit VMware und Proxmox, sowie Kenntnisse in Fehleranalyse und Performance-Optimierung. Die Rolle erfordert auch ein tiefes Verständnis für Hochverfügbarkeit und Sicherheitsarchitekturen. Hauptverantwortlichkeiten: Überwachung, Analyse und Weiterentwicklung virtualisierter Umgebungen. Betrieb und Management von VMware (vSphere, ESXi) und/oder Proxmox. Durchführung von Fehleranalysen, Performance-Optimierungen und Stabilitätsmaßnahmen. Integration virtueller Lösungen in bestehende Infrastruktur- und Sicherheitsarchitekturen. Enge Zusammenarbeit mit Infrastruktur- und Netzwerkteams. Anforderungen: Mehrjährige praktischen Erfahrung in der Virtualisierung. Fundiertes Betriebswissen (keine rein theoretischen Profile). Gutes Verständnis für Hochverfügbarkeit und Plattform-Sicherheit. Sehr gute Deutschkenntnisse (mindestens C1-Niveau). Wünschenswerte Qualifikationen: Kenntnisse in Cloud-Umgebungen. Erfahrung mit Backup- und Wiederherlösungssystemen. Vertrautheit mit Automatisierungstools. Sonstige Details Aufgrund unserer Zusammenarbeit mit Kunden im Bereich kritischer Infrastrukturen kann eine Sicherheitsüberprüfung nach Sicherheitsüberprüfungsgesetz (SÜG) erforderlich sein. Wir freuen uns auf Ihre Bewerbung!

Technology

Spyrosoft

Senior Kubernetes Platform Engineer

Senior

Remote

Wroclaw, Poland

150 - 200 PLN

🏢 Summary: Design and deliver production-grade Kubernetes platforms across Azure (AKS) and on-prem environments for critical in-house applications. The role focuses on portability, reliability, observability, and secure container lifecycle management. You will architect, deploy, and migrate Kubernetes workloads while defining networking, storage, and security standards. 🗂️ Requirements: 6–8+ years of experience with Linux and container platforms, Deep expertise in Kubernetes (managed and on-prem), Strong understanding of cloud-native architectures, Experience designing Kubernetes networking, storage, and security, Hands-on experience with PostgreSQL in cloud or containerized environments, Infrastructure-as-Code experience, Strong troubleshooting and performance tuning skills 📃 Skills: Kubernetes, AKS, Docker, OpenTofu, Terraform, Bicep, Helm, ArgoCD, Flux, PostgreSQL, Linux, Azure, GitOps 🏢 Description: Tech stack: Kubernetes (AKS & on‑prem) Docker OpenTofu / Terraform / Bicep Helm GitOps tools (ArgoCD, Flux) PostgreSQL Linux Requirements: 6–8+ years of experience with Linux and container platforms Deep expertise in Kubernetes (managed and on‑prem) Strong understanding of cloud‑native architectures Experience designing Kubernetes networking, storage, and security Hands‑on experience with PostgreSQL in cloud or containerized setups Infrastructure‑as‑Code experience Strong troubleshooting and performance tuning skills Nice to have: ELK stack experience Azure Administrator certification or experience Open‑source storage solutions (Ceph, Longhorn) Policy engines (OPA, Kyverno) Experienced in using AI tools in day-to-day workflow Project description: You will design and deliver production‑grade Kubernetes platforms across cloud and on‑prem environments hosting critical in‑house applications. The role focuses on portability, reliability, observability, and secure container lifecycle management. Main responsibilities: Design and deploy on‑prem and AKS Kubernetes clusters Enable workload portability between Azure and on‑prem Define networking, ingress, storage, and security patterns Deliver Kubernetes workload migrations Build container image lifecycle processes Collaborate closely with Platform Engineers and Infrastructure teams

Technology

Creotech

DevOps Engineer (Networking-focused)

Mid

Hybrid

Warsaw, Poland

🏢 Summary: The offer is for a DevOps/Network Engineer role focused on designing and securing scalable network architectures across cloud and hybrid environments. The position involves automating infrastructure with IaC tools, managing cloud networking components, and integrating security into CI/CD pipelines. The role also includes monitoring, incident response, and supporting compliance and infrastructure hardening efforts. 🗂️ Requirements: Minimum 3 years experience in DevOps, Infrastructure, or Network Engineering, Hands-on experience with cloud networking in AWS, Azure, or GCP, Experience with Infrastructure as Code tools, Knowledge of CI/CD pipelines and DevOps practices, Experience with monitoring, logging, and alerting tools, Understanding of IAM and network security best practices, Ability to manage VPCs, VPNs, load balancers, and firewalls 📃 Skills: AWS, Azure, GCP, Terraform, Ansible, CICD, IAM, VPC, VPN, Firewalls, LoadBalancers, Kubernetes, ISO27001, SOC2, Monitoring, Logging 🏢 Description: Tasks Design and implement secure and scalable network architectures across cloud and hybrid environments Automate infrastructure using IaC tools (Terraform, Ansible) Manage and secure VPCs, VPNs, load balancers, and firewalls Ensure network segmentation and access control best practices Support CI/CD pipelines and integrate basic security mechanisms (e.g., secret handling) Monitor traffic, logs, and performance; respond to incidents Collaborate with security teams on infrastructure hardening Support compliance requirements (e.g., ISO 27001, SOC 2) Requirements At least 3 years of experience in DevOps, Infrastructure, or Network Engineering Hands-on experience with cloud networking (AWS / Azure / GCP) Experience with Infrastructure as Code and automation tools Knowledge of CI/CD pipelines and DevOps practices Ability to work with monitoring, logging, and alerting tools Basic understanding of security best practices (IAM, network security) Very good command of English (B2+/C1), both written and spoken Nice to have Experience with Kubernetes networking Exposure to security tools or vulnerability scanning Cloud or networking certifications (e.g., AWS, CCNA)

Technology

Link Group

DevOps Engineer (Observability)

Senior

Hybrid

Warsaw, Poland

130 - 145 PLN

🏢 Summary: Design and scale next-generation observability and logging solutions within an international DevOps team, focusing on building high-scale monitoring platforms and cloud-native infrastructure from the ground up. The role combines architecture, infrastructure as code, and reliability engineering for distributed systems. You will drive metrics, logging, tracing, and alerting solutions in a collaborative environment. 🗂️ Requirements: Hands-on experience with Prometheus and Grafana, Experience scaling observability tools such as Thanos or Mimir, Experience managing ELK stack or Loki logging platforms, Strong proficiency in Terraform and Terragrunt, Deep understanding of Kubernetes, Experience with distributed systems observability (metrics, logs, traces), Full professional proficiency in English 📃 Skills: Prometheus, Grafana, Thanos, Mimir, ELK, Loki, Terraform, Terragrunt, Kubernetes, Python, Go, GitHubActions, Puppet 🏢 Description: The Opportunity Join a high-performing, international team of six DevOps experts. This is not a "maintenance-only" role. You will have a seat at the table in designing, building, and scaling our next-generation observability and logging solutions from the ground up. We believe in "Attitude First." If you are an ambitious engineer who thrives on collaboration, knowledge sharing, and solving complex distributed systems challenges, we want to grow with you. Key Responsibilities Architect & Build: Design and implement end-to-end observability solutions, including metrics, logging, tracing, and advanced alerting. Platform Excellence: Operate and optimize high-scale monitoring platforms (Prometheus, Mimir, Grafana) and ELK stack logging infrastructure. Infrastructure as Code: Define and maintain all observability systems using Terraform and Terragrunt . Reliability Engineering: Ensure the scalability and performance of our systems while supporting incident detection and root cause analysis (RCA). Collaborate: Work across domains with a team that values mentoring, transparency, and collective problem-solving. Your Technical Core Observability Expert: Solid hands-on experience with Prometheus, Grafana, and scaling tools like Thanos or Mimir . Logging Architect: Proven experience managing enterprise-grade logging platforms (ELK stack or Loki). IaC Ninja: Strong proficiency in Terraform/Terragrunt to manage infrastructure. Cloud Native: Deep understanding of Kubernetes and the complexities of metrics/logs/traces in distributed systems. Language: Full proficiency in English for seamless global collaboration. Stand Out From The Crowd (Nice to Have) Coding: Ability to automate and integrate using Python or Go . CI/CD: Exposure to GitHub Actions and automated workflows. Configuration Management: Experience with Puppet. SRE Mindset: Understanding of Service Level Indicators (SLIs), Objectives (SLOs), and Error Budgets.

Technology

Spyrosoft

DevOps Engineer (Senior)

Senior

Remote

Krakow, Poland

110 - 200 PLN

🏢 Summary: The offer is for a Cloud Infrastructure Specialist responsible for ensuring production stability, secure cloud networking, and scalable infrastructure within AWS and Azure environments. The role focuses on hands-on infrastructure as code, observability, and CI/CD optimization while closely collaborating with a development team. It emphasizes autonomy and direct impact on production reliability rather than ticket-based support. 🗂️ Requirements: Proven experience maintaining production-grade environments, Hands-on experience with AWS services (Lambda, API Gateway, DynamoDB, RDS, S3, SNS, SQS, EC2, ECS, WAF, VPC, Route53, ALB/NLB, Cognito, IAM), Hands-on experience with Azure services, Strong practical experience with cloud networking (VPC/VNet, subnetting, routing, peering, NAT, security groups, firewalls), Hands-on experience with Datadog for monitoring, logging, alerting, In-depth commercial experience with AWS and Azure, Proficiency with Terraform or AWS CDK, Experience building and maintaining CI/CD pipelines using GitLab CI/CD, Ability to support automated deployments and infrastructure changes 📃 Skills: AWS, Azure, Terraform, CDK, TypeScript, Datadog, Prometheus, Grafana, Loki, Kubernetes, AKS, Lambda, APIGateway, DynamoDB, S3, SNS, SQS, EC2, ECS, WAF, VPC, Route53, ALB, NLB, Cognito, IAM, RDS, Redshift, BlobStorage, PostgreSQL, GitLabCI, AzureDevOps, RabbitMQ, InfluxDB 🏢 Description: You will join a substantial project as a key infrastructure specialist. You won't be managing a ticket queue; instead, you will partner directly with a mid-sized team of developers (~15 people) to ensure system stability and scalability. We are looking for someone who acts independently and is ready to ensure production reliability and secure cloud network configuration. Our Tech Stack You are not expected to know everything upfront, but this is the environment you will work with: IaC: Terraform, AWS CDK (TypeScript) Clouds: AWS & Azure Observability: Datadog, Prometheus, Grafana, Loki Core Azure: Kubernetes (AKS), Blob Storage, PostgreSQL Core AWS: Lambda, API Gateway, DynamoDB, S3, SNS, SQS, EC2, WAF, VPC CI/CD: GitLab CI, Azure DevOps Other: RabbitMQ, InfluxDB, Renovate Requirements: Production Experience: proven track record of configuring and maintaining production-grade environments. Hands-on experience with AWS (Lambda, API Gateway, DynamoDB, Redshift, RDS, S3, SNS, SQS, EC2, ECS, WAF, VPC, Route53, ALB/NLB, Cognito, IaM) Hands-on experience with Azure. Observability & monitoring: hands‑on experience with Datadog for monitoring, logging, alerting and performance analysis in production environments. Cloud Networking: strong practical experience in configuring Cloud Networks (VPC/VNet, Subnetting, Routing, Peering, NAT Gateways, Security Groups/Firewalls). Cloud Expertise: in-depth knowledge and commercial experience with Azure and AWS. Tooling: proficiency with Terraform or AWS CDK. Practical experience in building and maintaining CI/CD pipelines using GitLab CI/CD, supporting automated deployments and infrastructure changes. High autonomy and ability to communicate technical concepts to a cross-functional team. Nice to have: Experience with TypeScript , especially in AWS CDK or serverless applications. Main responsibilities: Production Stability: Maintain high availability and security of production environments. Cloud Networking: Configure and manage VPCs/VNets, subnets, routing tables, peering, and network isolation. Infrastructure as Code: Provision and manage resources using Terraform or AWS CDK. Developer Support: Optimize CI/CD pipelines and assist developers in understanding infrastructure constraints. Observability: Maintain monitoring stacks to ensure full system visibility.

Technology

Link Group

Senior AI Devops Engineer

Senior

Hybrid

Warsaw, Poland

200 - 220 PLN

🏢 Summary: Opportunity to join a large-scale DevOps transformation programme focused on migrating 90+ applications, modernising infrastructure, and standardising cloud-native engineering practices. The role centres on Kubernetes adoption, CI/CD optimisation, workflow orchestration modernisation, and AI-driven automation within a complex enterprise environment. You will collaborate across platform, infrastructure, and application teams to improve delivery efficiency and operational scalability. 🗂️ Requirements: Strong DevOps or Platform Engineering experience in large enterprise environments, Experience with Java-based application ecosystems, Hands-on Kubernetes experience in enterprise settings, Experience deploying and managing applications on Linux (Red Hat preferred), Strong CI/CD pipeline design and implementation experience, Experience with Azure DevOps, Experience supporting multiple applications or cross-team environments, Experience with workflow orchestration tools (Control-M or Airflow), Experience using AI to automate DevOps processes and workflows 📃 Skills: DevOps, Kubernetes, Java, Linux, RedHat, CI/CD, Azure, Control-M, Airflow, AI, Automation 🏢 Description: Our client, a leading international organisation operating within a highly complex technology environment, is undertaking a large-scale transformation programme focused on infrastructure modernisation, application migration and engineering efficiency. To support this initiative, a new central DevOps capability is being established to drive the migration and standardisation of a large application estate while introducing scalable engineering practices and modern delivery approaches across multiple technology teams. This is an opportunity to join a strategic programme with significant visibility and impact, working at the intersection of platform engineering, automation, cloud-native technologies and AI-enabled software delivery. Project Scope The programme includes: Migration of more than 90 applications across multiple technology domains Large-scale infrastructure and data centre transformation initiatives Adoption of enterprise Kubernetes platforms Migration to modern storage and infrastructure solutions Modernisation of workflow orchestration and scheduling platforms Standardisation of DevOps practices across engineering teams Introduction of AI-driven tooling to improve engineering productivity and delivery efficiency The successful candidate will work closely with platform, infrastructure, architecture and application teams to accelerate delivery, reduce operational complexity and support the organisation's long-term technology strategy. Key Responsibilities Support large-scale application and infrastructure migration initiatives Design, implement and optimise CI/CD pipelines and deployment processes Enable adoption of Kubernetes-based deployment models Drive automation and standardisation across engineering teams Improve software delivery processes and operational efficiency Leverage AI tools to automate DevOps workflows and engineering tasks Support workflow orchestration modernisation projects Collaborate with stakeholders across multiple technology teams Required Skills & Experience Strong DevOps or Platform Engineering experience gained within large-scale enterprise environments Solid exposure to Java-based application ecosystems Experience deploying and managing applications on Linux environments (Red Hat preferred) Hands-on experience with Kubernetes in enterprise settings Strong CI/CD experience, ideally with Azure DevOps Experience supporting multiple applications or working across various engineering teams Experience with workflow orchestration tools such as Control-M and/or Airflow Experience using AI to automate DevOps processes, workflows or engineering activities Nice to Have Experience with object storage technologies and infrastructure transformation programmes Background within financial services, banking or capital markets Exposure to trading, pricing or risk management systems Experience working with platforms such as Murex Experience leveraging AI tools to support migration initiatives, including Control-M to Airflow transitions

Technology

Yard Corporate

Site Reliability Engineer (SRE)

Senior

Hybrid

Warsaw, Poland

40,000 - 55,000 PLN

🏢 Summary: Senior Site Reliability Engineer role focused on building and standardizing SRE practices across a hybrid AWS and on-prem infrastructure. The position centers on ensuring scalability, resilience, and high availability of high-frequency, data-intensive platforms through observability, automation, and Kubernetes optimization. You will define SLOs, enhance monitoring architecture, and drive reliability culture across engineering teams. 🗂️ Requirements: 5+ years experience in SRE, DevOps, or Infrastructure Engineering supporting distributed production systems, Bachelor’s degree in Computer Science, Computer Engineering, or related field (or equivalent experience), Deep expertise in Grafana, Prometheus, Loki, and Tempo (OpenTelemetry), Strong production experience with Docker and Kubernetes, Experience managing hybrid infrastructure (AWS and on-premises), Proficiency in at least one language: Python, Go, or Bash, Hands-on experience with CI/CD pipelines and Infrastructure-as-Code, Experience defining and managing SLOs and SLAs, Willingness to participate in on-call rotation 📃 Skills: AWS, Kubernetes, Docker, Prometheus, Grafana, Loki, Tempo, OpenTelemetry, Python, Go, Bash, CI/CD, IaC, Git, Hypervisors 🏢 Description: About the Client Our client is a premier, global investment management firm operating at the intersection of finance and technology. Known for their sophisticated, data-intensive systems, they build and maintain high-performance platforms that process massive volumes of market and operational data. To support their expanding footprint, they are looking for a senior-level Site Reliability Engineer (SRE) who will take ownership of shaping, standardizing, and scaling their SRE frameworks and reliability culture from the ground up. The Role In this role, you will serve as a foundational force for SRE practices, partnering directly with Cloud, Infrastructure, and Software Engineering squads. You will work across a hybrid infrastructure (combining advanced AWS cloud environments and physical on-premises servers) to guarantee the scalability, resilience, and maximum uptime of critical, high-frequency transactional platforms. Core Responsibilities SRE Evangelism: Design, implement, and champion core reliability principles, helping technology teams adopt sustainable scaling practices. Observability Architecture: Implement, scale, and maintain end-to-end monitoring, telemetry, and distributed tracing systems utilizing Prometheus, Grafana, Loki, and Tempo (OpenTelemetry framework). Kubernetes Optimization: Establish best-practice configurations for containerized workloads, ensuring applications running on Kubernetes are highly resilient, cost-effective, and performant. Incident Management & Culture: Participate in a balanced, shared on-call rotation (averaging one week per month). Automation & Engineering: Build custom tooling and CI/CD pipelines to automate routine tasks, system health checks, and rapid disaster recovery workflows. SLO/SLA Definition: Partner with product and engineering teams to define, monitor, and enforce Service Level Objectives (SLOs) and Error Budgets. What We Look For Experience: 5+ years of hands-on experience in a dedicated SRE, DevOps, or Infrastructure Engineering role supporting complex, distributed production systems. Education: A Bachelor’s degree in Computer Science, Computer Engineering, or a related technical discipline (or equivalent practical experience). Observability Expertise: Deep, subject-matter knowledge of modern monitoring stacks, specifically Grafana, Prometheus, Loki, and Tempo (OTel). Orchestration & Containers: Strong, production-grade expertise in containerization (Docker) and orchestration (Kubernetes). Hybrid Infrastructure: Experience navigating hybrid models—managing both cloud services (AWS preferred) and physical on-premise hardware resources. Scripting/Coding: Proficiency in writing clean, maintainable code in at least one scripting or programming language (e.g., Python, Bash, or Go) to build reliable automation. Methodologies: Solid grounding in CI/CD concepts, infrastructure-as-code (IaC), and agile development processes. Soft Skills: Excellent verbal and written communication skills, with a proven ability to convey complex infrastructure and reliability concepts to both technical and non-technical stakeholders. What We Offer Stable Employment: Full-time employment contract ( Umowa o Pracę - UoP ). Tax Optimization: Eligibility for creative tax-deductible costs ( KUP - Koszty Uzyskania Przychodu). Financial Reward: Highly competitive base salary accompanied by a generous annual performance bonus . Comprehensive Health: Premium private medical care package that fully includes dental coverage (stomatologia) . Wellness & Lifestyle: MultiSport card to keep you active and healthy. Daily Perks: Pre-funded lunch card for your daily meals. Tech Stack at a Glance Cloud & Virtualization: AWS, Kubernetes, Docker, On-Premises Hypervisors Observability: Prometheus, Grafana, Loki, Tempo, OpenTelemetry (OTel) Languages: Python, Go, Bash CI/CD & Automation: Git-based pipelines, Configuration Management, IaC

Spyrosoft

Spyrosoft is a dynamic company operating in the technology industry, specifically focusing on fintech solutions. The company is engaged in a collaborative venture with Klarna, a global leader in digital payment solutions, to establish an IT hub that fosters innovation in online shopping and payment systems. Spyrosoft values independence, proactiveness, and flexibility, creating an environment that encourages forward-thinking and dynamic work. The company is based in Warsaw, Poland, and offers a hybrid work model, reflecting its commitment to modern and adaptable work practices. Spyrosoft is dedicated to facilitating a seamless recruitment process, emphasizing a strong cultural fit and technical expertise in its candidates. The company is known for its rigorous standards, particularly in the financial domain, ensuring high-quality and secure digital solutions.

Check if your resume is ATS-ready before applying →Build an ATS-optimized resume