June 8, 2026
Senior Platform Optimization & Observability Engineer
Senior • Remote
150 - 200 PLN
Wroclaw, Poland
Tech stack:
ELK stack (observability, APM, security)
VMware, Hyper‑V, KVM / Proxmox
Log Analytics (migration source)
Monitoring and alerting platforms
Requirements:
Hands‑on experience optimizing virtualization platforms
Strong storage performance and capacity optimization skills
Experience with platform security hardening
Deep operational experience with ELK stack
Experience migrating dashboards, queries, and reports from Log Analytics
Strong understanding of DR optimization and recovery metrics
Nice to have:
Compliance scanning tools (CIS)
SOC 1 / SOC 2 / C5 familiarity
Sentinel rule migration experience
Experienced in using AI tools in day-to-day workflow
Project description:
You will own platform health, optimization, and the observability stack in a complex enterprise environment. The project focuses on improving platform performance, security posture, DR effectiveness, and migrating monitoring and security capabilities to a new observability platform.
Main responsibilities:
Optimize virtualization and storage platforms
Expand observability with APM and security capabilities
Migrate monitoring and security assets from Azure tooling
Optimize logging, alerting, and retention strategies
Review and improve DR and firewall configurations
Collaborate with network and security engineers
Similar jobs you might like
Technology
Link Group
DevOps Engineer (Observability)
Senior
Hybrid
Warsaw, Poland
130 - 145 PLN
🏢 Summary: Design and scale next-generation observability and logging solutions within an international DevOps team, focusing on building high-scale monitoring platforms and cloud-native infrastructure from the ground up. The role combines architecture, infrastructure as code, and reliability engineering for distributed systems. You will drive metrics, logging, tracing, and alerting solutions in a collaborative environment. 🗂️ Requirements: Hands-on experience with Prometheus and Grafana, Experience scaling observability tools such as Thanos or Mimir, Experience managing ELK stack or Loki logging platforms, Strong proficiency in Terraform and Terragrunt, Deep understanding of Kubernetes, Experience with distributed systems observability (metrics, logs, traces), Full professional proficiency in English 📃 Skills: Prometheus, Grafana, Thanos, Mimir, ELK, Loki, Terraform, Terragrunt, Kubernetes, Python, Go, GitHubActions, Puppet 🏢 Description: The Opportunity Join a high-performing, international team of six DevOps experts. This is not a "maintenance-only" role. You will have a seat at the table in designing, building, and scaling our next-generation observability and logging solutions from the ground up. We believe in "Attitude First." If you are an ambitious engineer who thrives on collaboration, knowledge sharing, and solving complex distributed systems challenges, we want to grow with you. Key Responsibilities Architect & Build: Design and implement end-to-end observability solutions, including metrics, logging, tracing, and advanced alerting. Platform Excellence: Operate and optimize high-scale monitoring platforms (Prometheus, Mimir, Grafana) and ELK stack logging infrastructure. Infrastructure as Code: Define and maintain all observability systems using Terraform and Terragrunt . Reliability Engineering: Ensure the scalability and performance of our systems while supporting incident detection and root cause analysis (RCA). Collaborate: Work across domains with a team that values mentoring, transparency, and collective problem-solving. Your Technical Core Observability Expert: Solid hands-on experience with Prometheus, Grafana, and scaling tools like Thanos or Mimir . Logging Architect: Proven experience managing enterprise-grade logging platforms (ELK stack or Loki). IaC Ninja: Strong proficiency in Terraform/Terragrunt to manage infrastructure. Cloud Native: Deep understanding of Kubernetes and the complexities of metrics/logs/traces in distributed systems. Language: Full proficiency in English for seamless global collaboration. Stand Out From The Crowd (Nice to Have) Coding: Ability to automate and integrate using Python or Go . CI/CD: Exposure to GitHub Actions and automated workflows. Configuration Management: Experience with Puppet. SRE Mindset: Understanding of Service Level Indicators (SLIs), Objectives (SLOs), and Error Budgets.
Technology
emagine Polska
Observability Specialist
Senior
Hybrid
Warsaw, Poland
🏢 Summary: The offer is for an Observability Specialist responsible for designing, implementing, and maintaining a scalable telemetry and monitoring infrastructure in cloud-native environments. The role focuses on Kubernetes observability, Elastic Stack management, and performance optimization using modern telemetry standards. It involves driving SRE practices and ensuring high system reliability through advanced monitoring and AIOps solutions. 🗂️ Requirements: Experience monitoring Kubernetes (OpenShift) environments, Hands-on implementation of OpenTelemetry for logs, traces, and metrics, Strong expertise in ELK stack deployment and maintenance, Proficiency in automating Elastic environments using Ansible, Experience with Application Performance Monitoring for code-level analysis, Knowledge of shard optimization, mapping, and Index Lifecycle Management, Experience defining and monitoring SLOs and managing Error Budgets, Integration of observability solutions with major cloud providers 📃 Skills: Kubernetes, OpenShift, OpenTelemetry, Elasticsearch, Logstash, Kibana, Ansible, ElasticAPM, AIOps, SRE, ILM, Sharding, Mapping, Cloud 🏢 Description: Introduction & Summary We are seeking an experienced Observability Specialist dedicated to ensuring the reliability and performance of our systems. This role involves collaborating with enterprise architects and IT professionals to design, implement, and oversee a scalable telemetry infrastructure. The ideal candidate will possess deep expertise in ELK or similiar technologies and modern telemetry standards. Main Responsibilities As our Observability Engineer, your core duties will include: Architectural Collaboration: Partner with system architects and local engineering teams in Denmark to design resilient monitoring solutions. Monitor Kubernetes environments with OpenTelemetry (OTel) standards for logs, traces, and metrics. Manage centralized data collection and automate Elastic deployments using Ansible. Utilize Elastic APM for identifying code-level bottlenecks and resolving latency issues. Implement AIOps configurations for proactive anomaly detection and automated root-cause analysis. Drive Site Reliability Engineering (SRE) methodologies across teams. Elastic Stack Management: Deploy, scale, and maintain Elasticsearch, Logstash, and Kibana (ELK) environments. Key Requirements Cloud-Native Observability: Strong skills in monitoring Kubernetes (Openshift) environments and integrating with major cloud providers. APM & Distributed Tracing: Expertise in Application Performance Monitoring (APM) to identify code-level bottlenecks and latency issues. OpenTelemetry (OTel): Hands-on experience implementing OpenTelemetry (or similiar) standards for logs, traces, and metrics to ensure vendor-neutral telemetry. Infrastructure as Code (IaC): Proficiency in automating Elastic environments with Ansible. Performance Engineering: Expert-level knowledge of shard optimization, mapping, and Index Lifecycle Management (ILM) to balance high performance with cost control. SRE Methodology: Experience defining and monitoring Service Level Objectives (SLOs) and managing Error Budgets. Strong communication skills for collaboration with IT teams. NIce to Have: Elastic Stack Mastery: Deep expertise in architecting and managing Elasticsearch, Logstash, and Kibana (ELK) at scale. Data Ingestion & Fleet: Proven experience deploying Elastic Agent and Fleet for centralized agent management and data collection. AIOps & Machine Learning: Ability to configure Elastic ML models for proactive anomaly detection and automated root cause analysis. Other Details This is position based in Warsaw, flexible Hybrid model, focused on leading-edge observability solutions in a dynamic and collaborative environment.
Technology
Spyrosoft
Senior Kubernetes Platform Engineer
Senior
Remote
Wroclaw, Poland
150 - 200 PLN
🏢 Summary: Design and deliver production-grade Kubernetes platforms across Azure (AKS) and on-prem environments for critical in-house applications. The role focuses on portability, reliability, observability, and secure container lifecycle management. You will architect, deploy, and migrate Kubernetes workloads while defining networking, storage, and security standards. 🗂️ Requirements: 6–8+ years of experience with Linux and container platforms, Deep expertise in Kubernetes (managed and on-prem), Strong understanding of cloud-native architectures, Experience designing Kubernetes networking, storage, and security, Hands-on experience with PostgreSQL in cloud or containerized environments, Infrastructure-as-Code experience, Strong troubleshooting and performance tuning skills 📃 Skills: Kubernetes, AKS, Docker, OpenTofu, Terraform, Bicep, Helm, ArgoCD, Flux, PostgreSQL, Linux, Azure, GitOps 🏢 Description: Tech stack: Kubernetes (AKS & on‑prem) Docker OpenTofu / Terraform / Bicep Helm GitOps tools (ArgoCD, Flux) PostgreSQL Linux Requirements: 6–8+ years of experience with Linux and container platforms Deep expertise in Kubernetes (managed and on‑prem) Strong understanding of cloud‑native architectures Experience designing Kubernetes networking, storage, and security Hands‑on experience with PostgreSQL in cloud or containerized setups Infrastructure‑as‑Code experience Strong troubleshooting and performance tuning skills Nice to have: ELK stack experience Azure Administrator certification or experience Open‑source storage solutions (Ceph, Longhorn) Policy engines (OPA, Kyverno) Experienced in using AI tools in day-to-day workflow Project description: You will design and deliver production‑grade Kubernetes platforms across cloud and on‑prem environments hosting critical in‑house applications. The role focuses on portability, reliability, observability, and secure container lifecycle management. Main responsibilities: Design and deploy on‑prem and AKS Kubernetes clusters Enable workload portability between Azure and on‑prem Define networking, ingress, storage, and security patterns Deliver Kubernetes workload migrations Build container image lifecycle processes Collaborate closely with Platform Engineers and Infrastructure teams
Technology
emagine Polska
Site Reliability Engineer
Senior
Remote
Lisbon, Portugal
🏢 Summary: Hands-on Observability Engineer role focused on building and automating enterprise-grade monitoring and observability solutions across AWS-based cloud and distributed systems. The position centers on developing infrastructure as code, CI/CD pipelines, and monitoring ecosystems to improve reliability, performance, and incident response. Approximately 90% of the role involves coding in Python and Terraform. 🗂️ Requirements: Strong hands-on experience with AWS, Strong Python development and scripting experience, Strong experience with Terraform, Experience building and maintaining CI/CD pipelines using Jenkins, Experience with Elasticsearch and ELK Stack, Experience with Linux systems, Shell scripting skills, Understanding of monitoring, logging, and alerting concepts, Experience working in Agile or DevOps environments 📃 Skills: AWS, Python, Terraform, Jenkins, Elasticsearch, ELK, Linux, Bash, CI/CD, Kubernetes, Grafana, Prometheus, Datadog, NewRelic, Snowflake, Databricks, dbt, Matillion 🏢 Description: Role Overview We are looking for a skilled and proactive Observability Engineer to implement, automate, and support enterprise-grade observability and monitoring solutions across cloud and application platforms. The ideal candidate should have strong AWS infrastructure knowledge, hands-on automation skills, and experience building reliable monitoring and alerting ecosystems for modern distributed applications. The role involves working closely with Platform Engineering, Data Engineering, and Application teams to develop observability solutions and bring operational visibility, reliability, incident detection, and platform performance. Main Responsibilities · Design, implement, and maintain observability solutions for cloud-native and distributed systems. · Build monitoring, logging, alerting, and dashboarding solutions across infrastructure and applications. · Develop automation scripts and tooling using Python. · Implement and maintain Infrastructure as Code (IaC) using Terraform. · Build and support CI/CD pipelines using Jenkins and Git-based workflows. · Configure and optimize monitoring for AWS services, Kubernetes workloads, APIs, databases, and applications. · Create actionable alerts and operational dashboards to improve incident response and system reliability. · Work with engineering teams to onboard applications into observability platforms. · Support troubleshooting, root cause analysis, and performance optimization initiatives. · Ensure observability standards, governance, and best practices are followed across projects. Key Requirements · Strong hands-on experience with Amazon Web Services (AWS). · Solid Python development/scripting experience. · Strong experience with Terraform. · Experience building and maintaining CI/CD pipelines using Jenkins. · Elasticsearch / ELK Stack experience and building queries. · Worked with Data Platforms monitoring is preferred. · Experience with Linux systems and shell scripting. · Understanding of monitoring, logging, and alerting concepts. · Experience working in Agile/DevOps environments. Nice to Have Skills Experience with any of the following is highly desirable: · Snowflake · Databricks · dbt · Matillion · Grafana · New Relic · Datadog · Prometheus · Elasticsearch / ELK Stack experience NOTES: We are looking for an Engineer who loves to build. This is a highly technical role—90% of the job is hands-on coding in python and terraform.
Technology
Link Group
Senior Azure DevOps Engineer
Senior
Remote
Bialystok, Poland
140 - 155 PLN
🏢 Summary: Design, deploy, and maintain high-availability Azure cloud environments with a strong focus on AKS and Infrastructure as Code using Terraform. The role centers on secure, scalable, and well-monitored Azure infrastructure, including networking, identity, databases, and disaster recovery. You will drive automation and operational excellence across the Azure ecosystem. 🗂️ Requirements: Experience managing Azure Kubernetes Service (AKS) clusters, Proficiency in Infrastructure as Code using Terraform, Experience with YAML and Helm for Kubernetes deployments, Administration of Azure networking components (VNETs, NSGs), Management of Azure VMs, Storage Accounts, and ACR, Implementation of Azure AD (Entra ID) and IAM policies, Administration of Azure SQL and SQL Server environments, Configuration of monitoring with Azure Monitor, App Insights, and Log Analytics, Design and implementation of disaster recovery and backup strategies 📃 Skills: Azure, AKS, Kubernetes, Terraform, YAML, Helm, VNET, NSG, ACR, AzureAD, IAM, AzureSQL, SQLServer, AzureMonitor, AppInsights, LogAnalytics, Velero 🏢 Description: Role Overview We are looking for a highly skilled Azure Cloud & Platform Engineer to join our infrastructure team. In this role, you will be responsible for designing, deploying, and maintaining high-availability cloud environments with a heavy focus on container orchestration ( AKS ) and Infrastructure as Code ( Terraform ). You will ensure that our Azure ecosystem is secure, scalable, and monitored to the highest standards. Key Responsibilities Kubernetes Orchestration: Manage and optimize Azure Kubernetes Services (AKS) , including cluster configuration, scaling, and lifecycle management. Infrastructure as Code (IaC): Develop and maintain automated infrastructure deployments using Terraform , YAML , and Helm charts. Cloud Administration: Oversee core Azure resources including Networking (VNETs, NSGs), Storage Accounts, Azure VMs, and Container Registries (ACR). Security & Identity: Implement and manage Azure Active Directory (Azure AD/Entra ID) and Identity & Access Management (IAM) policies to ensure a "least privilege" environment. Database Management: Administer Azure SQL environments, including SQL Server, individual databases, and Elastic Pools. Observability & Monitoring: Set up and maintain robust monitoring solutions using Azure Monitor, App Insights, and Log Analytics . Disaster Recovery: Design and implement Disaster Recovery (DR) mechanisms and backup strategies (e.g., using Velero ). Technical Documentation: Create and maintain comprehensive documentation for system configurations, architecture setups, and operational procedures. Preferred Skills Experience with Velero for Kubernetes backups. Knowledge of the ELK Stack (ElasticSearch, Logstash, Kibana). Experience with Open Source monitoring tools: Prometheus, Grafana, and Loki . Familiarity with Ansible for configuration management. Exposure to Apache Kafka messaging systems. Candidate Profile The ideal candidate is a proactive engineer who prioritizes automation over manual intervention. You should be comfortable working in a fast-paced environment, taking ownership of cloud resources, and ensuring that all solutions are documented and resilient. Your approach should combine technical depth in Azure with a broader understanding of DevOps best practices.
Technology
Caspian One
Site Reliability Engineer
Senior
Hybrid
Krakow, Poland
1,400 - 1,800 PLN
🏢 Summary: Hands-on Site Reliability Engineer role focused on ensuring stability, scalability, and observability of a mission-critical distributed risk and analytics platform in hybrid cloud environments. The position centers on production reliability, incident response, automation, and continuous improvement of monitoring and deployment processes. You will collaborate with engineering teams to strengthen system resilience, performance, and operational standards. 🗂️ Requirements: Strong Java experience in distributed systems, Experience with observability and monitoring tools, Hands-on experience with hybrid cloud environments (preferably GCP), Experience with CI/CD pipelines and automation tools, Solid knowledge of Linux systems administration, Understanding of RDBMS fundamentals, Experience with job schedulers (e.g., Control-M), Ability to lead incident response and root-cause analysis 📃 Skills: Java, Grafana, Prometheus, Loki, OpenTelemetry, GCP, Jenkins, Ansible, Linux, SQL, Control-M, CI/CD 🏢 Description: We’re looking for a seasoned Site Reliability Engineer to support a high‑performance, mission‑critical risk and analytics platform used across global trading and finance environments. You’ll play a key role in ensuring the stability, scalability, and observability of complex distributed systems running across hybrid cloud infrastructure. In this role, you’ll take ownership of production reliability driving incident response, conducting root‑cause analysis, improving monitoring capabilities, and delivering automation that reduces operational toil. You’ll work closely with development teams, platform engineers, and service management leads to strengthen resilience, refine processes, and enhance the engineering culture around availability and performance. This is a hands on technical position suited to someone who thrives in high‑throughput environments, communicates clearly, and enjoys solving deep engineering problems in real time. Core Responsibilities Maintain and improve the reliability, uptime, and performance of distributed applications. Lead incident response, triage complex issues, coordinate recoveries, and deliver structured post‑incident reviews. Enhance observability—designing and evolving monitoring, alerting, logging, and tracing frameworks. Drive continuous improvement across automation, deployment processes, and service stability. Collaborate with cross‑functional teams to influence architecture, design, and operational standards. Support CI/CD pipelines, environment configuration, and vulnerability remediation. Contribute to a knowledge‑driven culture through documentation, tooling, and best‑practice adoption. Required Skills & Experience Strong Java background with proven experience supporting or developing distributed systems. Observability tooling expertise (Grafana, Prometheus, Loki, OpenTelemetry or similar). Hands‑on with hybrid cloud environments , ideally with GCP or another major cloud provider. CI/CD and automation experience (e.g., Jenkins, Ansible). Solid understanding of Linux , RDBMS fundamentals , and job schedulers (e.g., Control‑M or equivalents). Strong analytical mindset with a methodical approach to troubleshooting. Excellent communication skills and comfort working in Agile teams.
Technology
emagine Polska
Senior Virtualisierungsspezialist (m/w/d)
Senior
Hybrid
Gilching, BY, Germany
🏢 Summary: Senior Virtualisierungsspezialist zur Verwaltung, Optimierung und Weiterentwicklung hochverfügbarer virtualisierter Plattformen. Verantwortung für den Betrieb von VMware (vSphere, ESXi) und/oder Proxmox sowie für Fehleranalyse, Performance-Optimierung und Integration in bestehende Infrastruktur- und Sicherheitsarchitekturen. Enge Zusammenarbeit mit Infrastruktur- und Netzwerkteams in sicherheitskritischen Umgebungen. 🗂️ Requirements: Mehrjährige praktische Erfahrung in Virtualisierung, Fundierte Betriebserfahrung mit VMware vSphere und ESXi oder Proxmox, Erfahrung in Fehleranalyse und Performance-Optimierung, Verständnis von Hochverfügbarkeitskonzepten, Kenntnisse in Plattform-Sicherheitsarchitekturen, Sehr gute Deutschkenntnisse (C1) 📃 Skills: Virtualisierung, VMware, vSphere, ESXi, Proxmox, Hochverfügbarkeit, Sicherheitsarchitektur, Performanceanalyse, Troubleshooting 🏢 Description: Einführung & Zusammenfassung: Für die effiziente Verwaltung und Integration hochverfügbarer Plattformen suchen wir einen erfahrenen Senior Virtualisierungsspezialisten. Die ideale Kandidatin oder der ideale Kandidat bringt umfangreiche praktische Erfahrung in der Virtualisierung mit, insbesondere mit VMware und Proxmox, sowie Kenntnisse in Fehleranalyse und Performance-Optimierung. Die Rolle erfordert auch ein tiefes Verständnis für Hochverfügbarkeit und Sicherheitsarchitekturen. Hauptverantwortlichkeiten: Überwachung, Analyse und Weiterentwicklung virtualisierter Umgebungen. Betrieb und Management von VMware (vSphere, ESXi) und/oder Proxmox. Durchführung von Fehleranalysen, Performance-Optimierungen und Stabilitätsmaßnahmen. Integration virtueller Lösungen in bestehende Infrastruktur- und Sicherheitsarchitekturen. Enge Zusammenarbeit mit Infrastruktur- und Netzwerkteams. Anforderungen: Mehrjährige praktischen Erfahrung in der Virtualisierung. Fundiertes Betriebswissen (keine rein theoretischen Profile). Gutes Verständnis für Hochverfügbarkeit und Plattform-Sicherheit. Sehr gute Deutschkenntnisse (mindestens C1-Niveau). Wünschenswerte Qualifikationen: Kenntnisse in Cloud-Umgebungen. Erfahrung mit Backup- und Wiederherlösungssystemen. Vertrautheit mit Automatisierungstools. Sonstige Details Aufgrund unserer Zusammenarbeit mit Kunden im Bereich kritischer Infrastrukturen kann eine Sicherheitsüberprüfung nach Sicherheitsüberprüfungsgesetz (SÜG) erforderlich sein. Wir freuen uns auf Ihre Bewerbung!
Technology
Spyrosoft
DevOps Engineer (Senior)
Senior
Remote
Krakow, Poland
110 - 200 PLN
🏢 Summary: The offer is for a Cloud Infrastructure Specialist responsible for ensuring production stability, secure cloud networking, and scalable infrastructure within AWS and Azure environments. The role focuses on hands-on infrastructure as code, observability, and CI/CD optimization while closely collaborating with a development team. It emphasizes autonomy and direct impact on production reliability rather than ticket-based support. 🗂️ Requirements: Proven experience maintaining production-grade environments, Hands-on experience with AWS services (Lambda, API Gateway, DynamoDB, RDS, S3, SNS, SQS, EC2, ECS, WAF, VPC, Route53, ALB/NLB, Cognito, IAM), Hands-on experience with Azure services, Strong practical experience with cloud networking (VPC/VNet, subnetting, routing, peering, NAT, security groups, firewalls), Hands-on experience with Datadog for monitoring, logging, alerting, In-depth commercial experience with AWS and Azure, Proficiency with Terraform or AWS CDK, Experience building and maintaining CI/CD pipelines using GitLab CI/CD, Ability to support automated deployments and infrastructure changes 📃 Skills: AWS, Azure, Terraform, CDK, TypeScript, Datadog, Prometheus, Grafana, Loki, Kubernetes, AKS, Lambda, APIGateway, DynamoDB, S3, SNS, SQS, EC2, ECS, WAF, VPC, Route53, ALB, NLB, Cognito, IAM, RDS, Redshift, BlobStorage, PostgreSQL, GitLabCI, AzureDevOps, RabbitMQ, InfluxDB 🏢 Description: You will join a substantial project as a key infrastructure specialist. You won't be managing a ticket queue; instead, you will partner directly with a mid-sized team of developers (~15 people) to ensure system stability and scalability. We are looking for someone who acts independently and is ready to ensure production reliability and secure cloud network configuration. Our Tech Stack You are not expected to know everything upfront, but this is the environment you will work with: IaC: Terraform, AWS CDK (TypeScript) Clouds: AWS & Azure Observability: Datadog, Prometheus, Grafana, Loki Core Azure: Kubernetes (AKS), Blob Storage, PostgreSQL Core AWS: Lambda, API Gateway, DynamoDB, S3, SNS, SQS, EC2, WAF, VPC CI/CD: GitLab CI, Azure DevOps Other: RabbitMQ, InfluxDB, Renovate Requirements: Production Experience: proven track record of configuring and maintaining production-grade environments. Hands-on experience with AWS (Lambda, API Gateway, DynamoDB, Redshift, RDS, S3, SNS, SQS, EC2, ECS, WAF, VPC, Route53, ALB/NLB, Cognito, IaM) Hands-on experience with Azure. Observability & monitoring: hands‑on experience with Datadog for monitoring, logging, alerting and performance analysis in production environments. Cloud Networking: strong practical experience in configuring Cloud Networks (VPC/VNet, Subnetting, Routing, Peering, NAT Gateways, Security Groups/Firewalls). Cloud Expertise: in-depth knowledge and commercial experience with Azure and AWS. Tooling: proficiency with Terraform or AWS CDK. Practical experience in building and maintaining CI/CD pipelines using GitLab CI/CD, supporting automated deployments and infrastructure changes. High autonomy and ability to communicate technical concepts to a cross-functional team. Nice to have: Experience with TypeScript , especially in AWS CDK or serverless applications. Main responsibilities: Production Stability: Maintain high availability and security of production environments. Cloud Networking: Configure and manage VPCs/VNets, subnets, routing tables, peering, and network isolation. Infrastructure as Code: Provision and manage resources using Terraform or AWS CDK. Developer Support: Optimize CI/CD pipelines and assist developers in understanding infrastructure constraints. Observability: Maintain monitoring stacks to ensure full system visibility.
Technology
Link Group
Senior Devops Engineer
Senior
Hybrid
Warsaw, Poland
28,000 - 38,000 PLN
🏢 Summary: Senior DevOps Engineer role focused on owning and evolving cloud-native infrastructure and CI/CD platforms that support large-scale data processing systems. The position combines hands-on engineering and strategic impact to ensure scalable, secure, and reliable production environments. You will design, automate, and optimize platform services enabling efficient delivery of data-driven applications. 🗂️ Requirements: 5+ years in DevOps, SRE, or infrastructure engineering, Experience supporting distributed production systems, Hands-on experience with public cloud platforms, Strong knowledge of containerization and orchestration, Experience with infrastructure as code, Strong scripting or programming skills, Experience building and maintaining CI/CD pipelines, Knowledge of observability practices and tools, Strong troubleshooting and incident response skills in Linux environments 📃 Skills: AWS, Docker, Kubernetes, Terraform, Python, Bash, CI/CD, Linux, Monitoring, Logging, Alerting 🏢 Description: Senior DevOps Engineer We are looking for an experienced engineer to take ownership of our infrastructure and platform ecosystem, supporting large-scale data processing systems and enabling efficient, reliable software delivery. This role combines hands-on engineering with strategic impact — you will design, build, and evolve the platform that underpins data pipelines and production services, ensuring scalability, security, and operational excellence across environments. Key Responsibilities Own and evolve CI/CD and automation platforms to support fast and reliable delivery of data-driven applications Design and manage cloud-native infrastructure supporting high-volume data ingestion, processing, and serving Build and maintain infrastructure as code to ensure consistency and scalability across environments Manage containerized environments and orchestration platforms to deliver resilient and scalable services Implement observability solutions (monitoring, logging, alerting) to ensure full system visibility and reliability Automate deployment processes, configuration management, and system recovery workflows Collaborate with engineering, data, and compliance teams to deliver secure and production-ready solutions Drive incident management practices and continuous improvement initiatives Contribute to platform strategy, tooling decisions, and mentoring within the team Requirements 5+ years of experience in DevOps, SRE, or infrastructure engineering roles Strong experience supporting production systems in distributed environments Hands-on experience with public cloud platforms (AWS or similar) Solid knowledge of containerization and orchestration technologies (Docker, Kubernetes) Experience with infrastructure as code tools (e.g., Terraform) Strong scripting/programming skills (Python, Bash, or similar) Experience building and maintaining CI/CD pipelines and automation tooling Knowledge of observability practices and tools Strong troubleshooting and incident response skills in Linux environments Excellent communication skills and ability to work cross-functionally Nice to Have Experience working with large-scale data platforms Exposure to regulated environments or compliance requirements Experience contributing to platform or engineering standards
Technology
Link Group
Senior Devops Engineer
Senior
Hybrid
Warsaw, Poland
28,000 - 38,000 PLN
🏢 Summary: Senior DevOps Engineer role focused on owning and evolving cloud-native infrastructure and CI/CD platforms supporting large-scale data processing systems. The position combines hands-on engineering with strategic platform development to ensure scalable, secure, and reliable production environments. You will design, automate, and maintain infrastructure and observability solutions across distributed systems. 🗂️ Requirements: 5+ years in DevOps, SRE, or infrastructure engineering, Experience supporting production systems in distributed environments, Hands-on experience with public cloud platforms (AWS or similar), Strong knowledge of Docker and Kubernetes, Experience with infrastructure as code tools (Terraform), Strong scripting/programming skills (Python or Bash), Experience building and maintaining CI/CD pipelines, Knowledge of observability, monitoring, and logging tools, Strong troubleshooting and incident response skills in Linux environments 📃 Skills: AWS, Docker, Kubernetes, Terraform, Python, Bash, Linux, CICD, Observability, Automation, Infrastructure, Cloud 🏢 Description: Senior DevOps Engineer We are looking for an experienced engineer to take ownership of our infrastructure and platform ecosystem, supporting large-scale data processing systems and enabling efficient, reliable software delivery. This role combines hands-on engineering with strategic impact — you will design, build, and evolve the platform that underpins data pipelines and production services, ensuring scalability, security, and operational excellence across environments. Key Responsibilities Own and evolve CI/CD and automation platforms to support fast and reliable delivery of data-driven applications Design and manage cloud-native infrastructure supporting high-volume data ingestion, processing, and serving Build and maintain infrastructure as code to ensure consistency and scalability across environments Manage containerized environments and orchestration platforms to deliver resilient and scalable services Implement observability solutions (monitoring, logging, alerting) to ensure full system visibility and reliability Automate deployment processes, configuration management, and system recovery workflows Collaborate with engineering, data, and compliance teams to deliver secure and production-ready solutions Drive incident management practices and continuous improvement initiatives Contribute to platform strategy, tooling decisions, and mentoring within the team Requirements 5+ years of experience in DevOps, SRE, or infrastructure engineering roles Strong experience supporting production systems in distributed environments Hands-on experience with public cloud platforms (AWS or similar) Solid knowledge of containerization and orchestration technologies (Docker, Kubernetes) Experience with infrastructure as code tools (e.g., Terraform) Strong scripting/programming skills (Python, Bash, or similar) Experience building and maintaining CI/CD pipelines and automation tooling Knowledge of observability practices and tools Strong troubleshooting and incident response skills in Linux environments Excellent communication skills and ability to work cross-functionally Nice to Have Experience working with large-scale data platforms Exposure to regulated environments or compliance requirements Experience contributing to platform or engineering standards