June 8, 2026
DevOps Engineer
Mid • Remote
Łódź, Poland
About the role
We are looking for a DevOps Engineer to help build and operate automation, deployment, and reliability standards for large-scale GPU infrastructure used for AI training and inference workloads.
In this role, you will work on software-defined infrastructure supporting GPU clusters, high-performance networking, storage platforms, and internal AI services. This is a hands-on position for someone who is comfortable working close to infrastructure, improving operational processes, and building reliable automation in a complex technical environment.
Responsibilities
Design, implement, and maintain Infrastructure as Code solutions for provisioning and managing bare-metal GPU servers, networking, storage, and cluster orchestration components
Build and improve CI/CD pipelines for infrastructure, platform services, and internal tooling
Develop and maintain monitoring, logging, alerting, and observability solutions for large-scale GPU environments
Support reliability initiatives by defining and tracking SLIs/SLOs, automating incident response, and contributing to post-incident analysis
Automate operational tasks such as cluster scaling, firmware and BIOS updates, hardware validation, diagnostics, and capacity planning
Work closely with Infrastructure, Networking, Facilities, and AI/ML teams to ensure stable and scalable platform operations
Support DevSecOps practices, including infrastructure hardening, vulnerability management, and compliance automation
Identify repetitive manual work and replace it with efficient automation
Evaluate new tools and solutions related to GPU infrastructure, orchestration, and cloud-native operations
Requirements
4–7 years of experience in DevOps, SRE, Platform Engineering, or a similar role
Strong practical experience with infrastructure automation in complex production environments
Good hands-on knowledge of Terraform, Ansible, or similar Infrastructure as Code tools
Experience building and maintaining CI/CD pipelines and working with GitOps practices
Good understanding of infrastructure security, vulnerability management, and security best practices
Experience with security tools such as Snyk, CrowdStrike, or similar solutions
Practical experience with Kubernetes
Experience working with GPU-related technologies such as NVIDIA GPU Operator, device plugins, MIG, or time-slicing
Good scripting or programming skills in Python, Go, or Bash
Experience with bare-metal provisioning, low-level infrastructure automation, or data center operations
Good knowledge of observability tools such as Prometheus, Grafana, Loki, and OpenTelemetry
Ability to work independently, prioritize tasks, and communicate effectively with technical teams
English at a communicative level or better is required, as you will be working in an international team
Nice to have
Experience in AI infrastructure, HPC environments, hyperscale infrastructure, or data center operations
Familiarity with orchestration and scheduling tools such as Slurm, Ray, Run:ai, KServe, or Kubernetes-based schedulers
Experience integrating telemetry from power, cooling, or environmental systems
Experience building internal platforms or self-service tools for engineering teams
Understanding of compliance and audit requirements in security-sensitive environments
What we offer
Benefits package
Opportunity to work on advanced infrastructure supporting large-scale AI workloads
Real impact on the reliability and scalability of next-generation compute environments
Collaboration with experienced engineers across infrastructure, platform, and AI domains
A fast-moving environment with space for ownership, technical input, and professional growth
Technology
Sii
DevOps Engineer with GCP (f/m/x)
Mid
Hybrid
Krakow, Poland
13,000 - 23,000 PLN
🏢 Summary: The offer is for an experienced DevOps Engineer to design, automate, and maintain cloud infrastructure in a regulated banking environment using Google Cloud Platform. The role focuses on managing GCP resources, Kubernetes clusters, Infrastructure-as-Code, and CI/CD pipelines to ensure secure and stable mission-critical systems. It requires hands-on cloud engineering and automation expertise within a distributed team. 🗂️ Requirements: Minimum 3 years of experience managing virtual machines in GCP (Compute Engine), Hands-on experience with Kubernetes (GKE), Experience creating and maintaining Infrastructure-as-Code with Terraform, Experience building and maintaining CI/CD pipelines, Practical experience with Python scripting, Understanding of software development best practices, Fluency in Polish and English, Willingness to work hybrid model in Cracow, Residence in Poland 📃 Skills: GCP, ComputeEngine, Kubernetes, GKE, Terraform, CI/CD, Python, CloudLogging, CloudMonitoring 🏢 Description: Join our team and contribute to the design, development, and ongoing maintenance of infrastructure within regulated banking environments. We are seeking an experienced DevOps Engineer with hands‑on expertise in Google Cloud Platform, who will play a key role in ensuring the stability, security, compliance, and automation of mission‑critical systems. 
Your tasks Managing virtual machine environments and containerized applications in Google Kubernetes Engine (GKE) Monitoring and troubleshooting GCP environments using Cloud Logging, Cloud Monitoring, and Alerting Policies Creating and maintaining Infrastructure‑as‑Code (IaC) using Terraform Designing, building, and maintaining CI/CD pipelines to automate software delivery Developing and maintaining automation scripts using Python to improve operational efficiency Ensuring compliance with software development best practices, security standards, and organizational policies Analyzing and fulfilling requests from internal customers Collaborating within a globally distributed team in a dynamic environment Requirements At least 3 years of experience managing virtual machine environments in GCP (Compute Engine) Hands-on experience with Kubernetes (GKE) Expertise in creating and maintaining Infrastructure-as-Code with Terraform Strong understanding of CI/CD concepts and experience in building automated workflows Practical knowledge of Python for scripting and automation tasks Awareness of the need to follow best practices in software development Ability to work in a global, distributed team Fluency in both Polish and English (spoken and written) Willingness to work in a hybrid model – 3 day per week in the Cracow office Residing in Poland required What we offer Great Place to Work since 2015 - it’s thanks to feedback from our workers that we get this special title and constantly implement new ideas Employment stability - revenue of PLN 2.1BN, no debts, since 2006 on the market We share the profit with Workers - over PLN 76M has already been allocated for this aim since 2022 Attractive benefits package - private healthcare, benefits cafeteria platform, car discounts and more Comfortable workplace – class A offices or remote work Dozens of fascinating projects for prestigious brands from all over the world PLN 1 000 000 per year for your ideas - with this amount, we support 
the passions and voluntary actions of our workers Investment in your growth – meetups, webinars, training platform and technology blog – you choose Fantastic atmosphere created by all Sii Power People If you want to work on systems with high operational significance — apply now!
Technology
N-iX
Middle DevOps Engineer (#5068)
Mid
Remote
Krakow, Poland
5,000 - 5,500 USD
🏢 Summary: DevOps Engineer role focused on building, scaling, and securing cloud infrastructure while enabling efficient CI/CD workflows. The position involves managing Kubernetes-based environments on AWS, optimizing automation, and ensuring high availability and performance of systems. The role also includes infrastructure as code, monitoring, database management, and secure authentication integration.
🗂️ Requirements: BA/BS in technical field or equivalent experience, 5+ years in DevOps, SRE, or Infrastructure Engineering, Strong experience with Kubernetes, Deep knowledge of AWS core services, Experience with containerization technologies, Strong understanding of networking concepts, Proficiency with infrastructure-as-code tools, Experience with monitoring tools, Strong scripting or programming skills, Solid understanding of system security best practices
📃 Skills: Kubernetes, AWS, EC2, S3, IAM, RDS, Docker, Helm, Terraform, Pulumi, CloudFormation, Prometheus, Grafana, CloudWatch, Python, Bash, Go, PostgreSQL, SAML, OAuth2, OIDC, ELK, Loki, Fluent Bit, GitHub Actions, Jenkins, CircleCI, Ansible, EKS, GKE
🏢 Description: We are looking for a skilled and driven DevOps Engineer to join our growing team. In this role, you will take ownership of building, maintaining, and scaling the infrastructure that powers our platform. You will ensure our systems are secure, performant, and highly available, while enabling seamless development and deployment workflows.
Responsibilities:
Design, implement, and manage scalable infrastructure using Kubernetes and AWS
Optimize CI/CD pipelines to improve build and deployment times and reduce friction
Monitor and troubleshoot infrastructure performance and availability
Manage and maintain relational databases, primarily PostgreSQL
Implement and support secure authentication systems using SSO protocols (e.g., SAML, OIDC, OAuth2)
Enhance infrastructure as code using tools like Terraform and Ansible
Ensure security best practices are applied across all infrastructure components
Collaborate cross-functionally with development, QA, and product teams
Drive automation of operational tasks to increase team efficiency and reduce manual toil
Required Skills:
BA/BS in a technical or engineering discipline, or equivalent experience
5+ years of experience in a DevOps, SRE, or Infrastructure Engineering role
Strong experience with Kubernetes (EKS, GKE, or self-managed)
Deep knowledge of AWS core services (EC2, S3, IAM, RDS, etc.)
Knowledge of containerization technologies (e.g., Docker, Kubernetes, Helm)
Solid understanding of networking concepts (VPCs, subnets, routing, firewalls, DNS)
Proficiency with infrastructure-as-code tools (Terraform, Pulumi, or CloudFormation)
Comfortable with monitoring tools (Prometheus, Grafana, CloudWatch, etc.)
Strong scripting or programming ability (Python, Bash, or Go)
Solid understanding of system security and best practices
Preferred Skills:
Familiarity with SSO protocols such as SAML, OAuth2, and OpenID Connect
Experience managing and tuning PostgreSQL in production environments
Exposure to log aggregation tools (ELK, Loki, or Fluent Bit)
Experience with CI/CD tools like GitHub Actions, Jenkins, or CircleCI
We offer:
Flexible working format: remote, office-based, or flexible
A competitive salary and good compensation package
Personalized career growth
Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
Active tech communities with regular knowledge sharing
Education reimbursement
Memorable anniversary presents
Corporate events and team buildings
Other location-specific benefits
Technology
xBerry Sp. z o.o.
DevOps Engineer
Senior
Remote
Wrocław, Poland
20,000 - 28,000 PLN/mo
🏢 Summary: DevOps Engineer role focused on maintaining and enhancing a complex, on-premise automation platform deployed globally on Linux and Kubernetes. The position involves advanced troubleshooting, incident response, and development of automation, monitoring, and self-healing mechanisms to reduce on-site interventions. Includes international travel and participation in an on-call rotation to ensure high system reliability.
🗂️ Requirements: Strong Linux (Ubuntu) administration and troubleshooting experience, Hands-on Kubernetes cluster management and troubleshooting, Practical Docker experience, Solid networking knowledge and network diagnostics skills, Experience with NFS and storage troubleshooting, Operational knowledge of GPU and CUDA environments, Experience with RabbitMQ, Experience with PostgreSQL, Ability to handle production incidents and system upgrades, Willingness to participate in on-call rotation, Readiness for international travel and on-site work
📃 Skills: Linux, Ubuntu, Kubernetes, Docker, Networking, NFS, CUDA, GPU, RabbitMQ, PostgreSQL
🏢 Description:
Position Overview
Important: Travel & On-Call Requirements
This role requires readiness for long-distance international travel to customer sites. The systems are deployed globally and, when issues cannot be resolved remotely, on-site interventions may be necessary, including deployments, upgrades, and complex troubleshooting activities.
Additionally, the position includes participation in a rotational on-call / standby schedule, ensuring operational continuity and the ability to respond to critical incidents outside of standard working hours.
We are looking for an experienced DevOps Engineer to join a team responsible for the maintenance and further development of a complex automation system deployed on-premise at customer sites. The system is based on Linux (Ubuntu) and a containerized Kubernetes architecture.
The platform consists of multiple cooperating application and infrastructure components, including backend services, GPU-based computing components (CUDA), a communication layer, storage, and networking components.
The environment is characterized by high operational complexity and strong dependencies between system layers (OS, Kubernetes, applications, networking, storage). Systems are deployed across multiple locations worldwide and often operate in environments with limited local IT support, which requires high reliability and well-defined operational procedures.
Responsibilities
Incident Handling and System Maintenance
Diagnosing and resolving issues related to Kubernetes clusters, containers (Docker), the Linux (Ubuntu) operating system, networking, and storage (including NFS)
Analyzing logs and service health across application and infrastructure layers
Restoring full system functionality in production environments
Performing system deployments and upgrades at customer sites
Participating in on-site interventions when issues cannot be resolved remotely
Automation, Observability, and System Resilience
Designing and developing automated troubleshooting mechanisms
Early detection of infrastructure and application-level issues
Automated validation of the health of key system components: OS, Kubernetes, containers, storage, networking
Building health checks and observability solutions (metrics, alerts, dashboards)
Creating and maintaining runbooks, standard recovery procedures, and automated self-healing mechanisms
Documenting common incidents, root causes, and resolution methods
Technical Requirements
Strong experience with Linux (Ubuntu) system administration and troubleshooting
Hands-on experience with Kubernetes, including cluster troubleshooting and container analysis
Practical knowledge of Docker
Solid understanding of networking and diagnosing network-related issues
Experience with NFS / storage troubleshooting
Operational knowledge of GPU / CUDA environments (compatibility, stability)
Experience working with RabbitMQ and PostgreSQL
Additional Requirements
Willingness to participate in an on-call / standby rotation
Readiness for business travel, including on-site customer visits
Ability to work independently in complex, distributed environments
Strong analytical and problem-solving skills
We offer
Flexible working hours
Remote work options
Medical care program
MultiSport
Integration events
A contract of employment or self-employment, depending on your preference
Technology
xBerry Sp. z o.o.
DevOps Engineer
Senior
Remote
Wrocław, Poland
20,000 - 28,000 PLN/mo
🏢 Summary: DevOps Engineer role focused on maintaining and enhancing a complex, on-premise Kubernetes-based automation platform deployed globally. The position involves advanced troubleshooting across Linux, containers, networking, storage, and GPU layers, as well as building automation and observability to reduce on-site interventions. Includes international travel and participation in an on-call rotation to support production systems.
🗂️ Requirements: Strong Linux (Ubuntu) administration and troubleshooting experience, Hands-on Kubernetes cluster management and troubleshooting, Practical Docker experience, Solid networking diagnostics skills, Experience with NFS and storage troubleshooting, Operational knowledge of GPU/CUDA environments, Experience with RabbitMQ, Experience with PostgreSQL, Ability to handle production incidents across infrastructure and application layers, Willingness to participate in on-call rotation, Readiness for international travel and on-site support
📃 Skills: Linux, Ubuntu, Kubernetes, Docker, Networking, NFS, CUDA, GPU, RabbitMQ, PostgreSQL
🏢 Description:
Position Overview
Important: Travel & On-Call Requirements
This role requires readiness for long-distance international travel to customer sites. The systems are deployed globally and, when issues cannot be resolved remotely, on-site interventions may be necessary, including deployments, upgrades, and complex troubleshooting activities.
Additionally, the position includes participation in a rotational on-call / standby schedule, ensuring operational continuity and the ability to respond to critical incidents outside of standard working hours.
We are looking for an experienced DevOps Engineer to join a team responsible for the maintenance and further development of a complex automation system deployed on-premise at customer sites. The system is based on Linux (Ubuntu) and a containerized Kubernetes architecture.
The platform consists of multiple cooperating application and infrastructure components, including backend services, GPU-based computing components (CUDA), a communication layer, storage, and networking components.
The environment is characterized by high operational complexity and strong dependencies between system layers (OS, Kubernetes, applications, networking, storage). Systems are deployed across multiple locations worldwide and often operate in environments with limited local IT support, which requires high reliability and well-defined operational procedures.
The DevOps role goes beyond reactive incident handling. A key objective of the project is to systematically reduce the need for on-site interventions by developing automated monitoring, diagnostics, and recovery mechanisms.
Responsibilities
Incident Handling and System Maintenance
Diagnosing and resolving issues related to Kubernetes clusters, containers (Docker), the Linux (Ubuntu) operating system, networking, and storage (including NFS)
Analyzing logs and service health across application and infrastructure layers
Restoring full system functionality in production environments
Performing system deployments and upgrades at customer sites
Participating in on-site interventions when issues cannot be resolved remotely
Automation, Observability, and System Resilience
Designing and developing automated troubleshooting mechanisms
Early detection of infrastructure and application-level issues
Automated validation of the health of key system components: OS, Kubernetes, containers, storage, networking
Building health checks and observability solutions (metrics, alerts, dashboards)
Creating and maintaining runbooks, standard recovery procedures, and automated self-healing mechanisms
Documenting common incidents, root causes, and resolution methods
Collaboration and Architecture Improvement
Close cooperation with development and architecture teams
Contributing to architecture simplification and standardization
Improving overall system stability and reliability
Supporting long-term efforts to reduce operational overhead and manual interventions
Technical Requirements
Strong experience with Linux (Ubuntu) system administration and troubleshooting
Hands-on experience with Kubernetes, including cluster troubleshooting and container analysis
Practical knowledge of Docker
Solid understanding of networking and diagnosing network-related issues
Experience with NFS / storage troubleshooting
Operational knowledge of GPU / CUDA environments (compatibility, stability)
Experience working with RabbitMQ and PostgreSQL
Additional Requirements
Willingness to participate in an on-call / standby rotation
Readiness for business travel, including on-site customer visits
Ability to work independently in complex, distributed environments
Strong analytical and problem-solving skills
We offer
Salary: 20–28k PLN B2B base + action fee
Flexible working hours
Remote work options
Medical care program
MultiSport
Integration events
A contract of employment or self-employment, depending on your preference