June 8, 2026

Lead DevOps Engineer

Senior • Remote

Łódź, Poland

About the role

We are looking for a Lead DevOps Engineer to provide technical leadership for DevOps and Site Reliability Engineering practices supporting large-scale GPU infrastructure used for AI training and inference workloads.

This role combines hands-on engineering with team leadership. You will be responsible for shaping automation standards, improving platform reliability, and leading a team working on software-defined infrastructure, high-performance networking, observability, and operational excellence across complex production environments.

Responsibilities

Lead, mentor, and support a team of DevOps and SRE engineers working across the full lifecycle of GPU infrastructure platforms
Design and implement Infrastructure as Code solutions for provisioning and managing bare-metal GPU servers, networking, storage, and cluster orchestration components
Build and improve CI/CD pipelines for infrastructure, platform services, and internal tooling
Develop and maintain monitoring, logging, alerting, and observability solutions for large-scale GPU environments
Define and track SLIs/SLOs, improve incident response processes, and contribute to post-incident reviews and long-term reliability improvements
Work closely with Infrastructure, Networking, Facilities, and AI/ML teams to ensure stable and scalable platform operations
Automate operational processes such as cluster scaling, firmware and BIOS updates, hardware diagnostics, and capacity planning
Support DevSecOps practices, including infrastructure hardening, vulnerability management, and compliance automation
Identify operational inefficiencies and reduce repetitive manual work through automation
Evaluate and introduce new tools and solutions related to GPU infrastructure, orchestration, and cloud-native operations

Requirements

8+ years of experience in DevOps, SRE, Platform Engineering, or a similar area
At least 3 years of experience in a technical lead, lead engineer, or team leadership role
Strong practical experience with infrastructure automation in large-scale or complex production environments
Very good knowledge of Terraform, Ansible, Pulumi, Crossplane, or similar Infrastructure as Code tools
Experience with GitOps, configuration management, and CI/CD practices
Hands-on experience with Kubernetes
Experience working with GPU-related technologies such as NVIDIA GPU Operator, device plugins, MIG, or time-slicing
Good scripting or programming skills in Python, Go, or Bash
Experience with bare-metal provisioning, infrastructure automation, or data center environments
Good knowledge of observability tools such as Prometheus, Grafana, Loki, and OpenTelemetry
Good understanding of distributed systems reliability and production incident management
Experience with high-performance networking technologies such as RDMA, InfiniBand, or RoCE will be a strong advantage
Ability to lead technical discussions, support team development, and communicate effectively with both technical and business stakeholders
English proficiency at least at a communicative level is required, as you will be working in an international team

Nice to have

Experience in AI infrastructure, HPC environments, hyperscale infrastructure, or data center operations
Familiarity with orchestration and scheduling tools such as Slurm, Ray, Run:ai, KServe, or Kubernetes-based schedulers
Experience integrating telemetry from power, cooling, or environmental systems
Experience building internal platforms or self-service tools for engineering or research teams
Understanding of security, compliance, and audit requirements in regulated or security-sensitive environments

What we offer

Benefits package
Opportunity to shape the DevOps and SRE foundation for advanced GPU infrastructure supporting AI workloads
Real impact on the scalability, reliability, and operational standards of next-generation compute environments
Collaboration with experienced engineers across infrastructure, platform, and AI domains
A dynamic environment with space for ownership, technical leadership, and professional growth

Similar jobs you might like

Technology

ALTER GPU CENTER

DevOps Engineer

Mid

Remote

Łódź, Poland

🏢 Summary: Hands-on DevOps Engineer role focused on building and operating automation, deployment, and reliability standards for large-scale GPU infrastructure supporting AI training and inference. The position involves Infrastructure as Code, CI/CD, observability, security, and low-level automation across bare-metal servers, networking, storage, and Kubernetes-based platforms. The role emphasizes reliability, scalability, and automation in complex, high-performance environments. 🗂️ Requirements: 4–7 years in DevOps, SRE, or Platform Engineering, Experience with infrastructure automation in production environments, Hands-on experience with Terraform or Ansible, Experience building and maintaining CI/CD pipelines, Knowledge of GitOps practices, Understanding of infrastructure security and vulnerability management, Experience with security tools (e.g., Snyk, CrowdStrike), Practical experience with Kubernetes, Experience with GPU technologies (e.g., NVIDIA GPU Operator, MIG), Scripting or programming skills in Python, Go, or Bash, Experience with bare-metal provisioning or low-level infrastructure automation, Knowledge of observability tools (Prometheus, Grafana, Loki, OpenTelemetry) 📃 Skills: Terraform, Ansible, Kubernetes, Python, Go, Bash, Prometheus, Grafana, Loki, OpenTelemetry, Snyk, CrowdStrike, NVIDIA, MIG, CI/CD, GitOps 🏢 Description: About the role We are looking for a DevOps Engineer to help build and operate automation, deployment, and reliability standards for large-scale GPU infrastructure used for AI training and inference workloads. In this role, you will work on software-defined infrastructure supporting GPU clusters, high-performance networking, storage platforms, and internal AI services. This is a hands-on position for someone who is comfortable working close to infrastructure, improving operational processes, and building reliable automation in a complex technical environment. Responsibilities Design, implement, and maintain Infrastructure as Code solutions for provisioning and managing bare-metal GPU servers, networking, storage, and cluster orchestration components Build and improve CI/CD pipelines for infrastructure, platform services, and internal tooling Develop and maintain monitoring, logging, alerting, and observability solutions for large-scale GPU environments Support reliability initiatives by defining and tracking SLIs/SLOs , automating incident response, and contributing to post-incident analysis Automate operational tasks such as cluster scaling, firmware and BIOS updates, hardware validation, diagnostics, and capacity planning Work closely with Infrastructure, Networking, Facilities, and AI/ML teams to ensure stable and scalable platform operations Support DevSecOps practices, including infrastructure hardening, vulnerability management, and compliance automation Identify repetitive manual work and replace it with efficient automation Evaluate new tools and solutions related to GPU infrastructure, orchestration, and cloud-native operations Requirements 4–7 years of experience in DevOps, SRE, Platform Engineering , or a similar role Strong practical experience with infrastructure automation in complex production environments Good hands-on knowledge of Terraform, Ansible , or similar Infrastructure as Code tools Experience building and maintaining CI/CD pipelines and working with GitOps practices Good understanding of infrastructure security, vulnerability management, and security best practices Experience with security tools such as Snyk, CrowdStrike , or similar solutions Practical experience with Kubernetes Experience working with GPU-related technologies such as NVIDIA GPU Operator, device plugins, MIG, or time-slicing Good scripting or programming skills in Python, Go, or Bash Experience with bare-metal provisioning, low-level infrastructure automation, or data center operations Good knowledge of observability tools such as Prometheus, Grafana, Loki, and OpenTelemetry Ability to work independently, prioritize tasks, and communicate effectively with technical teams English proficiency at least at a communicative level is required, as you will be working in an international team Nice to have Experience in AI infrastructure, HPC environments, hyperscale infrastructure, or data center operations Familiarity with orchestration and scheduling tools such as Slurm, Ray, Run:ai, KServe , or Kubernetes-based schedulers Experience integrating telemetry from power, cooling, or environmental systems Experience building internal platforms or self-service tools for engineering teams Understanding of compliance and audit requirements in security-sensitive environments What we offer Benefits package Opportunity to work on advanced infrastructure supporting large-scale AI workloads Real impact on the reliability and scalability of next-generation compute environments Collaboration with experienced engineers across infrastructure, platform, and AI domains A fast-moving environment with space for ownership, technical input, and professional growth

Technology

Link Group

Senior Devops Engineer

Senior

Hybrid

Warsaw, Poland

28,000 - 38,000 PLN

🏢 Summary: Senior DevOps Engineer role focused on owning and evolving cloud-native infrastructure and CI/CD platforms that support large-scale data processing systems. The position combines hands-on engineering and strategic impact to ensure scalable, secure, and reliable production environments. You will design, automate, and optimize platform services enabling efficient delivery of data-driven applications. 🗂️ Requirements: 5+ years in DevOps, SRE, or infrastructure engineering, Experience supporting distributed production systems, Hands-on experience with public cloud platforms, Strong knowledge of containerization and orchestration, Experience with infrastructure as code, Strong scripting or programming skills, Experience building and maintaining CI/CD pipelines, Knowledge of observability practices and tools, Strong troubleshooting and incident response skills in Linux environments 📃 Skills: AWS, Docker, Kubernetes, Terraform, Python, Bash, CI/CD, Linux, Monitoring, Logging, Alerting 🏢 Description: Senior DevOps Engineer We are looking for an experienced engineer to take ownership of our infrastructure and platform ecosystem, supporting large-scale data processing systems and enabling efficient, reliable software delivery. This role combines hands-on engineering with strategic impact — you will design, build, and evolve the platform that underpins data pipelines and production services, ensuring scalability, security, and operational excellence across environments. Key Responsibilities Own and evolve CI/CD and automation platforms to support fast and reliable delivery of data-driven applications Design and manage cloud-native infrastructure supporting high-volume data ingestion, processing, and serving Build and maintain infrastructure as code to ensure consistency and scalability across environments Manage containerized environments and orchestration platforms to deliver resilient and scalable services Implement observability solutions (monitoring, logging, alerting) to ensure full system visibility and reliability Automate deployment processes, configuration management, and system recovery workflows Collaborate with engineering, data, and compliance teams to deliver secure and production-ready solutions Drive incident management practices and continuous improvement initiatives Contribute to platform strategy, tooling decisions, and mentoring within the team Requirements 5+ years of experience in DevOps, SRE, or infrastructure engineering roles Strong experience supporting production systems in distributed environments Hands-on experience with public cloud platforms (AWS or similar) Solid knowledge of containerization and orchestration technologies (Docker, Kubernetes) Experience with infrastructure as code tools (e.g., Terraform) Strong scripting/programming skills (Python, Bash, or similar) Experience building and maintaining CI/CD pipelines and automation tooling Knowledge of observability practices and tools Strong troubleshooting and incident response skills in Linux environments Excellent communication skills and ability to work cross-functionally Nice to Have Experience working with large-scale data platforms Exposure to regulated environments or compliance requirements Experience contributing to platform or engineering standards

Technology

Link Group

Senior Devops Engineer

Senior

Hybrid

Warsaw, Poland

28,000 - 38,000 PLN

🏢 Summary: Senior DevOps Engineer role focused on owning and evolving cloud-native infrastructure and CI/CD platforms supporting large-scale data processing systems. The position combines hands-on engineering with strategic platform development to ensure scalable, secure, and reliable production environments. You will design, automate, and maintain infrastructure and observability solutions across distributed systems. 🗂️ Requirements: 5+ years in DevOps, SRE, or infrastructure engineering, Experience supporting production systems in distributed environments, Hands-on experience with public cloud platforms (AWS or similar), Strong knowledge of Docker and Kubernetes, Experience with infrastructure as code tools (Terraform), Strong scripting/programming skills (Python or Bash), Experience building and maintaining CI/CD pipelines, Knowledge of observability, monitoring, and logging tools, Strong troubleshooting and incident response skills in Linux environments 📃 Skills: AWS, Docker, Kubernetes, Terraform, Python, Bash, Linux, CICD, Observability, Automation, Infrastructure, Cloud 🏢 Description: Senior DevOps Engineer We are looking for an experienced engineer to take ownership of our infrastructure and platform ecosystem, supporting large-scale data processing systems and enabling efficient, reliable software delivery. This role combines hands-on engineering with strategic impact — you will design, build, and evolve the platform that underpins data pipelines and production services, ensuring scalability, security, and operational excellence across environments. Key Responsibilities Own and evolve CI/CD and automation platforms to support fast and reliable delivery of data-driven applications Design and manage cloud-native infrastructure supporting high-volume data ingestion, processing, and serving Build and maintain infrastructure as code to ensure consistency and scalability across environments Manage containerized environments and orchestration platforms to deliver resilient and scalable services Implement observability solutions (monitoring, logging, alerting) to ensure full system visibility and reliability Automate deployment processes, configuration management, and system recovery workflows Collaborate with engineering, data, and compliance teams to deliver secure and production-ready solutions Drive incident management practices and continuous improvement initiatives Contribute to platform strategy, tooling decisions, and mentoring within the team Requirements 5+ years of experience in DevOps, SRE, or infrastructure engineering roles Strong experience supporting production systems in distributed environments Hands-on experience with public cloud platforms (AWS or similar) Solid knowledge of containerization and orchestration technologies (Docker, Kubernetes) Experience with infrastructure as code tools (e.g., Terraform) Strong scripting/programming skills (Python, Bash, or similar) Experience building and maintaining CI/CD pipelines and automation tooling Knowledge of observability practices and tools Strong troubleshooting and incident response skills in Linux environments Excellent communication skills and ability to work cross-functionally Nice to Have Experience working with large-scale data platforms Exposure to regulated environments or compliance requirements Experience contributing to platform or engineering standards

Technology

Link Group

Senior DevOps Engineer – Platform Engineering

Senior

Hybrid

Warsaw, Poland

35,000 - 46,000 PLN

🏢 Summary: Senior DevOps Engineer role focused on building and scaling cloud-ready platform infrastructure for trading and data systems. The position centers on designing automated, reliable CI/CD pipelines and infrastructure as code while supporting Kubernetes environments across AWS and on-prem. The role also drives DevOps and SRE best practices to improve system reliability and delivery speed. 🗂️ Requirements: 6+ years in DevOps or Platform Engineering, Strong programming skills in Python or Go, Hands-on experience with Kubernetes, Experience designing and operating CI/CD pipelines, Experience with Infrastructure as Code using Terraform, Experience with AWS cloud environments, Experience with observability and monitoring tools 📃 Skills: Python, Go, Kubernetes, AWS, Terraform, CI/CD, IaC, Observability, SRE 🏢 Description: Senior DevOps Engineer – Platform Engineering We are looking for an experienced DevOps Engineer to help build and scale the core platforms powering our trading and data systems. In this role, you will design reliable, automated, and cloud-ready infrastructure , while influencing engineering standards across the organization. What you’ll do Build and evolve shared platform services with a focus on scalability and automation Design and operate CI/CD pipelines and infrastructure as code Champion DevOps & SRE best practices , including observability and reliability engineering Support Kubernetes-based environments across cloud (AWS) and on-prem systems Partner with engineering teams to improve service stability and delivery speed What we’re looking for 6+ years in DevOps / Platform Engineering Strong coding skills in Python or Go or other language Hands-on experience with Kubernetes, CI/CD, and IaC (Terraform) Experience working with AWS and modern observability tooling Why join? High impact role shaping core platform architecture Strong engineering culture with real ownership Opportunity to influence how teams build and run production systems

Technology

Link Group

Tech Lead Devops

Senior

Remote

Krakow, Poland

170 - 200 PLN

🏢 Summary: DevOps Tech Lead role focused on leading two teams to design, build, and maintain scalable cloud infrastructure and data platforms on GCP. The position combines hands-on DevOps engineering with architecture oversight, ensuring reliable CI/CD pipelines, infrastructure as code, and secure cloud-native environments. Close collaboration with engineering and business stakeholders is required to deliver robust, production-ready solutions. 🗂️ Requirements: Proven experience in DevOps Engineer or DevOps Lead role, Strong hands-on experience with Google Cloud Platform, Experience designing and maintaining CI/CD pipelines, Advanced knowledge of Terraform and Infrastructure as Code, Experience with cloud-native and container-based environments, Understanding of solution architecture principles, Experience with authentication and identity management systems, Ability to lead technical discussions and guide DevOps teams 📃 Skills: GCP, GitHub, GitHubActions, AzureDevOps, Terraform, CI/CD, CloudRun, IaC, SSO, PingID, Containers 🏢 Description: About the Role We are looking for an experienced DevOps Tech Lead to guide and coordinate the work of two DevOps-focused teams responsible for building and maintaining modern cloud infrastructure and data platforms. In this role, you will combine technical leadership, architecture oversight, and hands-on DevOps expertise to ensure scalable, secure, and reliable delivery pipelines. You will work closely with engineering, data, and business stakeholders to translate requirements into robust technical solutions, while supporting teams in building and maintaining high-quality cloud environments. Key Responsibilities Technical Leadership Provide technical leadership for two DevOps teams working on cloud infrastructure, pipelines, and data platform components. Support engineers in designing scalable and maintainable cloud architectures. Guide teams in implementing best practices in DevOps, CI/CD, and infrastructure automation. Ensure consistency of architecture, tooling, and development standards across teams. DevOps & Cloud Engineering Design and oversee CI/CD pipelines using GitHub , GitHub Actions , and Azure DevOps . Build and maintain infrastructure on Google Cloud Platform , including services such as Google Cloud Run . Implement Infrastructure as Code using Terraform . Ensure reliable, automated deployment processes and maintain DevOps pipelines across environments. Support integration of authentication and identity services such as PingID and SSO solutions. Architecture & Data Platforms Contribute to solution architecture decisions for cloud-native platforms and data-driven systems. Support the development of data pipelines and analytical environments , particularly in domains related to life sciences or bioinformatics. Collaborate with architects and engineering teams to ensure alignment between architecture and delivery. Business & Stakeholder Collaboration Work closely with business analysts and stakeholders to translate business needs into technical solutions. Ensure clear communication between architecture, DevOps, and product teams. Facilitate technical discussions and support decision-making for complex technical challenges. Required Skills & Experience Proven experience in a DevOps Engineer, DevOps Lead, or DevOps Tech Lead role. Strong hands-on experience with Google Cloud Platform . Experience designing and maintaining CI/CD pipelines with GitHub , GitHub Actions , and Azure DevOps . Advanced knowledge of Terraform and Infrastructure as Code practices. Experience working with cloud-native services and container-based environments. Understanding of solution architecture principles . Experience working with authentication and identity management (SSO, PingID or similar tools). Strong problem-solving skills and ability to lead technical discussions. Experience collaborating with business stakeholders and cross-functional teams.

Technology

N-iX

Middle DevOps Engineer (#5068)

Mid

Remote

Krakow, Poland

5,000 - 5,500 USD

🏢 Summary: DevOps Engineer role focused on building, scaling, and securing cloud infrastructure while enabling efficient CI/CD workflows. The position involves managing Kubernetes-based environments on AWS, optimizing automation, and ensuring high availability and performance of systems. The role also includes infrastructure as code, monitoring, database management, and secure authentication integration. 🗂️ Requirements: BA/BS in technical field or equivalent experience, 5+ years in DevOps, SRE, or Infrastructure Engineering, Strong experience with Kubernetes, Deep knowledge of AWS core services, Experience with containerization technologies, Strong understanding of networking concepts, Proficiency with infrastructure-as-code tools, Experience with monitoring tools, Strong scripting or programming skills, Solid understanding of system security best practices 📃 Skills: Kubernetes, AWS, EC2, S3, IAM, RDS, Docker, Helm, Terraform, Pulumi, CloudFormation, Prometheus, Grafana, CloudWatch, Python, Bash, Go, PostgreSQL, SAML, OAuth2, OIDC, ELK, Loki, FluentBit, GitHubActions, Jenkins, CircleCI, Ansible, EKS, GKE 🏢 Description: We are looking for a skilled and driven DevOps Engineer to join our growing team. In this role, you will take ownership of building, maintaining, and scaling the infrastructure that powers our platform. You will ensure our systems are secure, performant, and highly available, while enabling seamless development and deployment workflows. Responsibilities: Design, implement, and manage scalable infrastructure using Kubernetes and AWS. Optimize CI/CD pipelines to improve build and deployment times and reduce friction. Monitor and troubleshoot infrastructure performance and availability. Manage and maintain relational databases, primarily PostgreSQL. Implement and support secure authentication systems using SSO protocols (e.g., SAML, OIDC, OAuth2). Enhance infrastructure as code using tools like Terraform and Ansible. Ensure security best practices are applied across all infrastructure components. Collaborate cross-functionally with development, QA, and product teams. Drive automation of operational tasks to increase team efficiency and reduce manual toil. Required Skills: BA/BS in a technical or engineering discipline or equivalent experience 5+ years of experience in a DevOps, SRE, or Infrastructure Engineering role. Strong experience with Kubernetes (EKS, GKE, or self-managed). Deep knowledge of AWS core services (EC2, S3, IAM, RDS, etc.). Knowledge of containerization technologies (e.g., Docker, Kubernetes, Helm) Solid understanding of networking concepts (VPCs, subnets, routing, firewalls, DNS). Proficiency with infrastructure-as-code tools (Terraform, Pulumi, or CloudFormation). Comfortable with monitoring tools (Prometheus, Grafana, CloudWatch, etc.). Strong scripting or programming ability (Python, Bash, or Go). Solid understanding of system security and best practices. Preferred Skills: Familiarity with SSO protocols such as SAML, OAuth2, and OpenID Connect. Experience managing and tuning PostgreSQL in production environments. Exposure to log aggregation tools (ELK, Loki, or Fluent Bit). Experience with CI/CD tools like GitHub Actions, Jenkins, or CircleCI. We offer*: Flexible working format - remote, office-based or flexible A competitive salary and good compensation package Personalized career growth Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more) Active tech communities with regular knowledge sharing Education reimbursement Memorable anniversary presents Corporate events and team buildings Other location-specific benefits

Technology

emagine Polska

AI DevOps Lead Engineer

Senior

Hybrid

Krakow, Poland

200 - 200 PLN/hr

🏢 Summary: Long-term B2B opportunity for an AI DevOps Lead Engineer to design and manage cloud infrastructure on Azure and GCP, focusing on IaC, CI/CD, and Kubernetes clusters. The role centers on building secure, scalable environments, automating processes, and enhancing monitoring and security frameworks. Hybrid work model with strong emphasis on cloud-native and DevOps best practices. 🗂️ Requirements: 4+ years in DevOps, SRE, or Cloud Engineering, Strong knowledge of Linux/UNIX, Production experience with GCP and/or Azure, Experience managing GKE and/or AKS clusters, Proficiency in Terraform for Infrastructure as Code, Experience building CI/CD pipelines with Jenkins, Scripting skills in Python and/or Bash 📃 Skills: GCP, Azure, GKE, AKS, Terraform, Jenkins, Python, Bash, Linux, Kubernetes, CI/CD, IaC 🏢 Description: Introduction & Summary: We are seeking an accomplished AI DevOps Lead Engineer with a strong foundation in cloud infrastructure management and DevOps practices. The ideal candidate will have over 4 years of experience in DevOps, SRE, or Cloud Engineering roles, with a proven record of designing and implementing robust cloud solutions, particularly on GCP and Azure platforms. Strong expertise in Infrastructure as Code (IaC) and CI/CD pipeline management is imperative, along with scripting capabilities in Python and Bash. What we offer: Long Term B2B Contract Rate: 200 PLN/ H +VAT Hybrid Cracow ( 1-2 times per week) Main Responsibilities: Design, build, and manage core infrastructure on Azure using IaC principles. Administer and enhance GKE and AKS clusters, ensuring security, scalability, and resilience. Evolve Jenkins pipelines and developer tooling for improved automation and efficiency. Implement security controls and automate vulnerabilities remediation. Develop a robust monitoring and alerting framework for operational visibility. Implement security gateways for improved governance. Key Requirements: 4+ years of experience in a DevOps, SRE, or Cloud Engineering role. Deep knowledge of Linux/UNIX operating systems. Production experience with GCP and/or Azure, including management of clusters (GKE, AKS). Strong proficiency with Infrastructure as Code using Terraform. Proven experience with CI/CD pipeline development using Jenkins. Scripting and automation skills in Python and/or Bash. Nice to Have: Experience with monitoring tools like Prometheus, Grafana, and Loki. Familiarity with container management using Docker. Knowledge of additional cloud services and platforms. Other Details: This position offers the opportunity to work in a dynamic environment that fosters innovation and collaboration. Candidates can work remotely or on-site, depending on personal preferences; flexibility in schedule is provided to facilitate a productive work-life balance.

Technology

ITMAGINATION

AI Lead DevOps Engineer

Senior

Remote

Warsaw, Poland

25,575 - 29,450 PLN

🏢 Summary: Remote AI Lead DevOps Engineer role responsible for defining and executing the MLOps and CI/CD strategy for enterprise AI platforms. The position focuses on architecting secure, compliant, and fully automated ML lifecycle governance, ensuring auditability, reproducibility, and large-scale reliability. The role combines technical leadership with hands-on design of cloud-native, DevSecOps-driven AI infrastructure. 🗂️ Requirements: 8–10 years DevOps or Cloud Engineering experience, Minimum 3 years in technical leadership or architect role, Strong knowledge of end-to-end ML lifecycle, Expertise in CI/CD pipeline design and implementation, Advanced Infrastructure as Code experience, Experience with SAST and DAST implementation, Strong IAM and access control management in cloud, Ability to design observability frameworks for ML systems, Experience with configuration management in multi-cloud environments, Knowledge of database scaling and security, Experience implementing model governance and auditability practices 📃 Skills: MLOps, CI/CD, DevSecOps, Azure, AzureDevOps, GitHubActions, Jenkins, Terraform, CloudFormation, SAST, DAST, IAM, Ansible, Puppet, MySQL, PostgreSQL, MongoDB, Observability, Git 🏢 Description: This is a remote position. We are looking for an AI Lead DevOps Engineer to spearhead the MLOps strategy for our high-impact AI accounts. With 8–10 years of experience, you will provide the technical leadership necessary to design robust, compliant, and highly automated AI platforms. You aren't just managing pipelines; you are architect the entire lifecycle governance—ensuring reproducibility, audibility, and security at an enterprise scale. Key Responsibilities: Strategic Leadership: Provide technical direction for the DevOps squad, defining the CI/CD and MLOps roadmap for the account. Model Governance & Evaluation: Implement automated model evaluation pipelines to track accuracy, precision, and recall metrics in production. Enterprise Security: Lead the DevSecOps strategy, ensuring all AI deployments comply with enterprise security standards and global data regulations. Platform Enablement: Architect self-service platforms that allow ML engineers to deploy models with minimal friction while maintaining strict governance guardrails. Auditability & Reproducibility: Ensure that every ML experiment is fully auditable through sophisticated pipeline and dataset versioning strategies. Mentorship: Mentor senior and junior engineers, driving best practices in automation, IaC, and cloud-native architecture. Requirements 8–10 years of experience in DevOps/Cloud Engineering, with at least 3 years in a technical leadership or architect-level role. Deep understanding of the end-to-end ML lifecycle (training, validation, deployment, and retraining loops). Mastery across Azure DevOps, GitHub Actions, and Jenkins. Expert-level Terraform or CloudFormation skills, including modular architecture and cross-account cloud deployments. Significant experience implementing SAST/DAST tools and managing complex IAM/Access Control frameworks in a cloud environment. Ability to design custom observability frameworks that track model drift, pipeline failures, and infrastructure ROI. Advanced knowledge of configuration management tools like Ansible or Puppet for complex multi-cloud environments. Solid understanding of database scaling and security for MySQL, PostgreSQL, and MongoDB. Understanding of how DevOps practices support responsible AI (e.g., bias tracking and audit logs). Exceptional ability to collaborate with Architects and Data Scientists to translate high-level AI needs into operational reality. Native or C1-level English, with the ability to present technical strategies to senior stakeholders. Benefits Professional training programs Work with a team that’s recognized for its excellence. We’ve been featured in the Deloitte Technology Fast 50 & FT 1000 rankings. We’ve also received the Great Place To Work® certification for five years in a row

Technology

B2Bnetwork

Senior DevOps Engineer (API Management Platform)

Senior

Hybrid

Warsaw, Poland

120 - 140 PLN

🏢 Summary: Senior DevOps Engineer role focused on building and developing a modern cloud-based API Management platform. The position involves infrastructure development, automation, CI/CD implementation, Kubernetes platform support, and observability solutions. The engineer will support API Gateway and collaborate on cloud-native platform evolution. 🗂️ Requirements: Several years of experience as DevOps Engineer or similar role, Strong knowledge of Kubernetes, Strong knowledge of Docker, Practical experience with AWS, Experience with Infrastructure as Code using Terraform, Knowledge of Ansible, Experience building and maintaining CI/CD pipelines, Experience with monitoring and observability tools 📃 Skills: Kubernetes, Docker, AWS, Terraform, Ansible, CI/CD, Grafana, Prometheus, VictoriaMetrics, Splunk, Kong, API, CloudNative, IaC 🏢 Description: Poszukujemy osoby na stanowisko Senior DevOps Engineer do projektu związanego z budową i rozwojem nowoczesnej platformy API Management działającej w środowisku chmurowym. Osoba na tym stanowisku będzie odpowiadać za rozwój infrastruktury, automatyzację procesów, wdrażanie rozwiązań observability oraz wsparcie zespołów projektowych w obszarze platform Kubernetes i API Gateway. Zakres obowiązków Projektowanie, rozwój i utrzymanie infrastruktury dla platformy API Management. Provisioning i zarządzanie środowiskami chmurowymi z wykorzystaniem Infrastructure as Code. Automatyzacja procesów administracyjnych i operacyjnych. Tworzenie oraz rozwój pipeline'ów CI/CD. Zarządzanie środowiskami Kubernetes oraz konteneryzacją aplikacji. Implementacja i rozwój rozwiązań monitoringu oraz observability. Migracja istniejących środowisk do nowych platform technologicznych. Współpraca z zespołami deweloperskimi i architektonicznymi przy wdrażaniu nowych usług. Wsparcie rozwoju platformy katalogowania i zarządzania API. Wymagania Minimum kilkuletnie doświadczenie na stanowisku DevOps Engineer lub pokrewnym. Bardzo dobra znajomość Kubernetes i Docker. Praktyczne doświadczenie z AWS. Doświadczenie w tworzeniu i utrzymaniu infrastruktury jako kodu z wykorzystaniem Terraform. Znajomość Ansible oraz automatyzacji procesów operacyjnych. Doświadczenie w budowie i utrzymaniu pipeline'ów CI/CD. Znajomość narzędzi monitoringu i observability (Grafana, Prometheus, Victoria Metrics, Splunk lub podobne). Komunikatywna znajomość języka angielskiego. Mile widziane Doświadczenie z Kong Gateway lub innymi rozwiązaniami API Gateway. Znajomość zagadnień związanych z API Management. Doświadczenie w środowiskach Cloud Native.

Technology

EPAM Systems

Lead Azure DevOps Engineer

Senior

Remote

🏢 Summary: The offer is for a Lead DevOps Engineer to design, implement, and maintain Azure-based DevOps solutions within an agile product team. The role focuses on infrastructure as code, CI/CD pipeline implementation, cloud resource optimization, monitoring, and ensuring security and reliability across Azure environments. It also includes technical leadership and process automation initiatives. 🗂️ Requirements: 5+ years of DevOps experience in agile environments, Expertise in Azure DevOps, Strong experience with Terraform or OpenTofu, Hands-on experience with Datadog, Experience managing Azure resources (Container Apps, App Configuration, Key Vault), Familiarity with Cosmos DB and Data Lake, Knowledge of at least one programming language (e.g., C#), Azure or DevOps certification, English proficiency B2 or higher 📃 Skills: Azure, AzureDevOps, Terraform, OpenTofu, Datadog, CI/CD, ContainerApps, CosmosDB, DataLake, AppConfiguration, KeyVault, C#, IaC 🏢 Description: We are seeking a skilled Lead DevOps Engineer to join our agile product development team and take charge of designing, implementing, and maintaining robust DevOps solutions. This role involves collaborating closely with product owners, front-end, back-end, and other DevOps professionals to ensure seamless integration and high-quality deliverables. Responsibilities Develop and manage infrastructure as code (IaC) using Terraform/Opentofu for Azure (azurerm/azapi) Design and implement Azure Pipelines in Azure DevOps for CI/CD processes Leverage Datadog to monitor and ensure the performance and reliability of applications Optimize and manage Azure resources such as Container Apps, Cosmos DB, Data Lake, App Configuration, and Key Vault Collaborate effectively with agile product teams to align DevOps strategies with business objectives Work cross-functionally to troubleshoot, resolve, and prevent operational issues Ensure a high standard of system security and compliance within Azure environments Provide mentorship and technical guidance to teammates regarding DevOps practices Proactively identify and implement opportunities for process improvements and automation Requirements 5+ years of experience working as a DevOps Engineer in agile environments Expertise in Azure DevOps, Terraform, and Datadog Competency in Azure resources, including Container Apps, App Configuration, and Key Vault Familiarity with additional Azure services like Cosmos DB and Data Lake Understanding of at least one programming language such as C# Capability to work effectively within cross-functional teams and adapt to changing workflows Strong problem-solving and communication skills Relevant certification in Azure or DevOps practices English proficiency at B2 level or higher We offer/Benefits We gather like-minded people: Engineering community of industry professionals Friendly team and enjoyable working environment Flexible schedule and opportunity to work remotely within Poland Chance to work abroad for up to 60 days annually Business-driven relocation opportunities We provide growth opportunities: Outstanding career roadmap Leadership development, career advising, soft skills, and well-being programs Certification (GCP, Azure, AWS) Unlimited access to LinkedIn Learning, Get Abstract, Cloud Guru English classes We cover it all: Stable income (Employment Contract or B2B) Participation in the Employee Stock Purchase Plan Benefits package (health insurance, multisport, shopping vouchers) Strategically located offices featuring entertainment and relaxation zones, table tennis and football, free snacks, fantastic coffee, and more Referral bonuses Corporate, social and well-being events Please, note: The set of bonuses might vary based on the role you apply for – specifics will be discussed with our recruiter during the general interview. We will reach out to selected candidates exclusively. EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.

ALTER GPU CENTER

ALTER GPU CENTER is a technology-focused company operating in the software development and computing sector, with an emphasis suggested by its name on GPU-related or high-performance computing solutions. The company develops modern web-based and backend systems using technologies such as Python, Django, microservices, and data-driven architectures. It promotes a culture of innovation, continuous learning, and collaboration, encouraging adherence to software development best practices and clean code principles. ALTER GPU CENTER supports professional growth, engagement with industry trends, and participation in tech events, while fostering a flexible and team-oriented work environment, including remote collaboration.

Check if your resume is ATS-ready before applying →Build an ATS-optimized resume