We are seeking a skilled Software Engineer to join our System Software Team focused on optimizing and maintaining the core infrastructure of our data center environments. You will work on enhancing operating system performance, scalability, and reliability, ensuring seamless operations for large-scale distributed systems.
KEY RESPONSIBILITIES:
- Develop, maintain, and optimize OS-level components for data center infrastructure.
- Collaborate with cross-functional teams to improve performance, security, and resource efficiency across distributed systems.
- Troubleshoot and resolve low-level OS issues, networking bottlenecks, and hardware-software integration challenges.
- Contribute to the development of automation, monitoring, and diagnostic tools to improve system reliability.
- Participate in code reviews, design discussions, and architectural decisions related to OS-level services.
- Stay up to date with emerging OS technologies, virtualization, and containerization trends in data center environments.
- Develop and optimize accelerator passthrough, sharing, and virtualization mechanisms for AI/ML workloads.
- Improve hypervisor-level support (KVM, Xen, VMware, QEMU) for virtualized AI accelerators.
- Enhance performance, isolation, and resource allocation for accelerators in virtualized environments.
If you’re passionate about low-level system engineering and optimizing OS performance at scale, we’d love to hear from you!