Introduction
At IBM Infrastructure & Technology, we design and operate the systems that keep the world running. From high-resiliency mainframes and hybrid cloud platforms to networking, automation, and site reliability, our teams ensure the performance, security, and scalability that clients and industries depend on every day. Working in Infrastructure & Technology means tackling complex challenges with curiosity and collaboration. You’ll work with diverse technologies and colleagues worldwide to deliver resilient, future-ready solutions that power innovation. With continuous learning, career growth, and a supportive culture, IBM provides the opportunities to build expertise and shape the infrastructure that drives progress.
Your Role And Responsibilities
As an AI Back-End Developer specializing in Artificial Intelligence for IBM Z, you will work in an agile, collaborative environment to design, code, and deliver open-source solutions for AI frameworks. You will employ IBM's Design Thinking to create solutions that provide a great user experience along with world-class quality, resiliency, performance, security, and stability. Your primary responsibilities will include:
Design and Develop Solutions: Design, code, debug, test, and deliver creative solutions to problems and requirements in AI frameworks.
Collaborate with Teams: Work in an agile, collaborative environment to understand requirements and deliver high-quality solutions that meet client needs.
Apply Design Thinking: Employ IBM's Design Thinking to create solutions that provide a great user experience along with world-class quality, resiliency, performance, security, and stability.
Drive Innovation: Work on leading-edge projects using technologies like machine learning, deep learning, and other AI-related areas to drive innovation and solution delivery.
Ensure Solution Quality: Ensure that solutions meet the required standards for quality, resiliency, performance, security, and stability.
Required Technical And Professional Expertise
AI Development Experience: Experience designing, coding, debugging, and testing PyTorch core and internals.
Experience contributing to and extending PyTorch internals and backend architecture, including:
Optimizing model execution paths for training and inference.
Open-source contributions to PyTorch or related projects
Experience with proprietary or custom AI accelerators
Experience integrating AI frameworks with custom or proprietary accelerators
Advanced skills in C++ and Python
Experience with GPU programming or accelerator SDKs
Advanced Python skills and strong debugging ability
DevOps Methodologies: Experience working with DevOps methodologies to deliver high-quality solutions for AI frameworks, ensuring world-class quality, resiliency, performance, security, and stability.
Preferred Technical And Professional Experience
Understanding of:
AI runtimes
Memory hierarchies
Parallel execution models
PyTorch distributed runtime
Parameter sharding and memory management
Hands-on experience with torch.compile and TorchInductor
Design Thinking Application: Experience employing IBM's Design Thinking to create solutions that provide a great user experience, with a focus on delivering innovative and effective solutions.
Experience contributing to leading-edge projects, driving innovation and solution delivery in core backend AI areas.