Skip to main content
Celestica

Remote Principal AI/ML Server System Architect

1w

Celestica

US · Full-time · $220,000 – $300,000

About this role

This position is for a Principal Engineer AI/ML Server System Architect. Define the architecture of leading and competitive AI systems, lead new technology research, study market trends, interface with customers, and lead the design team in AI/ML infrastructure. Primarily hardware-focused with strong knowledge of AI training, inference workloads, compute, connectivity, management, power, and cooling subsystems.

Define complex AI/ML infrastructure platform solutions and lead multi-disciplined teams through EVT to implementation, ensuring design quality metrics. Develop close technical relationships with suppliers on product roadmaps and technologies. Lead RFQ technical responses, platform concept development, trade-off analysis, and competitive guidance.

Act as primary technical interface with customers, especially in escalations, and provide support to business development for winning new business. Participate in hiring design team resources, process improvements, standards committees, and industry groups. Work in a dynamic team environment with minimal supervision and strong interpersonal liaison.

Prepare and present technical presentations to customers, industry groups, and tradeshows. Maintain current on industry trends, future technologies, and market opportunities. Engage extensively through EVT, phasing out thereafter while delivering SRDs, compliance assessments, and technical proposals.

Requirements

  • Minimum of 10 years server and/or AI/ML accelerator hardware design experience
  • Minimum of 5 years of AI infrastructure system design experience
  • Experience with Nvidia, AMD, and/or Intel GPU accelerators and support systems
  • Successful experience architecting and delivering server/storage/converged products to market
  • Excellent customer presentation skills and ability to interface with large OEM customers
  • Broad knowledge of current digital and analogue components
  • Successful New Product Introduction (NPI) launch experience
  • Extensive knowledge of system level power, cooling, mechanical, SW/FW and electrical design integration and trade-offs

Responsibilities

  • Define complex AI/ML infrastructure platform solutions and lead multi-disciplined teams to implementation through EVT
  • Develop and maintain close technical relationships with suppliers/partners on product roadmaps and technologies
  • Lead RFQ technical responses and Celestica platform concept development efforts
  • Provide effective trade-off analysis and competitive guidance to engineering teams and customers
  • Act as primary technical interface between Celestica and customers, particularly in technical escalations
  • Provide technical support to business development teams to win new business
  • Prepare and present technical presentations to customers, industry groups, and tradeshows
  • Maintain current on industry trends and future technology and market opportunities

Benefits

  • Fully remote position for US employees
  • International travel required, approximately 4-6 trips per year
  • Domestic travel to support customers and internal teams