$172,000 - $230,000 Posted: 5 hours ago
Job Description
<p><strong><span ><span >Huawei Canada has an immediate permanent opening for a Distinguished Engineer - AI Computing System</span></span></strong></p><p ></p><p><strong><span ><span >About the team:</span></span></strong></p><p><span >The Advanced Computing and Storage Lab, currently part of the Vancouver Research Centre, explores adaptive computing system architectures to address the challenges posed by the flexible and variable application loads of the future. It helps ensure the stability and quality of training clusters, builds dynamic cluster configuration strategy solvers, and establishes precision control systems to create stable and efficient compute clusters. The lab also focuses on key industry AI application scenarios such as large model training and inference. Building on key technologies such as low-precision training, multi-modal training, and reinforcement learning, it is responsible for bottleneck analysis and for the design and development of optimization solutions that improve training and inference performance and usability.</span></p><p ></p><p><strong><span ><span >About the job:</span></span></strong></p><ul><li><p><span >As a leading industry expert in training cluster software frameworks and technologies, gain insight into how the industry's large model training frameworks and their key features are evolving. 
Plan and lay out AI frameworks and software features for scenarios such as large model pre-training, post-training, and integrated training and inference, building key capabilities for the company's training cluster software framework.</span></p></li><li><p><span >Focusing on the company's large model training optimization, lead the team in building key technologies such as low-precision training, parallel strategy tuning, and training resource optimization, and promote the commercial deployment of technologies related to large model perception optimization.</span></p></li><li><p><span >Focusing on the company's training servers, super nodes, and other products, lead the team in building large model AI training frameworks, operator libraries, acceleration libraries, and other software frameworks and acceleration features, fully leveraging systems engineering and software-hardware collaboration capabilities to improve AI cluster computing efficiency.</span></p></li><li><p><span >Identify high-quality academic resources in large model training, collaborate with domain experts and scholars on projects, plan related standards and patents, support the company's continuous innovation in the training cluster field, and build long-term competitiveness in AI training clusters.</span></p></li><li><p><span >Cultivate a team of technical experts and key technical contributors in AI training cluster frameworks and software optimization.</span><strong><span > </span></strong></p></li></ul><p ></p><p><span >The base salary for this position ranges from $172,000 to $230,000, depending on education, experience, and demonstrated expertise.</span></p>