AI/Machine Learning Research Engineer (ML System, Inference …, Markham
AI/Machine Learning Research Engineer (ML System, Inference …, Markham
-
Markham I3P, Canada
-
Posted: less than a week ago
-
Save
Description
General Summary
As a member of the Low Power AI Solution team, you will conduct advanced research on model efficiency, model compression techniques, and ML system optimization to push the boundaries of efficient on‑device inference. You will lead and contribute to high‑impact research initiatives, understand hardware–software interactions at a fundamental level, and collaborate with global teams to develop systems that shape future Qualcomm AI accelerator capabilities.Key Responsibilities
Conduct cutting‑edge research in inference efficiency and ML system optimization: efficient architecture design, model compression, PEFT, compiler stack optimization, etc. Prototype and develop system solutions with software–hardware co‑design to align architectural choices, dataflows, and memory behavior with Qualcomm’s low‑power AI accelerators for optimal model deployment.Collaborate closely with modeling, compiler, and hardware teams to convert research into production‑ready low power AI solutions, enabling real‑world applications and commercial impact. Influence future accelerator features and model deployment and contribute to Qualcomm’s strategic initiatives in efficient AI and embedded intelligence.Requirements
Proven research excellence on inference efficiency and ML system, demonstrated by publications, community contributions, or equivalent evidence of impact. Deep expertise in neural network architectures, model compression (e.g., quantization, pruning, knowledge distillation) and efficient inference algorithm.Strong background on compiler stack and ML system optimization for AI accelerators (e.g., graph transformation, graph tiling and scheduling, tensor layout/memory optimization). Strong understanding of Machine Learning fundamentals, strong programming skills with ML frameworks. Hands‑on experience with model development pipelines for AI accelerator, including training, fine‑tuning, evaluation, and performance optimization.Preferred Qualifications
PhD in Computer Science, Electrical Engineering, or related fields or MS with 3+ years of AI research, or related work experience. Extensive experience in deep learning research and impactful publications in top‑tier machine learning venues (NeurIPS, ICML, ICLR, CVPR, ICCV, ACL, EMNLP, etc.). Experience in on‑device model deployment and optimization algorithms for AI hardware accelerators.Experience working with a variety of stakeholders and ability to communicate complex outcomes to a wide range of audiences. Minimum Qualifications
Bachelor’s degree in Computer Science, Engineering, Information Systems, or related field and 2+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience. Master’s degree in Computer Science, Engineering, Information Systems, or related field and 1+ year of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.PhD in Computer Science, Engineering, Information Systems, or related field. Pay Range and Other Compensation&Benefits
$114,400.00 - $164,400.00 The above pay scale reflects the broad, minimum to maximum, pay scale for this job code for the location for which it has been posted. Salary is only one component of total compensation. We also offer a competitive annual discretionary bonus program and opportunity for annual RSU grants. Our highly competitive benefits package is designed to support your success at work, at home, and at play.Equal Opportunity Employer
Qualcomm is an equal opportunity employer. Qualcomm provides reasonable accommodations to support individuals with disabilities.
#J-18808-Ljbffr
As a member of the Low Power AI Solution team, you will conduct advanced research on model efficiency, model compression techniques, and ML system optimization to push the boundaries of efficient on‑device inference. You will lead and contribute to high‑impact research initiatives, understand hardware–software interactions at a fundamental level, and collaborate with global teams to develop systems that shape future Qualcomm AI accelerator capabilities.Key Responsibilities
Conduct cutting‑edge research in inference efficiency and ML system optimization: efficient architecture design, model compression, PEFT, compiler stack optimization, etc. Prototype and develop system solutions with software–hardware co‑design to align architectural choices, dataflows, and memory behavior with Qualcomm’s low‑power AI accelerators for optimal model deployment.Collaborate closely with modeling, compiler, and hardware teams to convert research into production‑ready low power AI solutions, enabling real‑world applications and commercial impact. Influence future accelerator features and model deployment and contribute to Qualcomm’s strategic initiatives in efficient AI and embedded intelligence.Requirements
Proven research excellence on inference efficiency and ML system, demonstrated by publications, community contributions, or equivalent evidence of impact. Deep expertise in neural network architectures, model compression (e.g., quantization, pruning, knowledge distillation) and efficient inference algorithm.Strong background on compiler stack and ML system optimization for AI accelerators (e.g., graph transformation, graph tiling and scheduling, tensor layout/memory optimization). Strong understanding of Machine Learning fundamentals, strong programming skills with ML frameworks. Hands‑on experience with model development pipelines for AI accelerator, including training, fine‑tuning, evaluation, and performance optimization.Preferred Qualifications
PhD in Computer Science, Electrical Engineering, or related fields or MS with 3+ years of AI research, or related work experience. Extensive experience in deep learning research and impactful publications in top‑tier machine learning venues (NeurIPS, ICML, ICLR, CVPR, ICCV, ACL, EMNLP, etc.). Experience in on‑device model deployment and optimization algorithms for AI hardware accelerators.Experience working with a variety of stakeholders and ability to communicate complex outcomes to a wide range of audiences. Minimum Qualifications
Bachelor’s degree in Computer Science, Engineering, Information Systems, or related field and 2+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience. Master’s degree in Computer Science, Engineering, Information Systems, or related field and 1+ year of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.PhD in Computer Science, Engineering, Information Systems, or related field. Pay Range and Other Compensation&Benefits
$114,400.00 - $164,400.00 The above pay scale reflects the broad, minimum to maximum, pay scale for this job code for the location for which it has been posted. Salary is only one component of total compensation. We also offer a competitive annual discretionary bonus program and opportunity for annual RSU grants. Our highly competitive benefits package is designed to support your success at work, at home, and at play.Equal Opportunity Employer
Qualcomm is an equal opportunity employer. Qualcomm provides reasonable accommodations to support individuals with disabilities.
#J-18808-Ljbffr
Highlights
-
Company nameQualcomm
-
Job positionAI/Machine Learning Research Engineer (ML System, Inference Efficiency), Senior/Staff Engineer [...]
Safety Tips
Be careful: if it seems too good to be true, it most likely is.
More info about this ad
AI/Machine Learning Research Engineer (ML System, Inference … has been posted in the Markham Engineering category on Locanto.
If you’re looking for something similar, check out Engineering Manager, Greater Toronto Area, Lead ML Engineer, Greater Toronto Area or Project Engineer Manager - Building Restoration in Greater Toronto Area, also posted in Engineering. Currently, there are 9 ads posted in the Engineering category in Markham.
Interested in more? Widen your search to view ads in nearby areas of Markham. This includes Engineering in Richmond Hill, Pickering and Whitchurch-Stouffville. There are more ads within a 15 km radius for this category. If you want to view those ads, click here.