Senior Machine Learning Researcher, On-Device Optimization

HP


Date: 2 days ago
City: Spring, TX
Salary: $150,000 - $250,000 per year
Contract type: Full time
Who We Are

HP IQ is HP’s new AI innovation lab. Combining startup agility with HP’s global scale, we’re building intelligent technologies that redefine how the world works, creates, and collaborates.

We’re assembling a diverse, world-class team—engineers, designers, researchers, and product minds—focused on creating an intelligent ecosystem across HP’s portfolio. Together, we’re developing intuitive, adaptive solutions that spark creativity, boost productivity, and make collaboration seamless.

We create breakthrough solutions that make complex tasks feel effortless, teamwork more natural, and ideas more impactful—always with a human-centric mindset.

By embedding AI advancements into every HP product and service, we’re expanding what’s possible for individuals, organisations, and the future of work.

Join us as we reinvent work, so people everywhere can do their best work.

About The Role

As a Machine Learning Researcher, you’ll focus on advancing the state-of-the-art in on-device AI optimization. This role bridges applied research and product development, with a heavy focus on techniques like quantization, pruning, and efficient model representation. You’ll bring academic expertise into real-world systems that power intelligent assistants running directly on HP laptops and edge devices.

What You Might Do

  • Research and implement model compression techniques including quantization, low-rank factorization, distillation, and pruning
  • Develop methods to deploy SOTA transformer and vision models on-device under hardware constraints
  • Lead investigations into hardware-aware training strategies to optimize latency, throughput, and memory usage
  • Collaborate with software engineers and system architects to integrate models into AI companion apps
  • Evaluate and benchmark different frameworks and quantization strategies (e.g., AWQ, GPTQ, SmoothQuant)

Essential Qualifications

  • PhD in Computer Science, Electrical Engineering, or related field with focus on efficient ML, systems ML, or compiler design for ML
  • 2+ years of industry or applied research experience
  • Strong background in model optimization for edge computing or mobile/embedded deployment
  • Familiarity with PyTorch, ONNX, TensorRT, OpenVINO, QNN, or Llama.cpp
  • Understanding of tradeoffs in asymmetric/symmetric quantization, calibration methods, and inference tuning

Preferred Skills

  • Experience publishing at top ML/Systems conferences (e.g., NeurIPS, ICML, MLSys)
  • Familiarity with embedded ML for consumer devices
  • GPU and system-level profiling tools (e.g., CUDA, nvprof, perf)
  • Contributions to open-source ML optimization frameworks

The pay range for this role is $150,000 to $250,000 USD annually with additional opportunities for pay in the form of bonus and/or equity (applies to United States of America candidates only). Pay varies by work location, job-related knowledge, skills, and experience.

Benefits

HP offers a comprehensive benefits package for this position, including:

  • Health insurance
  • Dental insurance
  • Vision insurance
  • Long term/short term disability insurance
  • Employee assistance program
  • Flexible spending account
  • Life insurance
  • Generous time off policies, including;
  • 4-12 weeks fully paid parental leave based on tenure
  • 11 paid holidays
  • Additional flexible paid vacation and sick leave (US benefits overview)

The compensation and benefits information is accurate as of the date of this posting. The Company reserves the right to modify this information at any time, with or without notice, subject to applicable law.

How to apply

To apply for this job you need to authorize on our website. If you don't have an account yet, please register.

Post a resume