ML Research Scientist

Festanstellung, Vollzeit · Dresden, DE (primary site), Austin, TX (office opening Q3 2025)

About the Role
As an ML Research Engineer at SEMRON, you will design the algorithms and quantization schemes that unlock efficient, high-accuracy inference on our analog in-memory compute platform. Your work will bridge cutting-edge quantization research, mathematical modeling, and hardware-aware algorithm design, ensuring that deep neural networks execute with maximal accuracy and throughput on our custom silicon.
What you will do:
  • Research and develop novel analog-aware quantization methods (PTQ and QAT) tailored to in-memory compute constraints
  • Design mathematically principled matrix-vector multiplication algorithms that exploit sparsity, noise resilience, and non-idealities to improve hardware efficiency
  • Collaborate with analog hardware engineers to define algorithmic requirements and guide co-development of compute primitives

What you should bring in:
  • PhD or equivalent research experience in machine learning, applied mathematics, or a related field
  • Strong understanding of quantization, model optimization, and numerical methods for DNNs
  • Proficiency in Python and PyTorch, with the ability to rapidly prototype and evaluate research ideas
  • A research mindset: curiosity, rigor, and the ability to explore and discard ideas efficiently
Helpful but not required:
  • Contributions to quantization libraries or novel compression methods
  • Publications in top-tier ML venues (NeurIPS, ICLR, ICML, etc.)
  • Familiarity with analog computation challenges (noise, nonlinearity, limited precision, etc.) and the ability to abstract them into robust algorithms
  • Experience collaborating with hardware teams or formulating algorithm-hardware co-design strategies
Why us?
We’re building at the intersection of math, hardware, and machine learning, pushing the boundaries of what's possible in compute. If you’ve implemented your own MVM kernels just to see what happens, trained quantized models for fun, or love thinking deeply about efficiency, sparsity, and how to make models run faster and better, you’ll feel right at home. As a small, technical team, early work defines the future of the stack, and we treat it that way. You'll own critical pieces of what we build, with equity to match. No hierarchy, no bureaucracy, just ideas, experiments, and real impact. You’ll grow as fast as you can grow.
About us
At SEMRON, we’re redefining what’s possible in AI hardware. Our core innovation lies in analog in-memory computing for deep neural network acceleration, enabling us to build compute architectures that scale vertically into the third dimension, much like NAND flash revolutionized memory. This leap in physical density allows us to deploy models with billions of parameters on chip areas as small as a few square millimeters.
Wir freuen uns auf Dich!
Wir freuen uns über dein Interesse an SEMRON. Bitte fülle das folgende kurze Formular aus. Solltest du Schwierigkeiten mit dem Upload deiner Daten haben, wende dich gerne per E-Mail an contact@semron.net.
Dokument wird hochgeladen. Bitte warten Sie.
Fügen Sie alle erforderlichen (mit einem * gekennzeichneten) Angaben hinzu, um Ihre Bewerbung abzusenden.