MTS - Inference
- Salary: $300,000
- Location: Bay Area
- Type: Permanent
What We're Building:
We're entering an exciting new phase focused on collaborating with commercial partners to adapt and fine-tune our cutting-edge AI models for specific business needs. Our track record of developing, aligning, and deploying state-of-the-art models for a high-EQ consumer-facing chatbot gives us a strong foundation for this work. Backed by robust infrastructure, efficient fine-tuning pipelines, and ample H100 capacity, this role offers a unique opportunity to contribute to innovation in a collaborative environment.
About Us:
We are a small, interdisciplinary AI studio that has trained several state-of-the-art language models and developed a popular personal assistant chatbot. Our focus is now on fine-tuning and deploying models for enterprise-specific use cases in partnership with commercial clients. As a public benefit corporation, we prioritize the well-being of our partners, users, and the broader community.
About the Role:
Member of Technical Staff, Research Engineer (Inference)
This role is central to deploying high-performance models for enterprise applications. As part of the inference team, research engineers optimize the model serving stack to reduce latency and improve throughput while keeping enterprise deployments robust and reliable.
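To make the latency/throughput trade-off concrete, here is a minimal PyTorch sketch of the kind of measurement this work involves. The `benchmark` and `make_batch` names, the batch sizes, and the step counts are illustrative assumptions, not part of our actual stack.

```python
import time
import torch

@torch.inference_mode()
def benchmark(model, make_batch, batch_sizes=(1, 8, 32), steps=20):
    """Hypothetical helper: per-request latency vs. throughput by batch size."""
    results = {}
    for bs in batch_sizes:
        batch = make_batch(bs)
        for _ in range(3):  # warm-up: lazy init and kernel autotuning
            model(batch)
        if torch.cuda.is_available():
            torch.cuda.synchronize()
        start = time.perf_counter()
        for _ in range(steps):
            model(batch)
        if torch.cuda.is_available():
            torch.cuda.synchronize()
        elapsed = time.perf_counter() - start
        results[bs] = {
            "latency_s": elapsed / steps,             # time per forward pass
            "throughput_rps": bs * steps / elapsed,   # requests served per second
        }
    return results
```

Larger batches typically raise throughput at the cost of per-request latency, and tuning that trade-off per deployment is a core part of the job.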
Ideal Candidate Will Have:
- Experience deploying and optimizing LLMs for inference in both cloud and on-prem environments
- Proficiency with model optimization frameworks like ONNX, TensorRT, or TVM (a minimal export sketch appears after this list)
- Strong problem-solving skills for complex model performance and scaling issues
- Deep understanding of model inference trade-offs, including hardware constraints and real-time processing requirements
- Strong PyTorch skills and familiarity with infrastructure tools like Docker and Kubernetes for inference pipelines
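As a rough illustration of the export path mentioned above, the sketch below converts a stand-in PyTorch module to ONNX with a dynamic batch axis and sanity-checks it under ONNX Runtime. The tiny `torch.nn.Linear` model, the `model.onnx` file name, and the shapes are placeholder assumptions, not our production setup.

```python
import torch
import onnxruntime as ort

# Stand-in module; a real production model would replace this (assumption).
model = torch.nn.Linear(768, 768).eval()
example_input = torch.randn(1, 768)

# Export with a dynamic batch axis so the serving runtime can choose the
# batch size at deploy time rather than at export time.
torch.onnx.export(
    model,
    example_input,
    "model.onnx",
    input_names=["input"],
    output_names=["output"],
    dynamic_axes={"input": {0: "batch"}, "output": {0: "batch"}},
    opset_version=17,
)

# Sanity-check the exported graph with ONNX Runtime on CPU.
session = ort.InferenceSession("model.onnx", providers=["CPUExecutionProvider"])
(output,) = session.run(["output"], {"input": example_input.numpy()})
print(output.shape)  # (1, 768)
```

From there, a TensorRT or TVM build step would typically consume the same ONNX artifact for hardware-specific optimization.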