Research Engineer

GW03010525
  • $180,000-$250,000
  • Palo Alto
  • Permanent

Shape the Future of Conversational AI

About Us

We are a public benefit corporation dedicated to harnessing advanced large language models to create an AI platform tailored for enterprise needs, with a particular focus on conversational AI. Our team is composed of friendly, innovative, and collaborative individuals committed to developing impactful AI solutions.

About the Role: Research Engineer (Inference)

As a member of our inference team, you will play a key role in our commitment to deploying high-performance models for enterprise applications, ensuring that these models operate efficiently and effectively in real-world scenarios. You will optimize model inference processes, minimize latency, and maximize throughput while preserving model quality, all to ensure robust deployment in enterprise settings.

Key Responsibilities

  • Deploy and optimize large language models (LLMs) for inference in both cloud and on-premises environments.
  • Utilize model optimization and acceleration tools and frameworks, such as ONNX, TensorRT, or TVM.
  • Tackle complex challenges related to model performance and scalability.
  • Evaluate the trade-offs involved in model inference, including hardware limitations and real-time processing requirements.
  • Apply proficiency in PyTorch, using infrastructure tools such as Docker and Kubernetes to deploy inference pipelines.

What We Are Looking For

If you have a strong background in deploying and optimizing LLMs, enjoy solving intricate problems, and have a deep understanding of model inference challenges, we would love to hear from you! Join us in building impactful enterprise AI solutions that will shape the future.

Victor Pascoe, ML Research & Engineering Recruiter

Apply for this role