Member of Technical Staff
- $250,000
- Palo Alto, CA
- Permanent
Shape the Future of Conversational AI
About Us
We are a public benefit corporation dedicated to harnessing advanced large language models to create an AI platform tailored for enterprise needs, with a particular focus on conversational AI. Our team is composed of friendly, innovative, and collaborative individuals committed to developing impactful AI solutions.
About the Role: Research Engineer (Inference)
As a key contributor to our effort to deploy high-performance models for enterprise applications, you will join our inference team, which ensures these models run efficiently and reliably in real-world settings. Research engineers optimize model inference, minimize latency, and maximize throughput while preserving model quality, enabling robust deployment for enterprise customers.
Key Responsibilities
- Deploy and optimize large language models (LLMs) for inference in both cloud and on-premises environments.
- Utilize model optimization and acceleration tools and frameworks, such as ONNX, TensorRT, or TVM.
- Tackle complex challenges related to model performance and scalability.
- Understand the trade-offs involved in model inference, including hardware limitations and real-time processing needs.
- Work proficiently in PyTorch and use infrastructure tools such as Docker and Kubernetes to deploy inference pipelines.
What We Are Looking For
If you have a strong background in deploying and optimizing LLMs, enjoy solving intricate problems, and deeply understand the challenges of model inference, we would love to hear from you! Join us in building impactful enterprise AI solutions that will shape the future.