All roles

Senior/Staff AI Engineer

Remote · USA Full-time New today

Job Description:

  • Build and optimize LLM serving and inference systems for production environments
  • Improve performance across GPU and CPU pathways
  • Work on KV cache, memory, storage, and throughput bottlenecks
  • Design and scale systems that support RAG and retrieval-heavy AI workloads
  • Contribute to infrastructure where storage architecture and systems efficiency materially affect AI performance
  • Solve engineering problems at the intersection of AI, high-performance systems, and distributed infrastructure

Requirements:

  • An engineer who has spent meaningful time building or optimizing production AI systems, not just experimenting with models
  • Someone who understands how inference performance is shaped by the interaction between compute, memory, storage, and serving architecture
  • Deep hands-on experience working close to the systems layer — for example, improving how workloads run across GPU and CPU resources, reducing bottlenecks, or tuning infrastructure for better throughput and latency
  • Evidence of real ownership in areas like model serving, retrieval, caching, storage, or distributed performance, rather than purely application-layer AI work
  • The ability to move comfortably between architecture decisions and hands-on implementation, especially in environments where efficiency and scale matter
  • A background that suggests you can operate in technically demanding environments, whether that comes from AI infrastructure, high-performance systems, storage platforms, or adjacent distributed systems work
  • PhD preferred, but far less important than having built serious systems in the real world.

Benefits: Apply tot his job Apply To this Job

Related roles

Senior Machine Learning Engineer- Ads Personalization

Remote · USA Full-time

Senior Machine Learning Engineer - Scan, Match and Catalog

Remote · USA Full-time

Staff Machine Learning Engineer - Content and Contributor Intelligence (Remote - United States)

Remote · USA Full-time

Machine Learning Engineer - LLM Evaluation & Automation

Remote · USA Full-time

Edge AI Engineer

Remote · USA Full-time

Lead Machine Learning Engineer - Remote (US) or CA - Only W2

Remote · USA Full-time

ML/AI Engineer - Junior Level

Remote · USA Full-time

FPGA AI/ML Engineer – Part Time

Remote · USA Full-time

Temporary Micro-Credential Grader – Industry-Focused Prompt Engineering for ROI-Driven Results

Remote · USA Full-time

English Prompt Engineer: LLM Migration & Optimization

Remote · USA Full-time

Math Teaching Assistant

Remote · USA Full-time

Experienced Accounting & Data Entry Clerk - Remote Travel Concierge Service

Remote · USA Full-time

IT Process Manager

Remote · USA Full-time

Global Talent Community

Remote · USA Full-time

Experienced Customer Service Representative – Work from Home Opportunity at arenaflex

Remote · USA Full-time

Experienced Remote UPS Data Entry Specialist – Earn $1800 Weekly at arenaflex

Remote · USA Full-time

Experienced Customer Support Representative – Delivering Exceptional Service to Apple Customers Worldwide (Remote)

Remote · USA Full-time

Senior Infrastructure Engineer, Government Systems

Remote · USA Full-time

Experienced Data Entry Specialist – Remote Opportunity with arenaflex

Remote · USA Full-time

Snowflake Data Engineer - French Speaker

Remote · USA Full-time