Production Systems Engineer, AI Systems

Remote · USA · Full-time

Meta is seeking a Systems Engineer to join our Release to Production (RTP) team, working on the Meta Training and Inference Accelerator (MTIA) program as part of the AI/ML initiatives supporting large-scale AI training and inference. Our servers and data centers are the foundation upon which our rapidly scaling infrastructure operates efficiently to deliver our innovative services.

The RTP team is responsible for the end-to-end hardware lifecycle of all Meta servers, including prototyping experimental hardware, hands-on pre-production system and hardware debugging and stress testing, and enabling production-ready system monitoring, automated provisioning, and automated remediation of issues. The RTP team also helps explore, develop, and productize high-performance software and hardware technologies for AI at datacenter scale.

RTP engineers work closely with a wide range of cross-functional partners, including HW/SW co-design teams, hardware designers, networking teams, system manufacturers, component vendors, capacity engineering, production engineering, production services, and data center operations teams, to enable new systems that will be deployed in our production data centers.

We are looking for a candidate to work on scale-up and scale-out network technologies (e.g., RDMA NICs) for the MTIA systems that are powering Meta’s tremendous leaps in the AI space. The ideal candidate is knowledgeable about network protocols (TCP/IP, RDMA) and has hands-on experience driving post-silicon validation for networking platforms, all the way through mass production and deployment.
