All roles

Python Engineer, AI

Remote · USA Full-time New today
Software Engineer, AI

Train large-language models (LLMs) to write production-grade code:

  • Compare & rank multiple code snippets, explaining which is best and why.

  • Repair & refactor AI-generated code for correctness, efficiency, and style.

  • Inject feedback (ratings, edits, test results) into the RLHF pipeline and keep it running smoothly. End result: the model learns to propose, critique, and improve code the way you do.

RLHF in one line

Generate code ➜ expert engineers rank, edit, and justify ➜ convert that feedback into reward signals ➜ reinforcement learning tunes the model toward code you’d actually ship.

What is Needed

  • 4+ years of professional software-engineering experience using Python and Constraint.

  • Extreme attention to detail and excellent writing skills—most of the job is explaining why one solution is better than another. This requirement cannot be overstated!

  • You actually enjoy reading documentation and specs.

  • Proven ability to thrive in a fully asynchronous, low-oversight remote environment.

  • Strong code-review instincts: can spot logic errors, performance traps, and security issues quickly.

What is Not Needed
  • No prior RLHF or AI-training experience required.

  • You don’t need deep machine-learning knowledge—if you can review code and explain your reasoning, we’ll teach you the RLHF bits.

Logistics

  • Location: Fully remote (work from anywhere).

  • Hours: Minimum 15 hrs/week with the ability to work up to 40 hours per week

  • Engagement: 1099 contract

Straightforward impact, zero fluff. If this fits your profile, apply here.

Apply to this Job

Related roles

Product Engineer

Remote · USA Full-time

Senior FullStack Engineer (Creator Team)

Remote · USA Full-time

Senior Account Executive, Data Solutions

Remote · USA Full-time

Solutions Engineer

Remote · USA Full-time

Senior Software Engineer (Platform)

Remote · USA Full-time

Account Manager, Mid-Market

Remote · USA Full-time

Join Our Talent Pool

Remote · USA Full-time

(Plugins) Senior Software Engineer

Remote · USA Full-time

Strategic Outreach Specialist

Remote · USA Full-time

Associate, Investment - North America

Remote · USA Full-time

Experienced Data Scientist for Innovative Technology Development and Strategic Decision Making – Full Time Remote Opportunity with arenaflex

Remote · USA Full-time

Associate Account Executive, Strategic Accounts

Remote · USA Full-time

Claims Adjuster - Liability | GL and Litigation | Jurisdiction: FL | Licensing: Reciprocal License Required (REMOTE - Jacksonville, FL)

Remote · USA Full-time

Exciting Illustrator Jobs Since Yesterday – Join Our Creative Team Today

Remote · USA Full-time

Experienced Full Stack Customer Service Agent – Seasonal Remote Opportunity at arenaflex

Remote · USA Full-time

Netflix.Com Jobs Tagger, Jobs Netflix Remote, Binge Watching Netflix Job

Remote · USA Full-time

Remote Customer Support Associate – Flexible Hours, $19/hr Starting Pay, No Degree Required, Work‑From‑Home Customer Service Role

Remote · USA Full-time

Experienced Full-Time Customer Service Representative - Inbound Call Center Operations - Remote Work from Home Opportunity with blithequark

Remote · USA Full-time

Inbound Operations Team Leader (Overnight Shift) - Orem, UT - Leading Inbound Processes for a Seamless Guest Experience

Remote · USA Full-time

Join Today: Remote Job: Amazon Product Tester in united states

Remote · USA Full-time