← All roles
About the role
Who you'll join
A pre-seed AI startup building a taste layer for AI models. The founders are teaching AI to understand subjective domains like design, writing, personality, and visual style. They closed a $2M angel round and are moving fast toward product-market fit with pilots already running at major AI labs.
What you'll do
- Train reward models and classifiers that evaluate subjective qualities in text, images, and design.
- Build evaluation frameworks and benchmarks for domains where objective metrics fail.
- Run post-training experiments on open-source models to test new data formats and techniques.
- Work directly with AI lab researchers and creative domain experts to design and execute pilots.
- Own end-to-end research pipelines from data preparation through results publication.
- Publish blogs and whitepapers documenting findings and methods.
- Move between one-week iteration cycles and multi-month research projects.
Who you are
- You have hands-on experience training and fine-tuning large language or diffusion models through post-training techniques like SFT, RLHF, or DPO.
- You think like a researcher but move like an engineer. You iterate fast, test ideas quickly, and ship results.
- You've worked at a data company, with data vendors, or on post-training infrastructure. You understand data pipelines and quality deeply.
- You care intellectually about the problem space. A personal interest in design, writing, art, or creative fields is a strong signal.
- You're comfortable with ambiguity and early-stage constraints. You've either worked in startups or understand what it takes to build one.
- You have low ego and collaborate well. You're excited to work alongside a small, tight team.
- Bonus: Experience with multimodal models or background from companies like Pika, Luma, Midjourney, or Black Forest Labs.
Tech stack
Python, PyTorch, LLMs, diffusion models, post-training frameworks, data pipelines.
Why you'll thrive
- The problem is genuinely hard and intellectually stimulating. You're teaching AI what subjective quality means across domains nobody else is tackling at this scale.
- Compensation is $200K–$350K with 0.25–1.5% equity depending on experience. The founders flex on both for the right senior candidate.
- You ship fast and see impact immediately. Research cycles run one week to one quarter, not months in a vacuum.
- The team actively publishes findings and talks publicly about the work. Your research gets visibility and credit.
- You join at the exact right moment. The startup has resources to grow aggressively, pilots running with major labs, and clear revenue visibility within months.