We are a small, friendly and multi-disciplinary AI studio creating a personal AI for everyone.
April 26
🏢 In-office - Bay Area
• Inflection has trained state-of-the-art models such as Inflection 2.5, but to make these models appropriate for deployment to our enterprise partners and to Pi, they require finetuning and alignment. Research engineers collect and curate datasets, experiment with novel finetuning techniques, and evaluate the resulting models to ensure they meet our standards for safety, helpfulness, and personality.
• Have worked on finetuning and evaluating LLMs before, either with in-house models or via API
• Like working in a fast-paced environment and are comfortable working with ambiguous technical requirements
• Are comfortable operating on large compute clusters via tools like Slurm
• Have a strong understanding of modern machine learning techniques (transformer architectures, RLHF, DPO, etc.) and associated Python frameworks (Torch, JAX, etc.)
• Knowledge of SQL and data tooling like Snowflake, Dagster, and Airbyte is a bonus, but is not expected
• Unlimited paid time off
• Parental leave and flexibility for all parents and caregivers
• Generous medical, dental and vision plans for US employees
• Compliance with country-specific benefits for non-US employees
• Visa sponsorship for new hires
• Avenues for personal growth such as coaching, conference attendance, or specific trainings