August 12
In-office - Bay Area
• Develop custom LLM serving systems and the corresponding datacenter infrastructure to deliver high model quality at very low latency and cost.
• Improve our LLM training software, both through model architecture changes that improve model quality and through work that increases iteration speed and reliability.
• Build indexing systems capable of serving queries instantly from many terabytes of data within customer deployments.
• Contribute to infrastructure for petabyte-scale data pipelines to accelerate our ML research work.
• Strong software engineering skills. There are no pure research scientists at the company.
• Excellent quantitative, analytical, and estimation skills.
• Strong grasp of computer and networking architecture, particularly with GPU hardware and HPC networks.
• Familiarity with AI-powered developer tools like Codeium, Copilot, ChatGPT, and others is a strong plus.
• Offers Equity