Award-winning voice-first team collaboration app for meetings and conversations. Otter Voice Meeting Notes.
May 10
🏢 In-office - Bay Area
Award-winning voice-first team collaboration app for meetings and conversations. Otter Voice Meeting Notes.
• Model optimization: Collaborate with machine learning researchers to understand model architectures and algorithms. • Implement optimization techniques to enhance machine learning models' efficiency and inference speed on production • Deployment and Integration: Work closely with product engineers to integrate machine learning models into production systems in a scalable way • Optimize models for real-time inference, ensuring low latency and high-throughput • Set up monitoring systems to track model performance in real time. • Ensure models can scale horizontally to handle the increased load. • Implement strategies for resource-efficient inference, considering factors such as memory usage and CPU/GPU utilization. • Collaborate with cross-functional teams to understand requirements and constraints. • Provide technical expertise on inference-related matters during the model development lifecycle. • Document the deployment and optimization processes for machine learning models.
• Masters degree + 3 years of industry experience or Ph.D. degree in computer science, machine learning, speech/language processing or related field • Experience in PyTorch • Proficiency in Python • Experience in C++ • Basic knowledge of CUDA • Strong understanding of machine learning models, algorithms, and deployment strategies • Experience with model optimization techniques and performance profiling • Familiarity with docker and Kubernetes • Knowledge of AWS • Experience with monitoring tools
• 11 paid holidays • Generous Accrued Time Off increasing with years of service • Generous paid sick time • Annual day of service
Apply Now