This role involves leading the fine-tuning, optimization, and deployment of AI models, with a strong emphasis on on-device inference. The engineer will adapt foundation models (LLMs, multimodal models) to specialized domains such as orchestration and multi-agent coordination, ensuring the models are fast, accurate, and efficient in resource-constrained environments.
• Lead the fine-tuning, optimization, and deployment of AI models
• Strong experience with on-device inference
• Expertise in adapting foundation models (LLMs, multimodal models) to specialized domains
• Experience with applications such as orchestration, planning, and multi-agent coordination
• Ability to make models fast, accurate, and efficient for resource-constrained environments
• Focus on ensuring model robustness and safety
• Health insurance
• Dental insurance
• Vision insurance
• Long-term and short-term disability insurance
• Employee assistance program
• Flexible spending account
• Life insurance
• 4-12 weeks of fully paid parental leave
• 11 paid holidays
• Flexible paid vacation and sick leave