Cohere is seeking a Member of Technical Staff, Training Performance Engineer, to join its mission of scaling intelligence to serve humanity. This role, part of the Pre-Training team, focuses on optimizing the performance of advanced language models and the systems that train them. The position combines software engineering, machine learning, and low-level kernel development expertise to improve model performance and training throughput.
The ideal candidate will work on critical aspects of model optimization, including writing high-performance software, developing CUDA kernels, and implementing distributed training strategies. They will be responsible for identifying and removing performance bottlenecks while working with cutting-edge training and profiling tools.
Cohere offers a collaborative environment in which employees work alongside world-class researchers and engineers. The company has offices in major tech hubs such as London, Toronto, San Francisco, and New York, and maintains a remote-friendly culture. Benefits include health and dental coverage, mental health support, parental leave, and generous vacation time.
This role presents an opportunity to shape the future of AI development by working with frontier models and contributing to systems that power next-generation AI applications. The position requires strong technical expertise and offers growth potential and the chance to work with leading researchers in the field. Cohere values diversity and maintains an inclusive work environment, welcoming applicants from all backgrounds.