RegScale, a leading continuous controls monitoring (CCM) platform, is seeking a Generative AI Engineer specializing in model optimization and evaluation. This role sits at the intersection of cutting-edge AI development and practical deployment challenges, focusing on making transformer-based models more efficient and effective in both cloud and on-premises environments.
The ideal candidate will be deeply experienced in the ML lifecycle, with particular expertise in model quantization, fine-tuning, and evaluation techniques. You'll be working on pushing the boundaries of what's possible in AI deployment efficiency, balancing performance with resource constraints across various computing environments.
Key aspects of the role include optimizing transformer models through techniques like quantization and pruning, developing comprehensive evaluation frameworks, and collaborating with domain experts on dataset engineering. You'll be responsible for making critical architectural decisions that impact model deployment across different environments, from cloud to edge computing scenarios.
The position requires both technical depth in AI/ML and the ability to communicate complex technical concepts to various stakeholders. You'll need to stay current with the latest advancements in model compression and efficient inference while maintaining a practical focus on production-grade implementations.
This is an excellent opportunity for an experienced AI engineer who wants to work on challenging problems in model optimization while contributing to a platform that delivers significant value in the governance, risk, and compliance space. The role offers the flexibility of remote work while being part of a team that's pushing the boundaries of AI efficiency and effectiveness.