✨ Fill and validate PDF forms with InstaFill AI. Save an average of 34 minutes on each form, reducing mistakes by 90% and ensuring accuracy. Learn more

Principal AI Innovation Specialist

Capital One Malden, MA
ai ai cloud training computing capital infrastructure clusters engineering scalability networking computer science science
April 24, 2024
Capital One
Malden, MA
OTHER

Principal AI Innovation Specialist

Job Summary:

A skilled and experienced Senior Distinguished Engineer is being sought by Capital One to lead the company's enterprise AI capabilities. This means working together with AI engineers and researchers to create a sturdy infrastructure, distributed training clusters, and advanced AI systems within the company's public cloud environment. The chosen candidate will play a major role in shaping the future of Capital One's AI capabilities while also supervising the reliability, scalability, and performance of its systems.

Job Duties and Responsibilities:

  • Create and design an infrastructure capable of maintaining stability during large-scale training tasks in our public cloud environment.
  • Our focus is on developing a comprehensive infrastructure that can effectively serve large ML models deployed across our cloud network.
  • Optimize the storage and networking components of training clusters by implementing parallel processing methodologies.
  • Once benchmarks have been established, performance testing can be conducted to measure how well the software system operates under different conditions. These tests are key to identifying areas where the system needs improvement.
  • Develop solutions by integrating Large-Language Models (LLMs) and Foundation Models (FMs).
  • Incorporate automated testing and validation methods that can evaluate model performance in an ongoing manner and flag potential issues.

Qualifications and Experience:

  • The minimum educational requirement for this job role is a Bachelor's degree in Computer Science, Computer Engineering, or a related technical discipline.
  • The ideal candidate must possess 9 or more years of expertise building and designing distributed computing and HPC systems for large-scale ML projects.
  • We are searching for someone with a six-year experience minimum in the development of AI and ML algorithms through Python or C/C++.
  • The ideal candidate should have a minimum of 3 years of experience in executing the complete ML development lifecycle employing prominent ML and AI frameworks coupled with public cloud platforms.

Preferred Qualifications:

  • Obtaining a Master’s or a PhD degree in the areas of Engineering, Computer Science, or a similar technical field.
  • Experienced in creating cloud environments and distributed platforms comprising multiple components on AWS, Azure, or GCP.
  • Expertise in architecting cloud systems with attention to key features like availability, scalability and performance, while still adhering to stringent security policies.
  • Expertise in designing and implementing innovative storage and computing solutions that are scalable and can handle massive models.
  • Building GPU clusters with tightly-coupled storage and networking in the public cloud is a complex task, requiring expertise and experience in cloud computing and high-performance computing.
  • If you're looking to become an expert in machine learning development, you'll need to have a thorough understanding of distributed training and the different frameworks available for modeling, such as Pytorch, Tensorflow, and Lightning.
  • Skilled in working with different AI tools and systems like prompt engineering, guardrails, vector databases/knowledge bases, LLM hosting, and fine-tuning.
  • Established a strong reputation for delivering innovative research papers at reputed conferences and invaluable contributions to the field of neural networks, distributed training, and SysML in the industry.

Benefits of the Position:

  • Perforamance-based incentive compensation.
  • Comprehensive protection for physical, financial, and overall wellness through various benefits.
  • An opening to be part of groundbreaking AI undertakings that are at the forefront of the sector.
  • When employees feel valued and included in the workplace, they are more likely to share ideas and work creatively with one another.

About Company:

At Capital One, making a difference in the lives of their customers is paramount. They understand that by fostering an inclusive work environment and promoting collaboration, they can deliver better banking experiences to their customers. Join their team and become a part of their commitment to innovation, customer empowerment, and diversity.


Report this job

Similar jobs near me

Related articles