Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform) Overview: At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using
Job Description About the Team The Compression and Parity team in GM’s Autonomous Vehicle (AV) Organization enables repeatable, high-velocity model deployments through principled and automated model compression under strict safety guarantees. We partner closely with model developers and deployment
Company Description It started with a simple idea: what if surgery could be less invasive and recovery less painful? Nearly 30 years later, that question still fuels everything we do at Intuitive. As a global leader
NVIDIA's success builds on a foundation of industry-leading hardware. A key strategy in achieving this is combining the best of external EDA with highly optimized internal EDA tools. Our team develops these tools
Job Details: Job Description: Our Mission At Intel, our journey is to transform AI into something safer, more trustworthy, and respectful of human privacy by design. We believe transformative AI should have a positive impact on
About the Team We are dedicated to building the inference infrastructure for ultra-large-scale language models, vision-language models, and frontier multimodal AI systems. Our mission is to provide a robust, scalable, and high-performance foundation for distributed serving, heterogeneous scheduling,
About the Team We are dedicated to building the training infrastructure for ultra-large-scale language models, vision-language models, and frontier agentic models. Our mission is to provide a robust, scalable, and high-performance foundation for post-training, multimodal learning, and reinforcement learning
About the Team We are dedicated to building the training infrastructure for ultra-large-scale language models, vision-language models, and frontier agentic models. Our mission is to provide a robust, scalable, and high-performance foundation for post-training, multimodal learning, and reinforcement learning
About the Team We are dedicated to building the inference infrastructure for ultra-large-scale language models, vision-language models, and frontier multimodal AI systems. Our mission is to provide a robust, scalable, and high-performance foundation for distributed serving, heterogeneous scheduling,
About us Founded in 2017, Wayve is the leading developer of Embodied AI technology. Our advanced AI software and foundation models enable vehicles to perceive, understand, and navigate any complex environment, enhancing the usability and safety of
Role: Hybrid in Redwood City, CA. (Will consider Remote for the right candidate) Must have: Experience building large-scale prediction or optimization systems PubMatic is the leading AI-powered ad tech company delivering measurable advertising performance through an intelligent,
At d-Matrix, we are focused on unleashing the potential of generative AI to power the transformation of technology. We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. Our
Minimum qualifications: Bachelor's degree in Electrical Engineering, Computer Engineering, Computer Science, or a related field, or equivalent practical experience. 2 years of experience in Physical Design (RTL-to-GDS) or Technology Development, focusing on advanced nodes (e.g., 7nm,
Role: Hybrid in Redwood City, CA. Must have: 3+ years of solid experience building machine learning, data science, ranking, prediction, recommendation, optimization, or large-scale data systems PubMatic is the leading AI-powered ad tech company delivering measurable advertising
About Inflection AI Inflection AI is a Public Benefit Corporation empowering people with human-centered, emotionally intelligent AI. We’re shaping the future of AI by combining emotional intelligence (EQ) and raw intelligence (IQ) to elevate people’s potential.
Responsibilities Distributed training framework optimization. Own the R&D and tuning of distributed training frameworks for large models (LLMs, multimodal), resolving scalability bottlenecks at the scale of 10k–100k GPU clusters. Kernel & performance tuning. Work close to the underlying
Founding Quant Engineer Goal: Build the future of wealth technology and investment optimization by hiring someone who can build systematic frameworks and tools that identify, model, and optimize tax outcomes for individual clients in a highly
About FranklinWH FranklinWH is a leading provider of whole-home energy management and storage solutions, headquartered in San Jose, California. Our system intelligently manages solar, battery, grid, generator, and EV power through the AI-driven smart circuit aGate
About Us GMI Cloud is a fast-growing AI infrastructure company backed by Headline VC and one of only seven cloud providers worldwide to earn NVIDIA's prestigious Reference Platform Cloud Partner designation. We operate 8 of our
NVIDIA Engineering Manager At NVIDIA, we aren't just powering the AI revolution; we're accelerating it. The TensorRT inference platform is the backbone of modern AI, delivering the industry's fastest and most efficient deployment of cutting-edge deep learning