Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform) Overview: At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using

Premium Full-time Trails Humanity Ml Nemo Clarity 286,200 USD

Capital One 28 days ago

Senior ML Engineer - Model Compression

General Motors ( Sunnyvale CA )

Job Description About the Team The Compression and Parity team in GM’s Autonomous Vehicle (AV) Organization enables repeatable, high-velocity model deployments through principled and automated model compression under strict safety guarantees. We partner closely with model developers and deployment

Premium Full-time Modeling Retirement Savings PyTorch Sensitivity Intellectually Curious 261,300 USD

General Motors 25 days ago

Upload Your Resume — Let employers contact you directly

Senior AI/ML Research Engineer – Model development

Intuitive ( Sunnyvale CA )

Company Description It started with a simple idea: what if surgery could be less invasive and recovery less painful? Nearly 30 years later, that question still fuels everything we do at Intuitive. As a global leader

Premium Full-time Surgery CFR Data Engineering Dexterity Rapid Prototyping

Intuitive 19 days ago

Software R&D Engineer, RTL Optimization Tools

Nvidia ( Santa Clara CA )

NVIDIAs success builds on a foundation of industry leading hardware. A key strategy in achieving this is our combining of the best of external EDA with highly optimized, internal EDA tools. Our team develops these tools

Premium Full-time VLSI Software Development Computer Science Algorithm Development RTL 218,500 USD

Nvidia 18 days ago

Inference Optimization Engineer (local / edge runtime)

Intel ( Santa Clara CA )

Job Details: Job Description: Our Mission At Intel, our journey is to transform AI into something safer, more trustworthy, and respectful of human privacy by design. We believe transformative AI should have a positive impact on

Premium Internship Intelligence EDGE Kernel Business Strategy CUDA 315,490 USD

Intel 14 days ago

AI Infra Engineer - Large Model Inference Systems (Multimo...

TikTok ( San Jose CA )

About the Team We are dedicated to building the inference infrastructure for ultra-large-scale language models, vision-language models, and frontier multimodal AI systems. Our mission is to provide a robust, scalable, and high-performance foundation for distributed serving, heterogeneous scheduling,

Premium Full-time Kernel Frontier Architecture AI Throughput

TikTok 14 days ago

Get Hired 2x Faster
Connect with Top Employers Directly

Senior AI Infra Engineer - Large Model Training Infrastruc...

TikTok ( San Jose CA )

About the Team We are dedicated to building the training infrastructure for ultra-large-scale language models, vision-language models, and frontier agentic models. Our mission is to provide a robust, scalable, and high-performance foundation for post-training, multimodal learning, and reinforcement learning

Premium Full-time MOE Architecture Reinforcement Learning Frontier Rollout

TikTok 14 days ago

AI Infra Engineer - Large Model Training Infrastructure (L...

TikTok ( San Jose CA )

Premium Full-time MOE Architecture Reinforcement Learning Frontier Rollout

TikTok 14 days ago

Senior AI Infra Engineer - Large Model Inference Systems (...

TikTok ( San Jose CA )

Premium Full-time Latency Kernel Throughput Frontier Architecture

TikTok 14 days ago

Principal Engineer, Model Development Platform

Wayve ( Sunnyvale CA )

About us Founded in 2017, Wayve is the leading developer of Embodied AI technology. Our advanced AI software and foundation models enable vehicles to perceive, understand, and navigate any complex environment, enhancing the usability and safety of

Premium Full-time Linear Programming Apache Spark Technical Leadership Architects Cross-functional Collaborations 335,300 USD

Wayve 11 days ago

Senior Principal Machine Learning Engineer - Optimization

PubMatic ( Redwood City CA )

Role: Hybrid in Redwood City, CA. (Will consider Remote for the right candidate) Must have: Experience building large-scale prediction or optimization systems PubMatic is the leading AI-powered ad tech company delivering measurable advertising performance through an intelligent,

Premium Remote Friendly Full-time Ml Media buying Hybrid Algorithms Advertising 330,000 USD

PubMatic 7 days ago

Senior Staff ML Researcher - LLM Algorithmic Optimization

D-Matrix ( Santa Clara CA )

At d-Matrix, we are focused on unleashing the potential of generative AI to power the transformation of technology. We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. Our

Premium Full-time AI STEM fields OOPS Algorithms Ml 235,000 USD

D-Matrix 7 days ago

Design Technology Co-Optimization Engineer

Google ( Sunnyvale CA )

Minimum qualifications: Bachelors degree in Electrical Engineering, Computer Engineering, Computer Science, or a related field, or equivalent practical experience. 2 years of experience in Physical Design (RTL-to-GDS) or Technology Development, focusing on advanced nodes (e.g., 7nm,

Premium Full-time Silicon Interpreting Systematics What-if Analysis Place and Route 198,000 USD

Google 7 days ago

Senior/Machine Learning Engineer — Performance Optimization

PubMatic ( Redwood City CA )

Role: Hybrid in Redwood City, CA. Must have: 3+ years of solid experience building machine learning, data science, ranking, prediction, recommendation, optimization, or large-scale data systems PubMatic is the leading AI-powered ad tech company delivering measurable advertising

Premium Remote Friendly Full-time Harness Core ML Wellness Activate Ml 330,000 USD

PubMatic 5 days ago

Principal Research Engineer, Model Training & Post-Training

Inflection ( Palo Alto CA )

About Inflection AI Inflection AI is a Public Benefit Corporation empowering people with human-centered, emotionally intelligent AI. We’re shaping the future of AI by combining emotional intelligence (EQ) and raw intelligence (IQ) to elevate people’s potential.

Premium Full-time Synthetics Hybrid Loops Mandates SIT 550,000 USD

Inflection 1 day ago

AI Infrastructure — Training Engineer (Large Model) [33251]

Stealth Startup ( Menlo Park CA )

Responsibilities Distributed training framework optimization. Own the R&D and tuning of distributed training frameworks for large models (LLMs, multimodal), resolving scalability bottlenecks at the scale of 10k–100k GPU clusters. Kernel & performance tuning. Work close to the underlying

Full-time Fine-Tuning Kernel Cluster Debugging Parallels

Stealth Startup 4 days ago

Founding Quant Engineer - Wealth Optimization

Paragon Alpha - Hedge Fund Talent Business ( Menlo Park CA )

Founding Quant Engineer Goal: Build the future of wealth technology and investment optimization - through hiring a someone who can build systematic frameworks and tools that identify, model, and optimize tax outcomes for individual clients in a highly

Contract Systematics Equities Stealth

Paragon Alpha - Hedge Fund Talent Business 4 days ago

AI Expert Advisor — Large Language Models & Generative AI ...

FranklinWH Energy Storage Inc. ( San Jose CA ) +3 other locations

About FranklinWH FranklinWH is a leading provider of whole-home energy management and storage solutions, headquartered in San Jose, California. Our system intelligently manages solar, battery, grid, generator, and EV power through the AI-driven smart circuit aGate

Part-time Governance Journals Benchmarking Circuit Publications

FranklinWH Energy Storage Inc. 1 day ago

Machine Learning Engineer, LLM Inference Optimization

GMI Cloud ( San Jose CA ) +3 other locations

About Us GMI Cloud is a fast-growing AI infrastructure company backed by Headline VC and one of only seven cloud providers worldwide to earn NVIDIAs prestigious Reference Platform Cloud Partner designation. We operate 8 of our

Full-time GMI Upstream Headlines Throughput Publishing

GMI Cloud 1 day ago

Manager, Large Language Model Inference

Nvidia ( Santa Clara CA )

NVIDIA Engineering Manager At NVIDIA, we arent just powering the AI revolutionwere accelerating it. The TensorRT inference platform is the backbone of modern AI, delivering the industrys fastest and most efficient deployment of cutting-edge deep learning

Full-time Software Engineering Realm Deep Learning AI Interfacing

Nvidia 1 day ago