Create Alert
Email me similar jobs

Senior Software Development Engineer - DCGPU Diagnostics Quality Team

Full-time
WHAT YOU DO AT AMD CHANGES EVERYTHING
At AMD, our mission is to build great products that accelerate next‑generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.THE ROLE
Support development and deployment of diagnostic tests that validate AMD Data Center GPU products at all test stages, from silicon screening to server rack assembly.KEY RESPONSIBILITIES
Test Development (60%)Design and implement diagnostic tests for AMD silicon and server platformsDevelop test automation frameworks and infrastructureDebug test failures and hardware issues across production stagesOptimize test coverage and execution timeCross‑Team Coordination (40%)Lead root cause analysis and debug efforts for failures on production systems, often in time‑sensitive and urgent scenariosInterface with silicon design, firmware, performance, systems integration, and manufacturing teams to investigate and resolve issuesSupport manufacturing partners in test bring‑up and issue resolutionCoordinate test deployment schedules and deliverablesTrack and report on test coverage, quality metrics, and production readinessAdditional DutiesParticipate in code reviews and maintain test code qualityDocument test specifications and deployment proceduresOccasional lab work and limited factory visits as neededPREFERRED EXPERIENCEProven experience with software development or test engineering experienceProven experience with hardware/silicon validation or manufacturing test environmentsHands‑on debugging and root cause analysis in low‑level hardware/software systemsExperience with server or datacenter systems architectureDOMAIN KNOWLEDGEUnderstanding of silicon validation processes and test methodologiesFamiliarity with manufacturing workflows and production test environmentsKnowledge of server architectures (BMC, firmware, system integration)Experience with GPU/accelerator performance metrics including computational throughput, memory bandwidth, power efficiency, thermal characteristics, and whole‑system performanceBackground in AMD GPU or CPU technologies is a plusTECHNICAL SKILLSStrong proficiency in Python and C++SQL and Snowflake for data analysis and reportingLinux system administration and shell scriptingGit version control and code review practicesExperience with diagnostic tools and hardware debugging methodologiesKnowledge of at least one GPU programming framework (ROCm/CUDA/OpenCL/Vulkan/OpenGL), with ROCm strongly preferredCOMMUNICATIONExcellent written and verbal communication skills is an absoluteAbility to document technical designs, test plans, and procedures clearlyProven ability to coordinate with cross‑functional teamsACADEMIC CREDENTIALSBS in Computer Science, Computer Engineering, Electrical Engineering, or related field preferredEquivalent experience consideredLOCATION
Markham, ONELIGIBILITY FOR VISA SPONSORSHIP
This role is not eligible for visa sponsorshipBenefits offered are described: AMD benefits at a glance.AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee‑based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third‑party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD’s “Responsible AI Policy” is available here.This posting is for an existing vacancy.#J-18808-Ljbffr
Similar jobs

More from Advanced Micro Devices
Advanced Micro Devices 2 days ago
Advanced Micro Devices 1 day ago
Advanced Micro Devices 18 hours ago

Senior Software Development Engineer - DCGPU Diagnostics Quality Team

Apply Now
Back to search page