JOB DETAILS

Deep Learning Performance Architect

CompanyNVIDIA
LocationShanghai
Work ModeOn Site
PostedJune 1, 2026
About The Company
Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
About the Role

NVIDIA is developing GPU and system architectures that accelerate deep learning, automotive and high-performance computing applications. We are looking for an expert deep learning performance architect to join our deep learning modelling, performance projections, analysis and optimization effort. In this position, you will have the chance to optimize deep learning hardware and software architecture and make a significant impact in a dynamic technology focused company

What you’ll be doing:

  • Analyze performance of various deep learning algorithms on different architectures

  • Identify architecture and software performance bottlenecks and propose optimizations

  • Explore new features and hardware capabilities on deep learning applications

What we need to see:

  • BSc. MS or PhD in relevant discipline (CS, EE, Math, etc.,)

  • 5+ years of working experience in relevant directions will be a plus

  • Be familiar with GPU or accelerator-based deep learning platform and software stack

  • A strong background in computer architecture

  • Experience on system architecture design and performance optimization

  • Familiar with machine learning and deep learning frameworks

    Key Skills
    Deep LearningComputer ArchitectureGPU ArchitecturePerformance OptimizationSystem Architecture DesignMachine Learning FrameworksPerformance AnalysisHardware-Software Co-design
    Categories
    EngineeringTechnologySoftwareScience & ResearchData & Analytics
    Job Information
    📋Core Responsibilities
    Analyze the performance of deep learning algorithms across various architectures to identify bottlenecks. Propose optimizations and explore new hardware capabilities to improve deep learning software and hardware architecture.
    📋Job Type
    full time
    📊Experience Level
    5-10
    💼Company Size
    48177
    📊Visa Sponsorship
    No
    💼Language
    English
    🏢Working Hours
    40 hours
    Apply Now →

    You'll be redirected to
    the company's application page