Senior IC Failure Analysis and Fault Isolation Engineer

Gelugor, Penang, Malaysia
Backend
Senior Software Engineer
Hybrid
10+ years of experience
AI
This job posting may no longer be active. You may be interested in these related jobs instead:
Azure Messaging Team – Senior Software Engineer

Senior Software Engineer role at Microsoft's Azure Messaging Team, building large-scale distributed systems and real-time analytics solutions with up to 100% remote work flexibility.

Senior Software Engineer

Senior Software Engineer role at Microsoft's DPU group, developing compilers and system software for cloud infrastructure, offering competitive pay and benefits.

Senior Software Engineer - Backend

Senior Backend Engineer role at Microsoft Teams Developer Platform, building scalable services and bot solutions with competitive pay and benefits in Vancouver.

ROP - Senior Software Engineer

Senior Software Engineer position at Microsoft's Azure Core Compute Team, focusing on building and maintaining cloud infrastructure components with emphasis on performance, reliability, and scale.

Senior Software Engineer

Senior Software Engineer role at Microsoft's Azure Networking team, building software for global-scale AI networks and data center infrastructure.

Description For Senior IC Failure Analysis and Fault Isolation Engineer

Microsoft is leading the way into the growth of High-Performance Computing and Artificial Intelligence. Recently Microsoft has announced Cobalt and Maia Custom Silicon for Azure Data Center and AI. Microsoft Silicon Engineering is pushing technology hard in all areas including advanced packaging technology. We are looking for candidates with strong technical experience in the area of HPC/AI/GPU/CPU failure analysis and fault isolation, manufacturing/functional testing, product RMA execution and dispositioning, test hole resolution, ATE test bring up and system-to-tester correlation.

The position will have responsibilities for Silicon Test Engineering for Microsoft's Cobalt and Maia Silicon that is used for Azure Data Center and AI.

Responsibilities:

  • Responsible for Failure Analysis and Fault Isolation activities of Microsoft's Cobalt and Maia, including reliability qual fail and customer RMA product.
  • Providing comprehensive failure analysis by using combination of in-house and 3rd party FA lab facilities to get the analysis done and co-working/communicating the FA findings with cross-organization team members including hardware designer, product, package, testing engineer, etc. for root-cause finding.
  • Using the state-of-the-art optical fault isolation systems to perform electrical failure analysis for circuit critical path tracing and process latent defect location finding.
  • Involve in 1st silicon debug and able to debug issues from test hardware, test program or test content. Able to provide workaround if there is any issue found from test hardware and/or test program.
  • Analyzing test data to identify test program or silicon issues and working with cross functional teams to root cause will also be a focus.
  • Part of a larger team that correlates test solution to lab/OSAT/customer platform to validate silicon design, process, package, and manufacturability to product specifications.
  • Knowledge of platform, package, and silicon power management and thermal is a plus and the ability to connect the dots will help in addressing any product quality issues.
  • Knowledge of system level test is a plus.

We fundamentally believe that we need a culture founded in a Growth Mindset. It starts with a belief that everyone can grow and develop; that potential is nurtured, not pre-determined; and that anyone can change their mindset.

Last updated 7 months ago

Interested in this job?