Amazon Web Services (AWS) is the world's most comprehensive and broadly adopted cloud platform — and at the heart of it is custom silicon. Annapurna Labs, an AWS organization with development centers in the U.S. and Israel, designs the custom chips (Graviton, Trainium, Inferentia, Nitro) that power millions of customers worldwide. Our team combines cloud-scale innovation with world-class expertise across silicon engineering, hardware design, verification, software, and operations to solve technical challenges no one has tackled before.
We are seeking a Sr. Technical Manager to lead a team of networking development engineers, data center operations technicians, and facility engineers responsible for designing, building, operating, and scaling the critical lab and data center infrastructure that accelerates silicon development and validation. You will own the strategy, design, construction, and ongoing operations of lab environments spanning hundreds of rack positions, megawatts of power capacity, and specialized testing environments including thermal chambers, liquid cooling systems, and high-density compute clusters.
This is a hands-on technical leadership role. You'll drive the physical infrastructure buildout — power distribution, cooling architecture, structured cabling, network fabric design, and environmental monitoring — while simultaneously managing the operational excellence of a live, production-class lab environment. You'll operate in ambiguous spaces where the problems aren't pre-defined, defining goals, building teams, establishing processes, and influencing stakeholders across hardware, software, facilities, and infrastructure organizations to deliver results that directly impact AWS's ability to ship custom silicon faster.
Key job responsibilities
Data Center & Lab Infrastructure Design and Buildout
Lead end-to-end design and construction of lab and data center environments — power (MW-scale), cooling (air/liquid), structured cabling, and network fabric. Define technical requirements including electrical capacity planning, thermal modeling, rack density, and redundancy architectures. Drive infrastructure decisions across UPS, PDUs, generators, switchgear, chillers, and emerging technologies (liquid cooling, DC distribution). Own full lifecycle from design through commissioning and decommissioning.
Data Center Operations & Network Infrastructure
Lead networking engineers in designing and operating high-performance lab network fabrics (spine-leaf, 400G+). Own operational excellence — availability, capacity management, change management, and incident response. Establish monitoring/alerting across all systems, define SLAs, and drive automation of operational workflows including infrastructure-as-code and predictive maintenance.
Strategic Leadership & Execution
Lead a multi-disciplinary, multi-location team of networking engineers, DC operations, and facility engineers. Define vision and goals working backwards from silicon engineering needs. Attract and develop exceptional talent; build future leaders through coaching and delegation. Manage trade-offs between tactical operations and long-term strategic buildouts.
Process Improvement & Cross-Functional Influence
Establish scalable, repeatable processes for facility operations and lab provisioning. Drive standardization to reduce mean-time-to-provision. Influence senior leaders through written narratives and partner cross-functionally with silicon engineering, hardware design, security, and real estate teams.
A day in the life
No two days look the same. You might start the morning reviewing power and cooling dashboards, analyzing utilization trends across your facilities, and triaging a thermal alert in a high-density rack zone. Mid-morning, you're in a design review with your networking engineers evaluating a new topology for an upcoming lab expansion. After lunch, you're walking the data center floor with your facility engineers, inspecting a new liquid cooling loop installation and reviewing commissioning test results. Later, you lead a cross-functional planning session with silicon and hardware teams on capacity requirements for the next chip program — translating their compute and power needs into concrete infrastructure builds. You close the day with 1:1s focused on career development, a quick sync on a vendor negotiation for critical power equipment, and approving a change request for a weekend network maintenance window.
Basic Qualifications
Bachelor's degree in Electrical or Mechanical Engineering, or Bachelor's degree in computer science, engineering, mathematics or equivalent
7+ years of experience in data center operations, critical facility infrastructure, lab engineering, or network engineering
3+ years of experience directly managing technical teams (networking engineers, facility/operations engineers, or DC technicians)
Deep technical knowledge of data center power and cooling systems (UPS, generators, PDUs, chillers, CRAHs, CDUs, liquid cooling)
Experience designing or operating data center network infrastructure (spine-leaf architectures, BGP/OSPF, 100G/400G fabrics)
Experience with MW-scale power distribution design and capacity planning. Demonstrated ability to lead facility buildout or expansion projects from design through commissioning
Experience establishing operational metrics, SLAs, and incident management processes for critical infrastructure
Preferred Qualifications
Master's degree in engineering, management, or technology, or Master's degree in Electrical or Mechanical Engineering
Experience working with cross-functional teams
Background in silicon development lab environments, hardware validation, or chip bring-up operations
Hands-on knowledge of liquid cooling technologies (direct-to-chip, rear-door, immersion) and thermal management at high rack densities
Experience with BMS systems and infrastructure monitoring/observability tools
Track record of building teams and operational processes from scratch in data center environments
Experience managing vendor relationships, capital equipment procurement, and construction project delivery
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.
The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at https://amazon.jobs/en/benefits.
USA, TX, Austin - 139,300.00 - 208,000.00 USD annually
Company - Annapurna Labs (U.S.) Inc.
Job ID: A10427129
Seniority level
Mid-Senior level
Employment type
Full-time
Job function
Information Technology, Consulting, and Engineering
Industries
IT Services and IT Consulting
Referrals increase your chances of interviewing at Amazon Web Services (AWS) by 2x