Used Tools & Technologies
Not specified
Required Skills & Competences
Tag name is followed by "@" symbol and proficiency level value.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Communication @ 3
Compliance @ 3
AI @ 3
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Details
Join NVIDIA’s modern team as a Data Center Operations Controls Engineer. This opportunity allows you to define and manage operational readiness, support, and governance of Cronus, NVIDIA's monitoring and control platform. You will partner with data center teams, Engineering, and the FOC to achieve a smooth technical roadmap and drive operational improvements.
Responsibilities
- Work together with Controls engineering to prioritize and coordinate the resolution of critical UI, stability, and interoperability issues affecting data center operations.
- Lead operational cleanup at live sites, including nuisance alarm reduction, disabled point remediation, and restoration of a usable monitoring baseline.
- Collaborate with engineering and operations to establish and uphold a consistent Controls version and configuration baseline, including setpoints, thresholds, and alarm defaults.
- Help establish naming standards, topology mapping methods, and configuration governance to ensure consistency across sites.
- Own the development and delivery of training, documentation, and knowledge transfer for data center operators and FOC teams using the Controls system.
- Support the planning and rollout of integrations between Controls and key infrastructure tools (asset, power, and monitoring systems), focusing on operational value and adoption.
- Define and track key operational metrics, such as incident response times, alarm quality, and configuration compliance, and drive continuous improvement.
Requirements
- 12+ years of experience in operations, controls, or monitoring systems in data center, industrial, or large-scale infrastructure environments.
- B.S. in a related field or equivalent experience.
- Strong understanding of controls systems, monitoring platforms, or SCADA-like tools, including alarms, setpoints, and configuration management.
- Proven success partnering with engineering, operations, and vendor teams to stabilize and improve technical platforms.
- Excellent communication skills, with the ability to translate technical issues into clear operational actions for frontline teams.
- Track record of defining and using operational metrics to drive performance and reliability improvements.
Ways to Stand Out
- Experience in managing data center operations or critical facilities.
- Experience with Ignition control systems.
- Background in process control, industrial automation, or building management systems.
- Experience leading integrations between monitoring platforms and other infrastructure tools.
- Experience with change process oversight, incident response, and configuration control approaches.
Compensation & Benefits
- Base salary range: 184,000 USD - 287,500 USD (will be determined based on location, experience, and pay of employees in similar positions).
- Eligible for equity and benefits.
Additional Information
- Applications for this job will be accepted at least until June 12, 2026.
- This posting is for an existing vacancy.
- NVIDIA uses AI tools in its recruiting processes.
- NVIDIA is an equal opportunity employer committed to fostering an inclusive work environment.