Used Tools & Technologies
Not specified
Required Skills & Competences
Tag name is followed by "@" symbol and proficiency level value.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Grafana @ 4
Kafka @ 4
Linux @ 4
Prometheus @ 4
Python @ 6
Distributed Systems @ 4
Communication @ 7
.NET @ 4
OpenTelemetry @ 4
Observability @ 4
AI @ 4
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Details
Reliability Engineers in Electronic Trading (ET) at Bloomberg fill the mission-critical role of ensuring our enterprise products are resilient, observable, and ready for the future. The products supported include large over-the-counter electronic trading platforms across fixed income, currencies, commodities, derivatives, and equities. The Engineering Practices team ensures engineers deliver reliable, compliant software at high efficiency.
Responsibilities
- Build automation and frameworks that improve resiliency, observability, and recovery
- Partner with teams to review system health, mitigate reliability risks, and enhance performance
- Automate operational tasks to boost scalability and safety
- Promote best practices in reliability, alerting, and incident response
Requirements
- 4+ years of experience in Python (or another object-oriented language)
- Strong collaboration and communication skills
- Solid understanding of distributed systems and system reliability
- Experience with Linux, databases, web services, and storage
- Interest in AI-driven operations
- Degree in Computer Science, Engineering, or equivalent experience
Nice to Have
- Prior Reliability Engineering experience or open-source contributions
- Familiarity with modern container/orchestration technologies
- Experience with Kafka, Prometheus, Grafana, OpenTelemetry and automation tools
- Experience in anomaly detection and predictive alerting
Why It Matters
You’ll shape the future of operational resiliency through automation and AI, ensuring Bloomberg's global systems are stable, efficient, and adaptive.
Compensation and benefits
Salary Range = 160000 - 240000 USD Annually + Benefits + Bonus
The posting notes a comprehensive benefits plan that may include merit increases, incentive compensation (exempt roles only), paid holidays, paid time off, medical, dental, vision, short and long term disability benefits, 401(k) + match, life insurance, and various wellness programs. The Company does not provide benefits directly to contingent workers/contractors and interns.