Site Reliability Engineer - MQ, NATS/Event Broker
Bangalore, IN
About the Job
As a specialist in electronics and software for the past 20 years, in-tech is a dynamic, fast-growing engineering company headquartered in Munich, Germany employing around 2350 employees globally across 20 project locations in 8 countries.
in-tech develops innovative engineering solutions for the automotive, rail, aerospace, defence, and industrial sectors.
We are committed to a flexible, modern work culture and work-life balance. Our colour orange stands for liveliness, warmth, and dynamism. We value a strong team spirit, fresh ideas and a positive work culture. We call it the “Orange Spirit”! Since 2024, in-tech became a subsidiary of Infosys Ltd. This strategic partnership enables us to offer our customers even more comprehensive development and digitalisation services and a greater offshore capability.
We are looking for an experienced Site Reliability Engineer supporting MQ, NATS/Event Broker, you will be responsible for the stability and resilience of Mastercard’s messaging backbone to join our customer location at Pune. If you’re passionate about joining a growing and dynamic team with a company with a positive culture and team spirit, we’d love to connect with you!
Roles & Responsibilities
- Ensure high availability, performance, and resilience of MQ, NATS/Event Broker platforms across environments.
- Participate in on‑call rotations and provide hands‑on support during production incidents.
- Lead or contribute to incident triage, mitigation, and service restoration.
- Perform root cause analysis (RCA) and drive corrective and preventive actions to closure.
- Design, implement, and maintain monitoring, alerting, and dashboards to enable proactive detection.
- Support and govern production changes, including upgrades, patching, certificate renewals, and configuration changes.
- Assess operational readiness for changes and ensure rollback and validation plans are in place.
- Automate operational tasks and workflows to reduce manual effort and improve recovery times.
- Partner with application teams to support onboarding, scaling, and operational best practices.
- Create and maintain runbooks, SOPs, and operational documentation.
- Contribute to continuous improvement of reliability, observability, and operational processes.
Requirements
- Minimum 6 - 10 yrs of experience required.
- Experience supporting mission‑critical production systems with on‑call responsibility.
- Strong understanding of distributed systems and messaging platforms.
- Hands‑on experience with MQ, NATS/Event Broker, or similar middleware technologies.
- Experience with monitoring, logging, and alerting tools.
- Proficiency in at least one scripting or programming language (e.g., Python, Bash, Java).
- Solid knowledge of Linux, networking fundamentals, and system troubleshooting.
- Ability to troubleshoot complex, multi‑component issues under pressure.
Apply with us
If you have experience and team spirit and are looking for a great place to work, then start your job with us.
As part of our dedication to the diversity of our workforce, in-tech is committed to equal employment opportunity without regard for age, race, colour, national origin, ethnicity, gender, protected veteran status, disability, sexual orientation, gender identity, or religion.