Job Description
Mainframe - SRE (Cobol)
We are FIS. Our technology powers the world’s economy, and our teams bring innovation to life. We champion diversity to deliver the best products and solutions for our colleagues, clients, and communities. If you are ready to start learning, growing, and making an impact with a career in fintech, we encourage you to apply.
About the Role:
In this role, you will help modernize critical applications, with a focus on improving observability, automation, and resiliency. You’ll work across both mainframe technologies (COBOL, RPG) and modern server-based environments (Java, Angular, .NET), operating at the intersection of legacy systems and contemporary microservices. You will drive engineering improvements that directly enhance production support operations.
What you will be doing:
Automation: Identify automation opportunities and implement tools and processes that streamline routine tasks, enable scalable infrastructure, and support seamless deployments.
Reliability: Ensure the reliability, availability, and performance of applications and services. Develop and track new service level indicators to support SLO and SLA compliance.
Monitoring: Design and maintain monitoring and alerting solutions that improve visibility into infrastructure, application performance, and user experience.
Capacity/Performance: Conduct capacity planning, performance tuning, and resource optimization in partnership with development and operations teams.
Documentation: Create and maintain clear documentation and knowledge base articles to promote knowledge sharing.
Disaster Recovery: Recommend and implement improvements to disaster recovery plans, backup strategies, and failover mechanisms.
Incident Response:
Lead incident response as a subject matter expert, including identification, triage, resolution, and post-incident analysis.
Identify and drive improvements in reliability, performance, and efficiency through data and root cause analysis.
Participate in an on-call rotation to support critical production incidents. You’ll join a globally distributed team that provides 24/7 coverage, ensuring fast triage, coordinated response, and seamless resolution of high-priority issues.
Application Enhancement: Partner with development, QA, DevOps, and product teams to influence design and drive application resiliency improvements.
Continuous Learning: Continue developing your skills through product training and technical training with Pluralsight across multiple technologies.
Requirements & Skills
What you bring:
4 to 8 years of experience in the IT industry.
Mainframe Technologies (Required): COBOL, RPG, JCL, CICS, SQL, CL, DDS, DDL, JES.
Modern Languages & Frameworks (Required): Java, C#, Python, JavaScript, Spring Boot, Hibernate, JDBC, Angular, Oracle PL/SQL.
Automation & IaC (Required): Python/Bash/PowerShell scripting, Terraform, Ansible, Jenkins, GitHub, Bitbucket, ServiceNow, Jira, Azure DevOps.
Monitoring Tools (Preferred): Splunk, Dynatrace, Resolve, Nobl9, JMeter, Zabbix.
Added bonus if you have:
Experience with development environments and tools including V7.4, Eclipse, Visual Studio, Azure DevOps, MDCMS, Git, RDi, X Analysis, Hawkeye, and CheckMarx, as well as Microsoft Office tools such as Visio.
How to Apply
Apply directly through the application link above.