# Site Reliability Engineer

> Trade Nation · Kuala Lumpur, Malaysia (Hybrid) · Full-time · Posted 2026-01-13

**Workplace:** hybrid

**Department:** Software Development

## Description

The Site Reliability Engineer ensures the reliability, availability, and performance of web services and applications within the squad. This role bridges development and operations, focusing on building scalable systems, automating processes, and maintaining high service uptime. SREs work closely with developers, QA engineers, and product teams to embed reliability into every stage of the software lifecycle.

### Who we are

Trade Nation is a global CFD and spread betting broker. We help traders make better decisions through clear market insights, transparent pricing, and a fairer approach to trading.

Since 2014, we’ve grown into a market-leading, low-cost broker with our headquarters in London and offices across Europe, South Africa, Asia-Pacific, and key offshore regions including the Caribbean and Indian Ocean. Our platform is available in 14 languages, making it accessible to traders worldwide.

Built on transparency and trust, and driven by our people, our focus is simple: helping customers trade more effectively. We do that by keeping costs low, cutting unnecessary complexity and using technology to put traders first.

### Our commitments to each other

**We have each other’s backs**

There when we need each other most

**We challenge each other**

Be more creative, more curious, more bold

**We thrive together**

Taking our work to the next level

**We form strong bonds**

Through team building and social events

**We don’t judge**

Instead, we teach and are open to learning

**We step up**

Taking ownership and supporting each other to do the same

### Responsibilities

-   System Design & Maintenance: Design, implement, and maintain scalable, secure, and reliable systems.
-   Monitoring & Troubleshooting: Implement and manage monitoring, alerting, and logging systems; proactively identify and resolve performance issues.
-   Automation: Develop and maintain automation tools to streamline operations and reduce manual intervention.
-   Collaboration: Work with development squads to ensure new features are designed with reliability in mind; participate in Agile ceremonies.
-   Incident Management: Conduct root cause analysis for incidents and implement corrective actions to prevent recurrence; participate in on-call rotations for critical systems.
-   Continuous Improvement: Drive initiatives to improve system performance, reliability, and scalability through best practices.

## Requirements

-   Bachelor’s degree in Computer Science, Engineering, or equivalent experience.
-   Experience working in SRE, DevOps, or similar roles.
-   Proficiency in C#, Python, Java, or a similar language, and willingness to work in C#.
-   Experience with cloud platforms (AWS, GCP, Azure) and containerisation/orchestration tools (Kubernetes, Docker).
-   Familiarity with Infrastructure-as-Code tools (Terraform, CloudFormation).
-   Strong problem-solving skills and ability to work under pressure.
-   Excellent communication and collaboration skills for cross-functional teamwork.
-   Passion for automation and continuous improvement.

## Apply

[Apply at Trade Nation](https://apply.workable.com/trade-nation/j/22F71BF554/apply)

