192 Site Reliability Engineer jobs in Singapore
Site Reliability Engineer
Posted today
Job Viewed
Job Description
Direct message the job poster from Beijing Foreign Enterprise Management Consultants Co.,Ltd.
On behalf of Huawei, a world-renowned information and communication technology company, we are seeking passionate and talented individuals to join our team as Site Reliability Engineer
OverviewOn behalf of Huawei, a world-renowned information and communication technology company, we are seeking passionate and talented individuals to join our team as Site Reliability Engineer.
Responsibilities- To be responsible for reliability, availability, user experience, capacity planning, toil reduction, process enhancement and digitalization of the cloud-based internet services.
- Handle SRE role for assigned cloud services owning the KPIs for reliability, issue to resolution, service deployment, business continuity management, security policy planning, capacity planning, toil reduction through automation.
- Introduce service governance initiatives based on latest technologies to consistently increase reliability and user experience components of Huawei mobile services on cloud to provide world class user experience with high reliability.
- Effectively utilize our world class AIOPS and autonomous service governance platform to ideate new ways to streamline process, accuracy of alerts, time series-based trend analysis, anomaly detection, risk identifications.
- Support platform/service expansions, migrations to new architectures, upgrades and drill activities across different technology domains.
- Incorporate mature chaos engineering for risk identification, IPDRR for security, comprehensive automation frameworks to reduce ops effort to reach lowest possible level and make time, space for engineering related focus for the team.
- Bachelor/Master of computer science engineering or related majors
- Have knowledge of Linux, Network, Database, Containers, Container management systems, etc.
- Have knowledge of at least one programming language or scripting such as Java, Python, Shell, Ansible, Terraform
- Have knowledge in big data analytics.
- Explored new technology trends, opensource technologies, methodologies in internet service domain.
- Entry level
- Contract
- Engineering and Information Technology
- IT Services and IT Consulting
Site Reliability Engineer
Posted today
Job Viewed
Job Description
As a leading internet technology company based in China, NetEase, Inc. (NASDAQ: NTES and HKEX:999, “NetEase”) provides premium online services centered around content creation. With extensive offerings across its expanding gaming ecosystem, the Company develops and operates some of China’s most popular and longest-running mobile and PC games. Powered by industry-leading in-house R&D capabilities in China and globally, NetEase creates superior gaming experiences, inspires players, and passionately delivers value for its thriving community worldwide. By infusing play with culture and education with technology, NetEase transforms gaming into a meaningful vehicle to build a more entertaining and enlightened world.
NetEase’s ESG initiatives are among the best in the global media and entertainment industry, earning it a distinction as one of the S&P Global Industry Movers and an “A” rating from MSCI. For more information, please visit:
Job Description- Site Reliability Engineering (SRE) refers to using software engineering methods to manage systems, solve problems, and achieve operational automation to reduce trivial tasks and improve service availability. Responsibilities include but are not limited to:
- Manage the operational work of NetEase Interactive Entertainment services, such as Eggy Party, Marvel Rivals, UU Accelerator, Ace Racer, and other online services, as well as internal research projects.
- Design and select basic runtime environments (including servers, virtualization, cloud services, networks, databases, etc.) for game servers based on different games' service architecture, performance requirements, and business conditions, providing high-quality and efficient operational services at controllable costs.
- Establish and monitor various operational metrics and customize data analysis standards.
- Collaborate with product departments to identify issues, optimize technical architecture, and enhance user experience based on game and infrastructure conditions.
- Participate in in-depth research on cutting-edge open-source software, virtualization, databases, and web services, and develop technical solutions for business implementation.
- Bachelor's degree or above, majors in computer science, networking, communications, automation, or related fields are preferred.
- Familiar with the Linux operating system; knowledgeable about computer network architectures and common network protocols such as TCP/IP and HTTP.
- Proficient in at least one programming language, including but not limited to C/C++, Shell, Python, Golang, Rust, or Java.
- Passionate about open-source; experience or knowledge in open-source software such as Linux, Nginx, MySQL, K8S, and Istio is preferred.
- Strong logical thinking, communication, and learning abilities; adept at research and problem-solving.
- Skilled at teamwork, with a strong sense of collective honor, responsibility, and service awareness.
- Open to trying new things, with excellent problem-solving skills and strong technical sensitivity; experience in contributing to open-source communities is a plus.
- Proficiency in Chinese is required for this role, as daily communication and collaboration with key stakeholders and team members based in China are essential to the responsibilities of the position.
Apply on the NetEase Careers page.
#J-18808-LjbffrSite Reliability Engineer
Posted 2 days ago
Job Viewed
Job Description
We treat Infrastructure and operations as Software Engineering problems. Our mission is to build and progress software platforms which enables the provisioning and managing of all Digibank services in safe, reliable and scalable ways. We consistently challenge the status quo, use new technologies to build platforms and tooling for engineering teams. In this role you will make significant decisions with a huge impact on building modern banking technology. You would be part of a team, responsible for designing & architecting new solutions, finding creative ways to optimize existing solutions which will improve agility for managing hundreds of microservices infrastructures in a stable & reliable way.
Roles and Responsibilities- Using InfrastructureAsCode tooling like Terraform, and Ansible to manage AWS, Azure & Kubernetes resources
- Configuring and installing various network devices and services (e.g., routers, switches, firewalls, load balancers, VPN).
- Support IT network infrastructure-related work, such as installing Internet connections, WiFi APs, network upgrades, office builds, expansions, and relocations
- Actively participate in engaging with Business Stakeholders, internal IT Teams, and Vendors to manage the outcome of the projects.
- Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure of incidents
- Perform analytics on previous incidents and usage patterns to better predict issues and take proactive actions
- Build and drive adoption for greater self-healing and resiliency patterns
- Performance and cost optimization for infrastructure
- Be part of an on-call rotation for the team’s tooling and 24x7 support coverage as needed
- Succeed, fail, and learn together with other talented people. We believe in an environment that provides an opportunity for growth and see education as an outcome of failure that gets us closer to the next breakthrough
- Bachelor's degree in information systems, information technology, computer science, or similar.
- 3-5 years of professional experience.
- Extensive routing, switching, security, and wireless LAN design, implementation, and troubleshooting experience
- Cloud (AWS/Azure) network configuration and integration with on-premises network equipment.
- Network Automation experience using any scripting language (Python, Go, Perl, Bash).
- Experience with managing Infrastructure as code using Terraform
- Direct production operations experience in a cloud environment.
Mid-Senior level
Employment typeFull-time
Job functionEngineering and Information Technology
IndustriesFinancial Services and Banking
#J-18808-LjbffrSite Reliability Engineer
Posted 2 days ago
Job Viewed
Job Description
Through proprietary software and AI, Sleek makes the back-office easy for micro SMEs. We operate 3 business segments: Corporate Secretary : Automating the company incorporation, secretarial, filing, Nominee Director, mailroom and immigration processes via custom online robots and SleekSign. We are the market leaders in Singapore with :5% market share of all new business incorporations; Accounting & Bookkeeping : Redefining what it means to do Accounting, Bookkeeping, Tax and Payroll thanks to our proprietary SleekBooks ledger, AI tools and exceptional customer service; FinTech payments : Overcoming a key challenge for Entrepreneurs by offering digital banking services to new businesses. Sleek launched in 2017 and now has around 15,000 customers across our offices in Singapore, Hong Kong, Australia and the UK. We have around 500 staff with an intact startup mindset. We have recently raised Series B financing off the back of >70% compound annual growth in Revenue over the last 5 years. Sleek has been recognised by The Financial Times, The Straits Times, Forbes and LinkedIn as one of the fastest growing companies in Asia. Backed by world-class investors, we are on track to be one of the few cash flow positive, tech-enabled unicorns based out of Singapore.
RequirementsWe are looking for a Service Reliability Engineer that is excited about the below Mission and Outcomes.
MissionThe primary mission of a Service Reliability Engineer is to serve as the highest level of technical escalation within the support team. This role focuses on resolving complex technical issues that cannot be addressed by frontline support, ensuring minimal downtime, and maintaining the highest level of customer satisfaction. The engineer collaborates with development teams, product management, and other stakeholders to diagnose, troubleshoot, and resolve issues, as well as to contribute to continuous improvement initiatives that enhance the overall quality of products and services.
Responsibilities- Collaborate with friendly Product, Tech and Data teams to reproduce and resolve an array of enquiries
- Support our internal business stakeholders on questions about Sleek platform functionality
- Provide technical support through our channels via Zendesk, JIRA, and phone
- Identification of fixes and development of code
- Configure servers and networks in the Cloud
- Update database records to keep client records up-to-date
- Update knowledge base to improve self-servicing
- Minimum of 5 years experience as Software Engineer
- Location: Singapore, Philippines, India or Malaysia (preferred)
- Experience with JavaScript tech stack (Node, NestJS, React, Vue)
- Experience with MongoDB, RDBMS (PostgreSQL, MySQL, etc.)
- Experience with Cloud Platforms (e.g. AWS)
- Bitbucket and Github (CI/CD knowledge)
- Proficiency in reading logs, especially Splunk
- Understanding load testing tools and analyzing results
- Possess a strong academic foundation, ideally from reputable universities
- Demonstrated track record working in well-established or recognized organizations
- Over 5 years of experience as a Software Engineer
- Able to demonstrate an attention to detail
- Shows customer-attentiveness and have faced customers in past positions
- Strong analytical and problem-solving skills
- Experience with JavaScript tech stack (Node, NestJS, React, Vue)
- Experience with MongoDB, RDBMS (PostgreSQL, MySQL, etc.)
- Experience with Cloud Platforms (e.g. AWS)
- Bitbucket and Github(CI/CD knowledge)
- Proficiency in reading logs, especially Splunk
- Understanding load testing tools and analyzing results
- Ownership
- Humility
- Structured Thinking
- Data driven
- Can have tough conversations in a positive way
- Analytical Mindset
- Collaboration-Driven
Humility and kindness: Humility is a core attribute we hire for, which means we have a culture of not taking ourselves too seriously and being able to laugh. Kindness is also incredibly important. We are committed to creating and nurturing a diverse and inclusive environment.
Flexibility: You'll be able to work from home 5 days per week. If you need to start early or start late to cater to your family or other needs, we don't mind, so long as you get your work done and proactively communicate. You can also work fully remote from anywhere in the world for 1 month each year
Financial benefits: We pay competitive market salaries and provide staff with generous paid time off and holiday schedules. Certain staff at Sleek are also eligible for our employee share ownership plan and can share in the upside of our stellar growth trajectory as we work toward listing on a prominent stock exchange in the Asia Pacific region.
Personal growth: You'll get a lot of responsibility and autonomy at Sleek - we move at a fast pace so you'll be making decisions, making mistakes and learning. There's also a range of internal and external facing training programmes we run. We're also at the forefront of utilising AI in our space and are developing a regional centre of AI excellence. It is our intention that if you leave Sleek, you leave as a more well-rounded person and professional.
Sleek is also a proudly certified B Corp. Since we started our journey in 2017, we've been committed to building Sleek as a force for good. In just over 5 years, we've joined a community of industry leaders like Patagonia, Ben & Jerry's, and P&G who are building an inclusive, equitable, and a regenerative economy. We have planted over 29,271 trees to reforest our ecosystem and saved 7 tons of paper from landfills by processing over 1.4M pages through SleekSign. We aim to be Carbon Neutral by 2030.
Seniority level- Mid-Senior level
- Full-time
- Other
- IT Services and IT Consulting
Site Reliability Engineer
Posted 2 days ago
Job Viewed
Job Description
Through proprietary software and AI, along with a focus on customer delight, Sleek makes the back-office easy for micro SMEs.
We give Entrepreneurs time back to focus on what they love doing - growing their business and being with customers. With a surging number of Entrepreneurs globally, we are innovating in a highly lucrative space.
We operate 3 business segments:
Corporate Secretary: Automating the company incorporation, secretarial, filing, Nominee Director, mailroom and immigration processes via custom online robots and SleekSign. We are the market leaders in Singapore with ~5% market share of all new business incorporations
Accounting & Bookkeeping: Redefining what it means to do Accounting, Bookkeeping, Tax and Payroll thanks to our proprietary SleekBooks ledger, AI tools and exceptional customer service
FinTech payments: Overcoming a key challenge for Entrepreneurs by offering digital banking services to new businesses
Sleek launched in 2017 and now has around 15,000 customers across our offices in Singapore, Hong Kong, Australia and the UK. We have around 500 staff with an intact startup mindset.
We have recently raised Series B financing off the back of >70% compound annual growth in Revenue over the last 5 years. Sleek has been recognised by The Financial Times, The Straits Times, Forbes and LinkedIn as one of the fastest growing companies in Asia.
Backed by world-class investors, we are on track to be one of the few cash flow positive, tech-enabled unicorns based out of Singapore.
We are looking for Service Reliability Engineer that is excited about the below Mission and Outcomes.
Mission: The primary mission of a Service Reliability Engineer is to serve as the highest level of technical escalation within the support team. This role focuses on resolving complex technical issues that cannot be addressed by frontline support, ensuring minimal downtime, and maintaining the highest level of customer satisfaction. The engineer collaborates with development teams, product management, and other stakeholders to diagnose, troubleshoot, and resolve issues, as well as to contribute to continuous improvement initiatives that enhance the overall quality of products and services.
Outcomes:
- Collaborate with friendly Product, Tech and Data teams to reproduce and resolve an array of enquiries.
- Support our internal business stakeholders on questions about Sleek platform functionality.
- Provide technical support through our channels via Zendesk, JIRA, and phone.
- Identification of fixes and development of code.
- Configure servers and networks in the Cloud.
- Update database records to keep client records up-to-date.
- Update knowledge base to improve self-servicing.
To do this, you will have a minimum of 5 years experience as Software Engineer and you will most likely be located in Singapore, Philippines, India or Malaysia.
Performance Standard
- Possess a strong academic foundation, ideally from reputable universities.
- Demonstrated track record working in well-established or recognized organizations.
- Over 5 years of experience as a Software Engineer.
- Able to demonstrate an attention to detail.
- Shows customer-attentiveness and have faced customers in past positions.
- Strong analytical and problem-solving skills.
- Experience with JavaScript tech stack (Node, NestJS, React, Vue).
- Experience with MongoDB, RDBMS (PostgreSQL, MySQL, etc.).
- Experience with Cloud Platforms (e.g. AWS).
- Bitbucket and Github(CI/CD knowledge).
- Proficiency in reading logs, especially Splunk.
- Understanding load testing tools and analyzing results.
Behavioural fit is also important at Sleek, and we will be looking for candidates that have a proven track record of embodying the below attributes in their recent roles:
Ownership : This shows reliability and helps build trust within the team. We move fast and need to know that everyone will see things through to completion and proactively help to get things back on track when challenges arise. Accountability is really important to us.
Humility: There is so much we don’t know. Humility allows for open-mindedness to feedback and a willingness to learn from others. It paves the way for collaboration and creates a positive work environment. It is a key ingredient of self awareness and emotional intelligence.
Structured Thinking: Our business is complex with many layers (many services, many countries, many cultures). Regardless of whether you’re more analytical or creative in nature, being able to show sound judgement is important to us. It ensures solutions are pragmatic and balance the needs of the organisation, team and customers.
Data driven: We are a data rich business with ~15,000 small customers. Each decision we make can impact many more people than we realise - so it’s critical that we use sound data to support our strategies and review the success of our initiatives.
Can have tough conversations in a positive way: It’s not a matter of if, but when difficult interpersonal situations arise. Disagreement, conflict and disappointment are a given in a fast moving business where people care about their work. People that proactively have tough conversations with kindness build empathy, trust and great working relationships.
Analytical Mindset: You have a keen eye for detail and a methodical approach to dissecting problems. You excel at analysing complex systems and processes to identify weaknesses and inefficiencies, and your ability to evaluate multiple scenarios enables you to devise the best testing strategies. You apply data-driven decisions to enhance testing coverage and performance metrics, ensuring the highest standards of software quality.
Collaboration-Driven: You thrive in a cross-functional team environment, working closely with developers, product managers, and operations teams to ensure alignment on requirements and testing goals. You communicate effectively, advocate for quality throughout the development process, and proactively address potential issues before they arise, fostering a culture of shared responsibility for delivering exceptional software.
Some other great things about working at Sleek…
Humility and kindness: Humility is a core attribute we hire for, which means we have a culture of not taking ourselves too seriously and being able to laugh. Kindness is also incredibly important. We are committed to creating and nurturing a diverse and inclusive environment.
Flexibility: You’ll be able to work from home 5 days per week. If you need to start early or start late to cater to your family or other needs, we don’t mind, so long as you get your work done and proactively communicate. You can also work fully remote from anywhere in the world for 1 month each year
Financial benefits: We pay competitive market salaries and provide staff with generous paid time off and holiday schedules. Certain staff at Sleek are also eligible for our employee share ownership plan and can share in the upside of our stellar growth trajectory as we work toward listing on a prominent stock exchange in the Asia Pacific region.
Personal growth: You’ll get a lot of responsibility and autonomy at Sleek - we move at a fast pace so you’ll be making decisions, making mistakes and learning. There’s also a range of internal and external facing training programmes we run. We’re also at the forefront of utilising AI in our space and are developing a regional centre of AI excellence. It is our intention that if you leave Sleek, you leave as a more well-rounded person and professional.
Sleek is also a proudly certified B Corp. Since we started our journey in 2017, we’ve been committed to building Sleek as a force for good. In just over 5 years, we’ve joined a community of industry leaders like Patagonia, Ben & Jerry's, and P&G who are building an inclusive, equitable, and a regenerative economy. We have planted over 29,271 trees to reforest our ecosystem and saved 7 tons of paper from landfills by processing over 1.4M pages through SleekSign. We aim to be Carbon Neutral by 2030.
#J-18808-LjbffrSite Reliability Engineer
Posted 5 days ago
Job Viewed
Job Description
Responsibilities
To be responsible for reliability, availability, user experience, capacity planning, toil reduction, process enhancement and digitalization of the cloud-based internet services.
Handle SRE role for assigned cloud services owning the KPIs for reliability, issue to resolution, service deployment, business continuity management, security policy planning, capacity planning, toil reduction through automation.
Introduce service governance initiatives based on latest technologies to consistently increase reliability and user experience components of Huawei mobile services on cloud to provide world class user experience with high reliability.
Effectively utilize our world class AIOPS and autonomous service governance platform to ideate new ways to streamline process, accuracy of alerts, time series-based trend analysis, anomaly detection, risk identifications.
Support platform/service expansions, migrations to new architectures, upgrades and drill activities across different technology domains.
Incorporate mature chaos engineering for risk identification, IPDRR for security, comprehensive automation frameworks to reduce ops effort to reach lowest possible level and make time, space for engineering related focus for the team.
Requirements and Qualifications
Bachelor/Master of computer science engineering or related majors
Have knowledge of Linux, Network, Database,Containers, Container management systems, etc.
Have knowledge of at least one programming language or scripting such as Java, Python, Shell, Ansible, Terraform
Have knowledge in big data analytics.
Explored new technology trends, opensource technologies, methodologies in internet service domain.
Site Reliability Engineer
Posted 9 days ago
Job Viewed
Job Description
Join to apply for the Site Reliability Engineer role at IDEMIA
Join to apply for the Site Reliability Engineer role at IDEMIA
Get AI-powered advice on this job and more exclusive features.
Purpose
This role plays a critical part in ensuring reliability, scalability, and performance of our systems and services. You will work closely with development and operations teams to build and maintain robust infrastructure and tools that support high availability, monitoring and rapid deployment.
Job Description
Purpose
This role plays a critical part in ensuring reliability, scalability, and performance of our systems and services. You will work closely with development and operations teams to build and maintain robust infrastructure and tools that support high availability, monitoring and rapid deployment.
Key Missions
- Maintains platforms or products after go live by measuring and monitoring their availability, performance and overall system health
- Recovers platforms or products during production incidents to meet targeted service-level agreements
- Set up, enhance and maintain observability tools.
- Assist in incident response, perform root cause analysis, and postmortem documentation.
- Develop tools/applications/scripts to improve operational efficiency.
- Maintain and enhance CI/CD pipelines.
- Collaborate with software engineers to design scalable and resilient systems.
- Participate in on-call and on-site rotations and contribute to reducing alert fatigue.
- Document processes, configurations, and best practices.
- Support other software efficiency improvement initiatives.
- At least 1-3 years’ experience in software development, Devops or SRE.
- Curious, Strong communicator and ready to work in a fast-paced environment and willing to pick up new skills and technologies as necessary.
- Degree in Electrical / Electronics / Computer Engineering / Computer Science or a relevant discipline
- Basic understanding of Linux/Unix systems and shell scripting.
- Familiarity with cloud platforms (e.g., AWS, Azure, GCP).
- Exposure to containerization tools (e.g., Docker, Kubernetes).
- Experience with monitoring tools (e.g., Prometheus, Grafana, ELK).
- Knowledge of CI/CD tools (e.g., Jenkins, Gitlab, Bitbucket, Jira).
- Programming/scripting skills in Python, Java, or Bash.
- Understanding of networking fundamentals and system security.
- Good written and verbal communication skills.
- Self-motivated, independent and a good team player
- Able to work under pressure in a fast-paced environment
- Innovative, proactive mindset and with a focus on continuous improvement
- Strong analytical and problem-solving skills
- Seniority level Executive
- Employment type Full-time
- Job function Information Technology
- Industries IT Services and IT Consulting
Referrals increase your chances of interviewing at IDEMIA by 2x
Get notified about new Site Reliability Engineer jobs in Singapore, Singapore .
Production Engineer / Site Reliability Engineer Platform Engineer - Up to $200k + Industry Leading Bonus - Elite FinTech Firm Information Technology - Cloud/DevOps Engineer Site Reliability Engineer (EMEA, Japan, Singapore, Australia) Site Reliability Engineer Assistant (DevOps) DevOps / Site Reliability Engineer | up to 150K SGD per year Engineer (Energy Management Systems Department) Site Reliability Engineer (SRE) (GovTech) Site Reliability Engineer, Engineering Infra - AZ SRE (Campus Recruitment 2026)We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-LjbffrBe The First To Know
About the latest Site reliability engineer Jobs in Singapore !
Site Reliability Engineer
Posted 14 days ago
Job Viewed
Job Description
Responsibilities:
- Responsible for deployment, change, issues triage and infrastructure management of overseas games and relevant components and system, e.g. game monitor system, login services.
- Responsible for monitoring and dashboarding for game observability, and ensure the game is reliable, scalable and secure.
- Understand the game architecture, analyze, evaluate and respond to potential risks, such as hidden troubles and performance bottlenecks.
- Responsible for daily communication and coordination between various teams.
Requirements:
- Bachelor’s Degree or above in Computer Science or comparable field.
- More than 3 years of operations experience in Linux and Windows operating system.
- Good knowlege in cloud and containerization.
- Proficiency in scripting programming such as Bash, Python, SQL is a plus.
- Experience with worldwide online game live operations is a plus.
- Have a high sense of responsibility and teamwork spirit.
Site Reliability Engineer
Posted 15 days ago
Job Viewed
Job Description
Join to apply for the Site Reliability Engineer role at Tower Research Capital
Join to apply for the Site Reliability Engineer role at Tower Research Capital
Tower Research Capital is a leading quantitative trading firm founded in 1998. Tower has built its business on a high-performance platform and independent trading teams. We have a 25+ year track record of innovation and a reputation for discovering unique market opportunities.
Tower is home to some of the world’s best systematic trading and engineering talent. We empower portfolio managers to build their teams and strategies independently while providing the economies of scale that come from a large, global organization.
Engineers thrive at Tower while developing electronic trading infrastructure at a world class level. Our engineers solve challenging problems in the realms of low-latency programming, FPGA technology, hardware acceleration and machine learning. Our ongoing investment in top engineering talent and technology ensures our platform remains unmatched in terms of functionality, scalability and performance.
At Tower, every employee plays a role in our success. Our Business Support teams are essential to building and maintaining the platform that powers everything we do — combining market access, data, compute, and research infrastructure with risk management, compliance, and a full suite of business services. Our Business Support teams enable our trading and engineering teams to perform at their best.
At Tower, employees will find a stimulating, results-oriented environment where highly intelligent and motivated colleagues inspire each other to reach their greatest potential.
Responsibilities
- Overseeing and ensuring the continuous operation of the firm's Linux-based trading infrastructure, addressing day-to-day operational needs
- Providing second-level support, including:
- Rapid response to emergencies
- Implementing scheduled updates and deployments
- In-depth analysis and resolution of performance issues
- Engage in a rotational on-call schedule, including early morning and weekend shifts, to provide timely support
- Contributing towards the development of automated solutions for server provisioning, configuration, and monitoring, targeting a scalable management of thousands of servers
- Engaging in interactions with the Trading and Core Engineering teams
- Managing essential Core services such as DHCP, LDAP, DNS, and NFS for on-prem and hosted data centers as well as public clouds
- Participating in an on-call rotation and occasional weekend shifts
- Sound expertise in Linux production environments
- Basic knowledge of Python and Bash scripting
- Engagement with automation and monitoring tool sets
- Comprehensive knowledge of operating system principles, with a particular focus on Linux internals
- Familiarity with Intel-based server hardware and components
- Competence in server-side networking, including understanding network protocols and configurations
- Familiarity in cloud services and architectural solutions
- Experience in designing, building, and troubleshooting complex systems
- Good problem-solving skills, underpinned by a methodical approach to technical challenges. This includes an ability to communicate effectively, demonstrating strong interpersonal skills, a sense of responsibility, and a commitment to driving projects to completion.
- Sense of ownership and drive
- Involvement in open source or personal projects showcasing a passion for innovation and collaboration
- Experience in High Frequency Trading, Quantitative Finance or working in low latency environment is advantageous but not a strict requirement
- Organized, responsible, and meticulous
- Strong communicator
- Proactive and willing to take initiative
- Able to manage and prioritize multiple tasks
- Excellent at supporting Linux Production environments
- Able to work both within a team and independently
Tower’s headquarters are in the historic Equitable Building, right in the heart of NYC’s Financial District and our impact is global, with over a dozen offices around the world.
At Tower, we believe work should be both challenging and enjoyable. That is why we foster a culture where smart, driven people thrive – without the egos. Our open concept workplace, casual dress code, and well-stocked kitchens reflect the value we place on a friendly, collaborative environment where everyone is respected, and great ideas win.
Our benefits include:
- Generous paid time off policies
- Savings plans and other financial wellness tools available in each region
- Hybrid working opportunities
- Free breakfast, lunch and snacks daily
- In-office wellness experiences and reimbursement for select wellness expenses (e.g., gym, personal training and more)
- Company-sponsored sports teams and fitness events (JPM Corporate Challenge, Cycle for Survival, Wall Street Rides FAR and more)
- Volunteer opportunities and charitable giving
- Social events, happy hours, treats and celebrations throughout the year
- Workshops and continuous learning opportunities
Tower Research Capital is an equal opportunity employer. Seniority level
- Seniority level Mid-Senior level
- Employment type Full-time
- Job function Engineering and Information Technology
Referrals increase your chances of interviewing at Tower Research Capital by 2x
Get notified about new Site Reliability Engineer jobs in Singapore .
Internship, Technology (Full Stack Developer) May/June - December 2025 Software Developer (Early Career/Young Talent Program) Frontend Software Engineer, Data Platform - 2025 Start Frontend Software Engineer, Data Platform - 2025 Start Project Intern, Digital Innovations & Solutions (Full Stack Developer) Frontend Engineer-Search - Singapore-2025 Start Backend Software Engineer, TikTok - Singapore Internship, Technology (ML/Data Engineer) May/June - December 2025We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-LjbffrSite Reliability Engineer
Posted 17 days ago
Job Viewed
Job Description
About NetEase Games:
As a leading internet technology company based in China, NetEase, Inc. (NASDAQ: NTES and HKEX:999, “NetEase”) provides premium online services centered around content creation. With extensive offerings across its expanding gaming ecosystem, the Company develops and operates some of China’s most popular and longest-running mobile and PC games. Powered by industry-leading in-house R&D capabilities in China and globally, NetEase creates superior gaming experiences, inspires players, and passionately delivers value for its thriving community worldwide. By infusing play with culture and education with technology, NetEase transforms gaming into a meaningful vehicle to build a more entertaining and enlightened world.
NetEase’s ESG initiatives are among the best in the global media and entertainment industry, earning it a distinction as one of the S&P Global Industry Movers and an “A” rating from MSCI. For more information, please visit:
Job Description:
- Site Reliability Engineering (SRE) refers to using software engineering methods to manage systems, solve problems, and achieve operational automation to reduce trivial tasks and improve service availability. Responsibilities include but are not limited to:
- Manage the operational work of NetEase Interactive Entertainment services, such as Eggy Party, Marvel Rivals, UU Accelerator, Ace Racer, and other online services, as well as internal research projects.
- Design and select basic runtime environments (including servers, virtualization, cloud services, networks, databases, etc.) for game servers based on different games' service architecture, performance requirements, and business conditions, providing high-quality and efficient operational services at controllable costs.
- Establish and monitor various operational metrics and customize data analysis standards.
- Collaborate with product departments to identify issues, optimize technical architecture, and enhance user experience based on game and infrastructure conditions.
- Participate in in-depth research on cutting-edge open-source software, virtualization, databases, and web services, and develop technical solutions for business implementation.
Job Requirements:
- Bachelor's degree or above, majors in computer science, networking, communications, automation, or related fields are preferred.
- Familiar with the Linux operating system; knowledgeable about computer network architectures and common network protocols such as TCP/IP and HTTP.
- Proficient in at least one programming language, including but not limited to C/C++, Shell, Python, Golang, Rust, or Java.
- Passionate about open-source; experience or knowledge in open-source software such as Linux, Nginx, MySQL, K8S, and Istio is preferred.
- Strong logical thinking, communication, and learning abilities; adept at research and problem-solving.
- Skilled at teamwork, with a strong sense of collective honor, responsibility, and service awareness.
- Open to trying new things, with excellent problem-solving skills and strong technical sensitivity; experience in contributing to open-source communities is a plus.
- Proficiency in Chinese is required for this role, as daily communication and collaboration with key stakeholders and team members based in China are essential to the responsibilities of the position.