Incident Response Specialist

Singapore, Singapore beBeeIncidentResponse

Posted today

Job Viewed

Tap Again To Close

Job Description

Job Title: Incident Response Specialist

We are seeking a highly skilled Incident Response Specialist to join our team. The successful candidate will be responsible for delivering exceptional support to our data centre operations, responding to alarms and alerts, and providing detailed records of incident handling activities.

Key Responsibilities:

  • Deliver high-quality data centre operations support across multiple locations
  • Respond to all alarms and alerts in real-time
  • Provide deep understanding and intelligence of critical incidents to resolver groups
  • Ensure accurate and detailed records of alarm handling activities
  • Support program managers and facilitate project deliverables
  • Conduct root cause analysis to determine recurring problems
  • Employ questioning and analysis techniques to determine underlying causes

Requirements:

  • Diploma or degree in Information Technology
  • 2+ years of experience in command center, service center, or similar 24x7 operations environment
  • Ability to quickly triage multiple incidents and assign priority based on risk and confidence levels
  • Knowledge of technical elements associated with systems such as IP Networks, DC Environment, and Server Health
  • Excellent verbal and written communication skills, ability to work with minimal direction, meet goals, and maintain attention to detail
  • Ability to successfully interact at all levels of the organization, including with clients, while functioning as a team player
  • Basic working knowledge of data protection policies such as GDPR and the need to keep sensitive information secure
This advertiser has chosen not to accept applicants from your region.

Incident Response Expert

Singapore, Singapore beBeeCyber

Posted today

Job Viewed

Tap Again To Close

Job Description

Cyber Response Specialist

This role is an excellent opportunity to join a dynamic team and utilize your technical skills to assist clients in responding to cyber security incidents. As a Cyber Response Specialist, you will play a key part in helping clients manage the aftermath of a cyber attack, including identifying the root cause of the incident, containing the breach, and providing recommendations for future prevention.

Responsibilities
  • Manage the response to cyber security incident response engagements
  • Help with commercial proposals to clients based on existing templates
  • Coach and manage colleagues
  • Be hands-on as required to deliver incident response & digital forensics engagements
  • Work directly with end clients and help them deal with their cyber security challenges

We value flexibility and work-life balance, offering our staff the ability to set their working hours through our 'Take Charge' program. This role requires self-motivation, strong desire to learn, and willingness to share knowledge with the team. If you are a driven individual with experience in technical roles in cyber security and a passion for Incident Response and Digital Forensics, we would like to hear from you.

This advertiser has chosen not to accept applicants from your region.

Incident Response Analyst

Singapore, Singapore ASTREYA ASIA PACIFIC PTE. LIMITED

Posted today

Job Viewed

Tap Again To Close

Job Description

Responsibilities:

• Deliver data centre operations support across multiple data centres

• Respond to all alarms/alerts set in Data Center Infrastructure Management (DCIM), Server Automation Operations System (SAOS), CCTV, Access Control Systems (ACS), and other functions (EHS, Security, etc),

• Provide deep understanding and intelligence of the criticality and impact of the incidents to the resolver groups.

• Ensure detailed records of alarm handling activities, including actions taken, resolutions in ticketing tools and file incident reports.

• Be available to coordinate as an incident commander in event of an issue.

• Support program managers and facilitate project deliverables, improve overall operational and engineering initiatives.

• Conduct root cause analysis (RCA) to determine recurring problems to their source.

• Employ in-depth questioning and analysis techniques such as five whys to determine the underlying cause of the incident or problem.

• Handle ticketing system

• Perform duties in compliance with SOP.
Requirements:

• Diploma/Degree in Information Technology.

• 2 years+ experience in command center, service center, or similar 24x7 operations center environment

• Ability to quickly triage multiple incidents and assign the right priority based on risk and confidence levels

• Knowledge of technical elements associated with systems such as IP Networks, DC Environment and Server Health.

• Outstanding verbal and written communication skills required, work with minimal direction, meeting goals, attention to details and an eye for continuous improvements

• Ability to successfully interact at all levels of the organization, including with clients, while functioning as a team player required.

• Basic working knowledge of data protection policies such as GDPR and the need to keep sensitive information secure.
This advertiser has chosen not to accept applicants from your region.

Incident Response Analyst

Singapore, Singapore ASTREYA ASIA PACIFIC PTE. LIMITED

Posted today

Job Viewed

Tap Again To Close

Job Description

Roles & Responsibilities

Responsibilities:


• Deliver data centre operations support across multiple data centres


• Respond to all alarms/alerts set in Data Center Infrastructure Management (DCIM), Server Automation Operations System (SAOS), CCTV, Access Control Systems (ACS), and other functions (EHS, Security, etc),


• Provide deep understanding and intelligence of the criticality and impact of the incidents to the resolver groups.


• Ensure detailed records of alarm handling activities, including actions taken, resolutions in ticketing tools and file incident reports.


• Be available to coordinate as an incident commander in event of an issue.


• Support program managers and facilitate project deliverables, improve overall operational and engineering initiatives.


• Conduct root cause analysis (RCA) to determine recurring problems to their source.


• Employ in-depth questioning and analysis techniques such as five whys to determine the underlying cause of the incident or problem.


• Handle ticketing system


• Perform duties in compliance with SOP.

Requirements:


• Diploma/Degree in Information Technology.


• 2 years+ experience in command center, service center, or similar 24x7 operations center environment


• Ability to quickly triage multiple incidents and assign the right priority based on risk and confidence levels


• Knowledge of technical elements associated with systems such as IP Networks, DC Environment and Server Health.


• Outstanding verbal and written communication skills required, work with minimal direction, meeting goals, attention to details and an eye for continuous improvements


• Ability to successfully interact at all levels of the organization, including with clients, while functioning as a team player required.


• Basic working knowledge of data protection policies such as GDPR and the need to keep sensitive information secure.

Tell employers what skills you have

Switches
Troubleshooting
Incident Response
Hardware
Ticketing
Data Center
Root Cause Analysis
Information Technology
Access Control
CCTV
IP
Networking
Attention to Details
network servers
Routers
Cabling
Security Incident Response
This advertiser has chosen not to accept applicants from your region.

Security Incident Response Specialist

Singapore, Singapore beBeeIncident

Posted today

Job Viewed

Tap Again To Close

Job Description

Security Monitoring and Response

This is a full-time on-site position for a Security Analyst located in Singapore.

The successful candidate will be responsible for monitoring and analyzing security events, identifying potential threats, conducting investigations, and responding to security incidents.

Key Responsibilities:

  • Monitoring and Analyzing Security Events
  • Identifying Potential Threats and Conducting Investigations
  • Responding to Security Incidents
  • Working Closely with the End Client SOC Team
  • Supporting Security Developments
  • Ensuring Infrastructure Related Procedures are Reviewed Annually

The ideal candidate will have a strong background in computer studies and at least 2 years of experience working in a security operation, incident response, vulnerability management or related field.

Demonstrated Experience:

  • Network Security
  • Server Security
  • Endpoint Security
  • Web Security

Required Skills:

  • Strong Knowledge of Security Principles and Technologies
  • Excellent Analytical and Problem-Solving Skills
  • Ability to Quickly Identify and Respond to Security Incidents

Benefits:

The successful candidate will also possess a strong knowledge of security principles, technologies, and industry best practices. Hands-on experience in security appliances such as IBM Guardium, MMVS, Carbon Black, Cyber Ark is a definite advantage.

This advertiser has chosen not to accept applicants from your region.

Incident Response Manager, Singapore

Singapore, Singapore Blackpanda

Posted 11 days ago

Job Viewed

Tap Again To Close

Job Description

Join to apply for the Incident Response Manager, Singapore role at Blackpanda

About Blackpanda

Blackpanda is Asia’s premier cyber crisis response firm, founded by former military special operations forces and cyber defense experts. Headquartered in Singapore, we specialize in incident response and digital forensics across the Asia-Pacific region.

Our mission is to democratize cyber resilience by providing organizations of all sizes with top-tier incident response and insurance solutions, enabling secure and sustainable operations in an increasingly hostile digital landscape.

We are building the world’s first Cyber In-Extremis Force (CIF), an elite digital crisis response unit inspired by Tier-1 military capabilities. Our team doesn’t just respond to cyber threats; we dominate them.

Core Values

  • Sincerity: We deliver on our commitments.
  • Unity: We debate fiercely but act together.
  • Efficiency: We prioritize timing, precision, and leverage.
  • Humility: We listen, act, and learn.
  • Grit: We face setbacks head-on and persevere.

Your Mission: Incident Responder

You will manage cybersecurity investigations, coordinate with stakeholders, and guide organizations through digital crises. The role requires technical expertise, strategic leadership, and calmness under pressure.

Not all cases are complex; some are routine or chaotic. Your goal is to bring clarity, lead calmly, and transform disorder into operational control. You must be adaptable, open-minded, and solution-focused. Blackpanda emphasizes speed, precision, and decisive action, training relentlessly to deliver excellence.

Core Responsibilities

  • Lead high-stakes cyber incident investigations for rapid containment and recovery.
  • Analyze forensic artifacts, attacker TTPs, and malware across various infrastructures.
  • Perform DFIR operations including disk imaging, memory analysis, and threat hunting.
  • Use scripting (Python, Bash, PowerShell) to automate workflows and simulate adversaries.
  • Communicate findings clearly to clients, executives, and regulators.
  • Collaborate with engineering teams to improve tools and processes.
  • Coordinate with legal, insurance, vendor, and government stakeholders.
  • Assist sales teams with technical insights for engagements.
  • Mentor junior analysts to uphold operational standards.

Minimum Requirements

  • Open only to Singaporean candidates.
  • 3+ years in cybersecurity incident response, security operations, digital forensics, or threat intelligence.
  • Strong technical knowledge of enterprise networks, security architecture, and cloud environments.
  • Experience with key DFIR tools (Splunk, ELK, SentinelOne, etc.).
  • Proficiency with Windows, Linux, and macOS investigations.
  • Scripting skills in Python, Bash, or PowerShell.
  • Calm, decisive leadership in high-pressure IR engagements.
  • Ability to communicate complex findings to senior stakeholders.

Preferred Qualifications

  • Certifications like GCFA, GNFA, GREM, OSCP.
  • Experience with ransomware, dark web intelligence, or threat actor tracking.
  • OT/ICS experience in critical infrastructure environments.
  • Building cybersecurity tools or contributing to open-source projects.
  • Military, law enforcement, or intelligence backgrounds are a plus.

We value mindset: grit, discipline, and calmness under fire. If you’re experienced, resilient, and eager to push further, join us.

Seniority level
  • Mid-Senior level
Employment type
  • Full-time
Job function
  • Information Technology
Industries
  • Computer and Network Security
#J-18808-Ljbffr
This advertiser has chosen not to accept applicants from your region.

Manager, Incident Response & Management

Singapore, Singapore Refine Group

Posted 18 days ago

Job Viewed

Tap Again To Close

Job Description

Who we are
About Stripe

Stripe is a financial infrastructure platform for businesses. Millions of companies—from the world’s largest enterprises to the most ambitious startups—use Stripe to accept payments, grow their revenue, and accelerate new business opportunities. Our mission is to increase the GDP of the internet, and we have a staggering amount of work ahead. That means you have an unprecedented opportunity to put the global economy within everyone’s reach while doing the most important work of your career.

About the team

The Incident Response team is a global 24/7 team responsible for driving incident response and management from detection to resolution. Stripe is proud of its five 9s API reliability and this team is at the forefront of ensuring we keep it that way - working hand-in-hand with Reliability Eng and across the Tech Org. This team of incident response managers (IRM) is defined by our sense of ownership and how we drive incidents to resolution - marshaling the necessary cross-functional resources to respond to and resolve service outages, critical bugs, security attacks and anything that significantly impacts the users of our products. The team is user-first and ensures appropriate external communications from Stripe and senior management to keep our users informed of disruption to their experience of Stripe. The team is highly skilled in incident troubleshooting, program management, incident classifications, incident communications, incident escalation and technical adeptness as incidents can arise from anywhere and cut across products and orgs in Stripe.

What you’ll do

This position entails leading and optimizing Stripe's incident management processes and automation, ensuring efficiency and adherence to stringent incident response metrics. As the head of the incident response team, you will establish and maintain a best-in-class incident response framework, upholding the reliability standards expected of Stripe. Responsibilities include but are not limited to incident classification, escalation, and notification management, along with accountability for key incident response metrics (TTx). You will generate actionable insights to drive continuous improvement, collaborating with engineering leadership to refine incident detection, response, user communication, and tooling efficacy. Leadership and development of a highly effective 24/7 global incident response management team, characterized by urgency, programmatic ownership of incidents and communications, and the capacity to engage engineering teams, are crucial. Additionally, you will manage incident communications across multiple channels for executive and end-user audiences, and identify automation opportunities to streamline incident response workflows, thereby safeguarding users and minimizing disruption to their operations.

Responsibilities
  • Lead the global 24/7 team of regional managers and incident response managers with ability to be hands-on and support frontline on-call with speed, cross-functional collaboration and escalation
  • Develop and own Stripe's incident response and management strategy and cross-functional roadmap, ensuring it aligns with the company's reputation for reliability.
  • Spearhead and manage Stripe's AI-First strategy for automation of incident response workflows, partnering with the engineering team to implement required tooling enhancements.
  • Enhance Stripe's incident response by leading and implementing improvements derived from analyzing user-facing incidents and extracting actionable insights and learnings.
  • Collaborate closely with executive leadership, engineering, and operations teams to lead significant programs and reshape workflows and metrics concerning reliability and incident operations.
  • Manage relevant TTx metrics, particularly those related to communication and escalation. Collaborate with engineering leadership to implement necessary improvements for each metric.
  • Develop user-focused metrics and data to guide Stripe's incident response, reliability strategy, and user communications (including RCAs), ensuring impactful decision-making.
Who you are

We’re looking for someone who meets the minimum requirements to be considered for the role. If you meet these requirements, you are encouraged to apply. The preferred qualifications are a bonus, not a requirement.

Minimum requirements
  • 5+ years of management experience, including 2+ years of experience managing managers with a proven record in building, growing and transforming teams.
  • Extensive experience (4+ years) leading incident response for complex, large-scale distributed services with high SLOs/SLAs, coupled with deep expertise in crisis management.
  • Demonstrated ability to lead, influence other leaders and deliver complex strategic projects involving multiple stakeholders
  • Strong analytical skills, and the ability to use data to drive business decisions
  • Possesses proficiency in basic incident troubleshooting and a reasonable understanding of system architecture. Fluent in using SQL, Splunk, or similar query languages.
  • Exceptional communication abilities, capable of adapting incident updates for diverse audiences (executives, external users, internal teams).
  • Affinity for a fast paced work environment, crafting strategic and rapid fixes to high intensity problems with a keen eye for detail and a high bar for quality
  • Comfort navigating ambiguity, while identifying areas for process improvement and establishing best practices
Preferred qualifications
  • Experience managing geographically dispersed teams
  • Experience using infrastructure and application monitoring tools such as Prometheus, Sentry and others
  • Experience in incident response at a high-growth technology company, preferably within the payments or e-commerce sectors.
  • Proven ability to apply Agentic and Generative AI to revolutionize incident response, coupled with a strong grasp of current industry trends in the incident response domain.
  • Demonstrated history of driving engineering and process enhancements to improve incident response efficiency within a rapidly expanding technology organization.

#J-18808-Ljbffr
This advertiser has chosen not to accept applicants from your region.
Be The First To Know

About the latest Incident response Jobs in Singapore !

Manager, Incident Response & Management

Singapore, Singapore Refine Group

Posted 18 days ago

Job Viewed

Tap Again To Close

Job Description

Who we are
About Stripe

Stripe is a financial infrastructure platform for businesses. Millions of companies—from the world’s largest enterprises to the most ambitious startups—use Stripe to accept payments, grow their revenue, and accelerate new business opportunities. Our mission is to increase the GDP of the internet, and we have a staggering amount of work ahead. That means you have an unprecedented opportunity to put the global economy within everyone’s reach while doing the most important work of your career.

About the team

The Incident Response team is a global 24/7 team responsible for driving incident response and management from detection to resolution. Stripe is proud of its five 9s API reliability and this team is at the forefront of ensuring we keep it that way - working hand-in-hand with Reliability Eng and across the Tech Org. This team of incident response managers (IRM) is defined by our sense of ownership and how we drive incidents to resolution - marshaling the necessary cross-functional resources to respond to and resolve service outages, critical bugs, security attacks and anything that significantly impacts the users of our products. The team is user-first and ensures appropriate external communications from Stripe and senior management to keep our users informed of disruption to their experience of Stripe. The team is highly skilled in incident troubleshooting, program management, incident classifications, incident communications, incident escalation and technical adeptness as incidents can arise from anywhere and cut across products and orgs in Stripe.

What you’ll do

This position entails leading and optimizing Stripe's incident management processes and automation, ensuring efficiency and adherence to stringent incident response metrics. As the head of the incident response team, you will establish and maintain a best-in-class incident response framework, upholding the reliability standards expected of Stripe. Responsibilities include but are not limited to incident classification, escalation, and notification management, along with accountability for key incident response metrics (TTx). You will generate actionable insights to drive continuous improvement, collaborating with engineering leadership to refine incident detection, response, user communication, and tooling efficacy. Leadership and development of a highly effective 24/7 global incident response management team, characterized by urgency, programmatic ownership of incidents and communications, and the capacity to engage engineering teams, are crucial. Additionally, you will manage incident communications across multiple channels for executive and end-user audiences, and identify automation opportunities to streamline incident response workflows, thereby safeguarding users and minimizing disruption to their operations.

Responsibilities
  • Lead the global 24/7 team of regional managers and incident response managers with ability to be hands-on and support frontline on-call with speed, cross-functional collaboration and escalation
  • Develop and own Stripe's incident response and management strategy and cross-functional roadmap, ensuring it aligns with the company's reputation for reliability.
  • Spearhead and manage Stripe's AI-First strategy for automation of incident response workflows, partnering with the engineering team to implement required tooling enhancements.
  • Enhance Stripe's incident response by leading and implementing improvements derived from analyzing user-facing incidents and extracting actionable insights and learnings.
  • Collaborate closely with executive leadership, engineering, and operations teams to lead significant programs and reshape workflows and metrics concerning reliability and incident operations.
  • Manage relevant TTx metrics, particularly those related to communication and escalation. Collaborate with engineering leadership to implement necessary improvements for each metric.
  • Develop user-focused metrics and data to guide Stripe's incident response, reliability strategy, and user communications (including RCAs), ensuring impactful decision-making.
Who you are

We’re looking for someone who meets the minimum requirements to be considered for the role. If you meet these requirements, you are encouraged to apply. The preferred qualifications are a bonus, not a requirement.

Minimum requirements
  • 5+ years of management experience, including 2+ years of experience managing managers with a proven record in building, growing and transforming teams.
  • Extensive experience (4+ years) leading incident response for complex, large-scale distributed services with high SLOs/SLAs, coupled with deep expertise in crisis management.
  • Demonstrated ability to lead, influence other leaders and deliver complex strategic projects involving multiple stakeholders
  • Strong analytical skills, and the ability to use data to drive business decisions
  • Possesses proficiency in basic incident troubleshooting and a reasonable understanding of system architecture. Fluent in using SQL, Splunk, or similar query languages.
  • Exceptional communication abilities, capable of adapting incident updates for diverse audiences (executives, external users, internal teams).
  • Affinity for a fast paced work environment, crafting strategic and rapid fixes to high intensity problems with a keen eye for detail and a high bar for quality
  • Comfort navigating ambiguity, while identifying areas for process improvement and establishing best practices
Preferred qualifications
  • Experience managing geographically dispersed teams
  • Experience using infrastructure and application monitoring tools such as Prometheus, Sentry and others
  • Experience in incident response at a high-growth technology company, preferably within the payments or e-commerce sectors.
  • Proven ability to apply Agentic and Generative AI to revolutionize incident response, coupled with a strong grasp of current industry trends in the incident response domain.
  • Demonstrated history of driving engineering and process enhancements to improve incident response efficiency within a rapidly expanding technology organization.

#J-18808-Ljbffr
This advertiser has chosen not to accept applicants from your region.

Manager, Incident Response & Management

Singapore, Singapore Stripe

Posted 18 days ago

Job Viewed

Tap Again To Close

Job Description

Stripe is a financial infrastructure platform for businesses. Millions of companies—from the world’s largest enterprises to the most ambitious startups—use Stripe to accept payments, grow their revenue, and accelerate new business opportunities. Our mission is to increase the GDP of the internet, and we have a staggering amount of work ahead. That means you have an unprecedented opportunity to put the global economy within everyone’s reach while doing the most important work of your career.

About the team

The Incident Response team is a global 24/7 team responsible for driving incident response and management from detection to resolution. Stripe is proud of its five 9s API reliability and this team is at the forefront of ensuring we keep it that way - working hand-in-hand with Reliability Eng and across the Tech Org. This team of incident response managers (IRM) is defined by our sense of ownership and how we drive incidents to resolution - marshaling the necessary cross-functional resources to respond to and resolve service outages, critical bugs, security attacks and anything that significantly impacts the users of our products. The team is user-first and ensures appropriate external communications from Stripe and senior management to keep our users informed of disruption to their experience of Stripe. The team is highly skilled in incident troubleshooting, program management, incident classifications, incident communications, incident escalation and technical adeptness as incidents can arise from anywhere and cut across products and orgs in Stripe.

What you’ll do

This position entails leading and optimizing Stripe's incident management processes and automation, ensuring efficiency and adherence to stringent incident response metrics. As the head of the incident response team, you will establish and maintain a best-in-class incident response framework, upholding the reliability standards expected of Stripe. Responsibilities include but are not limited to incident classification, escalation, and notification management, along with accountability for key incident response metrics (TTx). You will generate actionable insights to drive continuous improvement, collaborating with engineering leadership to refine incident detection, response, user communication, and tooling efficacy. Leadership and development of a highly effective 24/7 global incident response management team, characterized by urgency, programmatic ownership of incidents and communications, and the capacity to engage engineering teams, are crucial. Additionally, you will manage incident communications across multiple channels for executive and end-user audiences, and identify automation opportunities to streamline incident response workflows, thereby safeguarding users and minimizing disruption to their operations.

Responsibilities
  • Lead the global 24/7 team of regional managers and incident response managers with ability to be hands-on and support frontline on-call with speed, cross-functional collaboration and escalation
  • Develop and own Stripe's incident response and management strategy and cross-functional roadmap, ensuring it aligns with the company's reputation for reliability.
  • Spearhead and manage Stripe's AI-First strategy for automation of incident response workflows, partnering with the engineering team to implement required tooling enhancements.
  • Enhance Stripe's incident response by leading and implementing improvements derived from analyzing user-facing incidents and extracting actionable insights and learnings.
  • Collaborate closely with executive leadership, engineering, and operations teams to lead significant programs and reshape workflows and metrics concerning reliability and incident operations.
  • Manage relevant TTx metrics, particularly those related to communication and escalation. Collaborate with engineering leadership to implement necessary improvements for each metric.
  • Develop user-focused metrics and data to guide Stripe's incident response, reliability strategy, and user communications (including RCAs), ensuring impactful decision-making.
Who you are

We’re looking for someone who meets the minimum requirements to be considered for the role. If you meet these requirements, you are encouraged to apply. The preferred qualifications are a bonus, not a requirement.

  • 5+ years of management experience, including 2+ years of experience managing managers with a proven record in building, growing and transforming teams.
  • Extensive experience (4+ years) leading incident response for complex, large-scale distributed services with high SLOs/SLAs, coupled with deep expertise in crisis management.
  • Demonstrated ability to lead, influence other leaders and deliver complex strategic projects involving multiple stakeholders
  • Strong analytical skills, and the ability to use data to drive business decisions
  • Possesses proficiency in basic incident troubleshooting and a reasonable understanding of system architecture. Fluent in using SQL, Splunk, or similar query languages.
  • Exceptional communication abilities, capable of adapting incident updates for diverse audiences (executives, external users, internal teams).
  • Affinity for a fast paced work environment, crafting strategic and rapid fixes to high intensity problems with a keen eye for detail and a high bar for quality
  • Comfort navigating ambiguity, while identifying areas for process improvement and establishing best practices
Preferred qualifications
  • Experience managing geographically dispersed teams
  • Experience using infrastructure and application monitoring tools such as Prometheus, Sentry and others
  • Experience in incident response at a high-growth technology company, preferably within the payments or e-commerce sectors.
  • Proven ability to apply Agentic and Generative AI to revolutionize incident response, coupled with a strong grasp of current industry trends in the incident response domain.
  • Demonstrated history of driving engineering and process enhancements to improve incident response efficiency within a rapidly expanding technology organization.
Office-assigned Stripes spend at least 50% of the time in a given month in their local office or with users. This hits a balance between bringing people together for in-person collaboration and learning from each other, while supporting flexibility about how to do this in a way that makes sense for individuals and their teams.

The annual salary range for this role in the primary location is S$208,000 - S$312,000. This range may change if you are hired in another location. For sales roles, the range provided is the role’s On Target Earnings (“OTE”) range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role. This salary range may be inclusive of several career levels at Stripe and will be narrowed during the interview process based on a number of factors, including the candidate’s experience, qualifications, and specific location. Applicants interested in this role and who are not located in the primary location may request the annual salary range for their location during the interview process.

Specific benefits and details about what compensation is included in the salary range listed above will vary depending on the applicant’s location and can be discussed in more detail during the interview process. Benefits/additional compensation for this role may include: equity, company bonus or sales commissions/bonuses; retirement plans; health benefits; and wellness stipends.

At Stripe, we're looking for people with passion, grit, and integrity. You're encouraged to apply even if your experience doesn't precisely match the job description. Your skills and passion will stand out—and set you apart—especially if your career has taken some extraordinary twists and turns. At Stripe, we welcome diverse perspectives and people who think rigorously and aren't afraid to challenge assumptions. Join us.

#J-18808-Ljbffr
This advertiser has chosen not to accept applicants from your region.

Manager, Incident Response & Management

Singapore, Singapore Stripe

Posted 18 days ago

Job Viewed

Tap Again To Close

Job Description

Stripe is a financial infrastructure platform for businesses. Millions of companies—from the world’s largest enterprises to the most ambitious startups—use Stripe to accept payments, grow their revenue, and accelerate new business opportunities. Our mission is to increase the GDP of the internet, and we have a staggering amount of work ahead. That means you have an unprecedented opportunity to put the global economy within everyone’s reach while doing the most important work of your career.

About the team

The Incident Response team is a global 24/7 team responsible for driving incident response and management from detection to resolution. Stripe is proud of its five 9s API reliability and this team is at the forefront of ensuring we keep it that way - working hand-in-hand with Reliability Eng and across the Tech Org. This team of incident response managers (IRM) is defined by our sense of ownership and how we drive incidents to resolution - marshaling the necessary cross-functional resources to respond to and resolve service outages, critical bugs, security attacks and anything that significantly impacts the users of our products. The team is user-first and ensures appropriate external communications from Stripe and senior management to keep our users informed of disruption to their experience of Stripe. The team is highly skilled in incident troubleshooting, program management, incident classifications, incident communications, incident escalation and technical adeptness as incidents can arise from anywhere and cut across products and orgs in Stripe.

What you’ll do

This position entails leading and optimizing Stripe's incident management processes and automation, ensuring efficiency and adherence to stringent incident response metrics. As the head of the incident response team, you will establish and maintain a best-in-class incident response framework, upholding the reliability standards expected of Stripe. Responsibilities include but are not limited to incident classification, escalation, and notification management, along with accountability for key incident response metrics (TTx). You will generate actionable insights to drive continuous improvement, collaborating with engineering leadership to refine incident detection, response, user communication, and tooling efficacy. Leadership and development of a highly effective 24/7 global incident response management team, characterized by urgency, programmatic ownership of incidents and communications, and the capacity to engage engineering teams, are crucial. Additionally, you will manage incident communications across multiple channels for executive and end-user audiences, and identify automation opportunities to streamline incident response workflows, thereby safeguarding users and minimizing disruption to their operations.

Responsibilities
  • Lead the global 24/7 team of regional managers and incident response managers with ability to be hands-on and support frontline on-call with speed, cross-functional collaboration and escalation
  • Develop and own Stripe's incident response and management strategy and cross-functional roadmap, ensuring it aligns with the company's reputation for reliability.
  • Spearhead and manage Stripe's AI-First strategy for automation of incident response workflows, partnering with the engineering team to implement required tooling enhancements.
  • Enhance Stripe's incident response by leading and implementing improvements derived from analyzing user-facing incidents and extracting actionable insights and learnings.
  • Collaborate closely with executive leadership, engineering, and operations teams to lead significant programs and reshape workflows and metrics concerning reliability and incident operations.
  • Manage relevant TTx metrics, particularly those related to communication and escalation. Collaborate with engineering leadership to implement necessary improvements for each metric.
  • Develop user-focused metrics and data to guide Stripe's incident response, reliability strategy, and user communications (including RCAs), ensuring impactful decision-making.
Who you are

We’re looking for someone who meets the minimum requirements to be considered for the role. If you meet these requirements, you are encouraged to apply. The preferred qualifications are a bonus, not a requirement.

  • 5+ years of management experience, including 2+ years of experience managing managers with a proven record in building, growing and transforming teams.
  • Extensive experience (4+ years) leading incident response for complex, large-scale distributed services with high SLOs/SLAs, coupled with deep expertise in crisis management.
  • Demonstrated ability to lead, influence other leaders and deliver complex strategic projects involving multiple stakeholders
  • Strong analytical skills, and the ability to use data to drive business decisions
  • Possesses proficiency in basic incident troubleshooting and a reasonable understanding of system architecture. Fluent in using SQL, Splunk, or similar query languages.
  • Exceptional communication abilities, capable of adapting incident updates for diverse audiences (executives, external users, internal teams).
  • Affinity for a fast paced work environment, crafting strategic and rapid fixes to high intensity problems with a keen eye for detail and a high bar for quality
  • Comfort navigating ambiguity, while identifying areas for process improvement and establishing best practices
Preferred qualifications
  • Experience managing geographically dispersed teams
  • Experience using infrastructure and application monitoring tools such as Prometheus, Sentry and others
  • Experience in incident response at a high-growth technology company, preferably within the payments or e-commerce sectors.
  • Proven ability to apply Agentic and Generative AI to revolutionize incident response, coupled with a strong grasp of current industry trends in the incident response domain.
  • Demonstrated history of driving engineering and process enhancements to improve incident response efficiency within a rapidly expanding technology organization.
Office-assigned Stripes spend at least 50% of the time in a given month in their local office or with users. This hits a balance between bringing people together for in-person collaboration and learning from each other, while supporting flexibility about how to do this in a way that makes sense for individuals and their teams.

The annual salary range for this role in the primary location is S$208,000 - S$312,000. This range may change if you are hired in another location. For sales roles, the range provided is the role’s On Target Earnings (“OTE”) range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role. This salary range may be inclusive of several career levels at Stripe and will be narrowed during the interview process based on a number of factors, including the candidate’s experience, qualifications, and specific location. Applicants interested in this role and who are not located in the primary location may request the annual salary range for their location during the interview process.

Specific benefits and details about what compensation is included in the salary range listed above will vary depending on the applicant’s location and can be discussed in more detail during the interview process. Benefits/additional compensation for this role may include: equity, company bonus or sales commissions/bonuses; retirement plans; health benefits; and wellness stipends.

At Stripe, we're looking for people with passion, grit, and integrity. You're encouraged to apply even if your experience doesn't precisely match the job description. Your skills and passion will stand out—and set you apart—especially if your career has taken some extraordinary twists and turns. At Stripe, we welcome diverse perspectives and people who think rigorously and aren't afraid to challenge assumptions. Join us.

#J-18808-Ljbffr
This advertiser has chosen not to accept applicants from your region.
 

Nearby Locations

Other Jobs Near Me

Industry

  1. request_quote Accounting
  2. work Administrative
  3. eco Agriculture Forestry
  4. smart_toy AI & Emerging Technologies
  5. school Apprenticeships & Trainee
  6. apartment Architecture
  7. palette Arts & Entertainment
  8. directions_car Automotive
  9. flight_takeoff Aviation
  10. account_balance Banking & Finance
  11. local_florist Beauty & Wellness
  12. restaurant Catering
  13. volunteer_activism Charity & Voluntary
  14. science Chemical Engineering
  15. child_friendly Childcare
  16. foundation Civil Engineering
  17. clean_hands Cleaning & Sanitation
  18. diversity_3 Community & Social Care
  19. construction Construction
  20. brush Creative & Digital
  21. currency_bitcoin Crypto & Blockchain
  22. support_agent Customer Service & Helpdesk
  23. medical_services Dental
  24. medical_services Driving & Transport
  25. medical_services E Commerce & Social Media
  26. school Education & Teaching
  27. electrical_services Electrical Engineering
  28. bolt Energy
  29. local_mall Fmcg
  30. gavel Government & Non Profit
  31. emoji_events Graduate
  32. health_and_safety Healthcare
  33. beach_access Hospitality & Tourism
  34. groups Human Resources
  35. precision_manufacturing Industrial Engineering
  36. security Information Security
  37. handyman Installation & Maintenance
  38. policy Insurance
  39. code IT & Software
  40. gavel Legal
  41. sports_soccer Leisure & Sports
  42. inventory_2 Logistics & Warehousing
  43. supervisor_account Management
  44. supervisor_account Management Consultancy
  45. supervisor_account Manufacturing & Production
  46. campaign Marketing
  47. build Mechanical Engineering
  48. perm_media Media & PR
  49. local_hospital Medical
  50. local_hospital Military & Public Safety
  51. local_hospital Mining
  52. medical_services Nursing
  53. local_gas_station Oil & Gas
  54. biotech Pharmaceutical
  55. checklist_rtl Project Management
  56. shopping_bag Purchasing
  57. home_work Real Estate
  58. person_search Recruitment Consultancy
  59. store Retail
  60. point_of_sale Sales
  61. science Scientific Research & Development
  62. wifi Telecoms
  63. psychology Therapy
  64. pets Veterinary
View All Incident Response Jobs View All Jobs in Singapore