All roles

Monitoring and Incident Response Manager

Remote · USA Full-time New today

Description Description At The One 23 Group, our mission is to set the benchmark for excellence in government services. We empower our clients in the Department of War, Intelligence Community, and Federal Civilian sectors to excel with our advanced capabilities. Our dedication lies in fostering a people-first culture, underpinned by steadfast ethical principles. Embracing innovative technologies and process improvements, we are steadfast in our journey toward a future that is both bright and transformative. Our expertise spans Enterprise IT, Mission IT and Cyber. With our global footprint, we place a strong emphasis on nurturing our people and culture, which forms the core of our successful strategies in leadership and financial management. We pride ourselves on our extensive experience and effective approach, ensuring that we lead with both innovation and integrity. The Monitoring and Incident Response Manager is responsible for leading the Monitoring and Incident Response Team (MIRT) and overseeing real-time monitoring, incident response, and operational support for an enterprise network environment. This role ensures continuous monitoring of infrastructure, rapid response to operational incidents, and effective coordination across engineering, security, and operations teams. The Manager provides leadership for a 24x7x365 monitoring environment, ensuring the availability, performance, and security of enterprise systems. The position is responsible for incident management processes, operational coordination, and performance oversight of monitoring personnel while ensuring compliance with operational procedures and government requirements.

Requirements

Operations Oversight

  • Lead and manage the Monitoring and Incident Response Team (MIRT) supporting enterprise network operations.
  • Provide operational oversight for 24x7 monitoring and incident response activities.
  • Supervise monitoring specialists and ensure coverage across operational shifts.
  • Establish operational priorities and coordinate response activities during incidents affecting enterprise systems.

Network and Service Monitoring

  • Continuously monitor network infrastructure, applications, and services to ensure system availability and performance.
  • Monitor alerts generated by enterprise monitoring platforms and respond to operational events.
  • Track network performance metrics and identify anomalies or potential service disruptions.
  • Monitor enterprise infrastructure including routers, switches, firewalls, load balancers, and WAN circuits.

Incident Response and Troubleshooting

  • Investigate alerts related to network outages, service degradation, and security events.
  • Perform initial triage and root cause analysis of incidents affecting network or application services.
  • Troubleshoot connectivity issues and coordinate resolution with network engineering, security, and application teams.
  • Escalate critical incidents to appropriate support teams based on severity and impact.

Network Infrastructure Support

  • Diagnose issues related to enterprise networking equipment including routers, switches, firewalls, and load balancers.
  • Assist with configuration updates and operational changes under established change management processes.
  • Utilize packet capture and network diagnostic tools to troubleshoot network anomalies.

Incident Documentation and Reporting

  • Document incidents, troubleshooting actions, and resolution steps within the IT service management (ITSM) system.
  • Maintain detailed incident logs and operational reports for network and infrastructure events.
  • Provide updates to stakeholders regarding incident status, impact, and resolution timelines.

Operational Monitoring and Alert Management

  • Monitor enterprise systems for health metrics including:
  • Network availability
  • CPU utilization
  • Memory usage
  • Interface performance
  • System alerts and alarms
  • Investigate monitoring alerts and perform operational response procedures.

Required Qualifications

  • Public Trust
  • Minimum 7 years of experience supporting network operations, IT infrastructure monitoring, or incident response.
  • Experience working in enterprise IT environments supporting network or infrastructure operations.

Apply tot his job Apply To this Job

Related roles

[Hiring] Transplant Quality Manager @WVU Medicine

Remote · USA Full-time

Sr. Director, Clinical and Regulatory Writing

Remote · USA Full-time

Senior Medical Writer - Regulatory Documents - CSR /Protocol - Late Phase

Remote · USA Full-time

Care Coordination (RN) – REMOTE, Compact TX

Remote · USA Full-time

Senior Clinical Trial Manager (Sponsor-Dedicated, Remote - US)

Remote · USA Full-time

AWS Data Cloud Consultant

Remote · USA Full-time

Cloud & DevOps Engineer - Virtual

Remote · USA Full-time

Principal Cloud Developer - ISV Engeinering

Remote · USA Full-time

Remote Software Developer with Cloud

Remote · USA Full-time

Python & Cloud Developer (Unpaid, Remote)

Remote · USA Full-time

Maintenance Utility Employee (MUE 1) - Motorized Role at Delta Air Lines: Join Our Team at JFK Airport

Remote · USA Full-time

Senior Customer Service Representative - Remote in CST OR MST

Remote · USA Full-time

Experienced Customer Service Representative – Remote & Flexible Work Opportunities at arenaflex

Remote · USA Full-time

Experienced Customer Support Associate – Remote, arenaflex – Starting at $19/hr, No Educational Requirements

Remote · USA Full-time

Account Support Analyst

Remote · USA Full-time

Temporary Call Center Representative (Work at Home)

Remote · USA Full-time

Experienced Remote Live Chat Assistant – Delivering Exceptional Customer Support through Innovative Live Chat Solutions at arenaflex

Remote · USA Full-time

Part-Time Data Entry Specialist – Work from Home Opportunity

Remote · USA Full-time

Associate, Conferences & Events

Remote · USA Full-time

Experienced Customer Support Expert – Delivering Exceptional Experiences for arenaflex Entrepreneurs

Remote · USA Full-time