
Practice Exercise: Incident Management

Scenario

You are now the system administrator. You need to perform scheduled server maintenance and monitoring tasks to ensure the servers are running smoothly. As part of this, you will run a script that checks resource usage and schedules a daily cron job for further maintenance.

Step 1: Initial Setup

Ensure that the scheduled cron job runs smoothly at 5 PM by completing the following step:
Execute the script inside the /Acad folder by running the following command from that folder:

./update.sh
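
For reference when checking that the job is scheduled for 5 PM, the sketch below prints what a daily 17:00 crontab entry would look like. It assumes the job runs /Acad/update.sh itself and lives in the current user's crontab; the exercise does not confirm either, and cron_line is a hypothetical helper, not part of update.sh.

```shell
#!/bin/sh
# Hypothetical helper: build a crontab line that runs a command once a day.
# $1 = hour in 24-hour format, $2 = command to run (assumed path below).
cron_line() {
    # crontab fields: minute hour day-of-month month day-of-week command
    printf '0 %s * * * %s\n' "$1" "$2"
}

# 5 PM is hour 17; the script path is an assumption for illustration.
cron_line 17 /Acad/update.sh
```

Running crontab -l lists the current user's entries, so the entry actually installed can be compared against this pattern.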

Step 2: Handling Webserver Errors

If you encounter an error on the webserver, promptly investigate the issue and create an incident ticket for further analysis and resolution. This will help ensure the smooth operation of the webserver and prevent any potential disruptions.

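Verify that critical_app is down, using curl or a browser. The command below prints the status line only when the server answers with HTTP/1.1 200 OK; no matching output, or a curl error message, means the application is not responding normally.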

curl -IsS http://"insert_hostname".acad.opswerks.net | grep "HTTP/1.1 200 OK"
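
If the grep check prints nothing, it can be useful to see which status code the server actually returned. The following is a minimal sketch, assuming curl is installed; classify_status and check_url are hypothetical helper names, and treating only 200 as "up" simply mirrors the grep pattern above.

```shell
#!/bin/sh
# Hypothetical helper: map an HTTP status code to "up" or "down".
# Only 200 counts as up, matching the grep pattern in the exercise.
classify_status() {
    case "$1" in
        200) echo "up" ;;
        *)   echo "down" ;;
    esac
}

# Hypothetical helper: send a HEAD request and classify the result.
# -o /dev/null discards the headers, -w prints only the status code,
# -m 10 gives up after 10 seconds. curl reports 000 when it received
# no HTTP response at all (DNS failure, refused connection, timeout).
check_url() {
    code=$(curl -s -I -o /dev/null -m 10 -w '%{http_code}' "$1" 2>/dev/null)
    classify_status "$code"
}

# Example call (replace the placeholder with your hostname):
#   check_url "http://insert_hostname.acad.opswerks.net"
```

A result of "down" together with the raw status code or curl error is worth including in the incident ticket description.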

Step 3: Create an Incident Ticket for the Web App

Using your personal account, create a new incident ticket with the following details:

  • Title: "Web Application is down"
  • Watcher: Select Tech as the watcher.
  • Description: Provide a brief description of the issue, including any error messages or user reports.
  • Priority: Set the priority level to "High" or as per your organization's guidelines.
  • Category: Select the appropriate category for web application issues.
  • Attachments: Attach any relevant files or screenshots if available.

The purpose of this exercise is to train system administrators in conducting scheduled server maintenance and promptly responding to web server errors. Administrators will learn to run resource checks, schedule maintenance tasks, and create incident tickets for issue resolution, ensuring the continuous and reliable operation of critical servers and web applications.