Think Like a Tester: Logging, Monitoring, and Alerting

Saturday, April 6, 2019

Logging, Monitoring, and Alerting

This week I'm writing about three things not often associated with testing: logging, monitoring, and alerting. Perhaps you've taken advantage of logging in your testing, but monitoring and alerting seem like a problem for IT or DevOps. However, a bug-free application doesn't mean a thing if your users can't get to it because the server crashed! For this reason, it's important to understand logging, monitoring, and alerting so that we as testers can participate in ensuring the health of our applications.

Logging:

Logging is simply recording information about what happens in an application. This can be done through writing to a file or a database. Often developers will include logging statements in their code to help determine what's going on with the application below the UI. This is especially helpful in applications that make calls to a number of servers or databases.
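As a rough illustration of what such logging statements can look like, here is a minimal Python sketch that writes timestamped entries to a file. The logger name, the message wording, and the send_notification function are assumptions made up for this example; they are not taken from any real application.

```python
import logging
import os
import tempfile


def build_file_logger(log_path: str) -> logging.Logger:
    """Return a logger that appends timestamped entries to log_path."""
    logger = logging.getLogger("notifications")  # hypothetical logger name
    logger.setLevel(logging.INFO)
    logger.propagate = False
    # Remove handlers left over from earlier calls so entries are not duplicated.
    for old_handler in list(logger.handlers):
        logger.removeHandler(old_handler)
        old_handler.close()
    handler = logging.FileHandler(log_path, encoding="utf-8")
    handler.setFormatter(logging.Formatter("%(asctime)s %(levelname)s %(message)s"))
    logger.addHandler(handler)
    return logger


def send_notification(logger: logging.Logger, channel: str, recipient: str) -> None:
    """Hypothetical function that records what it does before and after sending."""
    logger.info("sending notification channel=%s recipient=%s", channel, recipient)
    # (the real delivery call would go here)
    logger.info("notification sent channel=%s recipient=%s", channel, recipient)


def demo() -> str:
    """Write two entries to a temporary log file and return the file's contents."""
    log_path = os.path.join(tempfile.mkdtemp(), "app.log")
    logger = build_file_logger(log_path)
    send_notification(logger, "email", "user@example.com")
    with open(log_path, encoding="utf-8") as log_file:
        return log_file.read()
```

Calling demo() returns the two entries that were written, one per line, each prefixed with a timestamp and the INFO level.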

Recently I tested a notification system that passed a message from a function to a number of different channels. Logging was so helpful in testing because it enabled me to follow the message through the channels. If I hadn't had good logging, I wouldn't have had any way to figure out where the bug was when I didn't get a message I was expecting.

Good log messages should be easy to access and easy to search. You shouldn't have to log on to some obscure remote desktop and sift through tens of thousands of entries with no line breaks. One helpful tool for logging is Kibana, an open-source tool that lets you search and sort through logs in an easy-to-read format.

Good log messages should also be easy to understand and provide helpful information. It's so frustrating to find a log message about an error and discover that it says "An unknown error occurred", or "Error TSGB-45667". Ask your developer if he or she can provide log messages that make it clear what went wrong and where in the code it happened.
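To make the contrast concrete, here is a small Python sketch of an error message that says what failed, for which record, and why. The charge function, the order_id field, and the logger name are hypothetical. Python's logger.exception call also appends the stack trace, which covers the "where in the code it happened" part.

```python
import logging

logger = logging.getLogger("orders")  # hypothetical logger name


def describe_failure(operation: str, order_id: int, error: Exception) -> str:
    """Build a message that names the operation, the record, and the cause."""
    return f"{operation} failed for order_id={order_id}: {type(error).__name__}: {error}"


def charge(order_id: int, amount: float) -> bool:
    """Hypothetical operation that logs a descriptive message when it fails."""
    try:
        if amount <= 0:
            raise ValueError(f"amount must be positive, got {amount}")
    except ValueError as error:
        # logger.exception logs at ERROR level and includes the traceback.
        logger.exception(describe_failure("charge", order_id, error))
        return False
    return True
```

A message such as "charge failed for order_id=42: ValueError: amount must be positive, got -5" gives a tester something to search for and something to put in a bug report.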

Another helpful tactic for logging is to give each event a specific GUID as an identifier. The GUID will stay associated with everything that happens with the event, so you can follow it as it moves from one area of an application to another.
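One possible way to do this in Python is sketched below, using the standard uuid module and a logging.LoggerAdapter so that every entry for an event carries the same identifier. The logger name "events", the step descriptions, and the output format are assumptions for illustration only.

```python
import io
import logging
import uuid


def configure_event_logging(stream) -> logging.Logger:
    """Set up a logger whose entries all start with the event's GUID."""
    logger = logging.getLogger("events")  # hypothetical logger name
    logger.setLevel(logging.INFO)
    logger.propagate = False
    logger.handlers.clear()
    handler = logging.StreamHandler(stream)
    handler.setFormatter(logging.Formatter("event_id=%(event_id)s %(message)s"))
    logger.addHandler(handler)
    return logger


def handle_event(base_logger: logging.Logger) -> str:
    """Tag one event with a GUID and log each step it passes through."""
    event_id = str(uuid.uuid4())
    # The adapter attaches event_id to every record logged through it.
    log = logging.LoggerAdapter(base_logger, {"event_id": event_id})
    log.info("received by notification function")
    log.info("passed to email channel")
    log.info("delivered")
    return event_id
```

Searching the logs for a single event_id value then returns the full path of that event, which makes it easier to see at which step a message went missing.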

Monitoring:

Monitoring means setting up automatic processes to watch the health of your application and the servers that run it. Good monitoring ensures that potential problems can be discovered and dealt with before they reach the end user. For example, if it becomes clear that a server's disk space is approaching maximum capacity, storage or additional servers can be added before the disk fills up.

Things to monitor include:

server response times
load on the server
server errors, such as 500-level response errors
CPU usage
memory usage
disk space

One way to monitor application health is with a periodic health check or a ping. A job is set up to make a request to the server every few minutes and record whether the response was positive or negative. Monitoring can also happen through a tool that watches the number of requests to the server and records whether those requests were successful. Data points such as response times and CPU usage can also be recorded and examined to see if there are any trends that might indicate that the application is unhealthy. One example of a tool that monitors application and server health is AppDynamics.
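The health check described above could be sketched in Python roughly as follows. The health URL, the five-second timeout, and the shape of the recorded result are assumptions. A real setup would run the check from a scheduler (for example a cron job) every few minutes and store each result.

```python
import urllib.error
import urllib.request
from datetime import datetime, timezone


def fetch_status(url: str, timeout: float = 5.0) -> int:
    """Return the HTTP status code for a GET request, or 0 if unreachable."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        return error.code  # the server answered, but with an error status
    except OSError:
        return 0  # DNS failure, refused connection, timeout, and so on


def check_health(url: str, fetch=fetch_status) -> dict:
    """Make one health check request and record whether it was positive."""
    status = fetch(url)
    return {
        "checked_at": datetime.now(timezone.utc).isoformat(),
        "url": url,
        "status": status,
        "healthy": 200 <= status < 300,
    }
```

The fetch parameter exists only so the check can be exercised without a network call, for example check_health("https://example.com/health", fetch=lambda url: 503) yields a result with "healthy" set to False.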

Alerting:

All the logging and monitoring in the world won't be helpful if no one is watching to see if there are problems! This is where alerting comes in. Alerts can be set to notify the appropriate people so that immediate action can be taken when there is a problem.

Some situations that might call for an alert would be:

CPU or memory usage goes above a certain threshold
Disk space goes below a certain threshold
The number of 500 errors goes above a certain level
A health check fails twice in a row
Response times are slower than expected
Load is higher than normal
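
The situations above can be expressed as simple threshold rules. The following Python sketch is one way to do that. Every field name and every threshold value in it is a made-up assumption; real limits depend on the application and should be agreed on with the team.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Snapshot:
    """One set of monitoring readings (all field names are hypothetical)."""
    cpu_percent: float
    memory_percent: float
    free_disk_percent: float
    server_errors_last_5_min: int
    consecutive_health_check_failures: int
    avg_response_ms: float
    requests_per_minute: int


@dataclass
class Thresholds:
    """Illustrative limits only, not recommendations."""
    max_cpu_percent: float = 90.0
    max_memory_percent: float = 90.0
    min_free_disk_percent: float = 10.0
    max_server_errors: int = 20
    max_health_check_failures: int = 1  # alert on the second failure in a row
    max_avg_response_ms: float = 2000.0
    max_requests_per_minute: int = 5000


def triggered_alerts(s: Snapshot, limits: Optional[Thresholds] = None) -> List[str]:
    """Return a description of every rule the snapshot violates."""
    t = limits or Thresholds()
    alerts = []
    if s.cpu_percent > t.max_cpu_percent:
        alerts.append(f"CPU usage at {s.cpu_percent}%")
    if s.memory_percent > t.max_memory_percent:
        alerts.append(f"Memory usage at {s.memory_percent}%")
    if s.free_disk_percent < t.min_free_disk_percent:
        alerts.append(f"Free disk space down to {s.free_disk_percent}%")
    if s.server_errors_last_5_min > t.max_server_errors:
        alerts.append(f"{s.server_errors_last_5_min} server errors in 5 minutes")
    if s.consecutive_health_check_failures > t.max_health_check_failures:
        alerts.append("Health check failed twice in a row")
    if s.avg_response_ms > t.max_avg_response_ms:
        alerts.append(f"Average response time {s.avg_response_ms} ms")
    if s.requests_per_minute > t.max_requests_per_minute:
        alerts.append(f"Load at {s.requests_per_minute} requests per minute")
    return alerts
```

Keeping the limits in one place makes them easy for a tester to review and to question, for instance by asking why a given threshold was chosen.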

There are a number of ways to alert people of problems. Alerts can be set up that will send emails, text messages, or phone calls. PagerDuty is one service that provides this alerting functionality. An important thing to consider, however, is to set off-hours alerts only for serious cases in which users might be affected. No one wants to be woken up in the middle of the night by an alert that says that the QA servers are down! However, a problem in the QA environment could indicate an issue that could be seen in the production environment in the future. So a less invasive alert, such as a message to a team chat room, could be set up for this situation.
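
The routing idea in the previous paragraph might look like this as a Python sketch. The environment names, the business hours, and the channel names ("page", "chat", "email") are assumptions chosen for the example, and the function does not call PagerDuty or any other real service.

```python
from datetime import time


def choose_channel(environment: str, users_affected: bool, at: time) -> str:
    """Pick how loudly to alert, based on seriousness and time of day."""
    # Hypothetical business hours: 8:00 to 18:00.
    off_hours = at < time(8, 0) or at >= time(18, 0)
    if environment == "production" and users_affected:
        return "page"  # phone call or text message, even at night
    if off_hours:
        return "chat"  # quiet message that waits in the team chat room
    return "email"


def describe(environment: str, users_affected: bool, at: time) -> str:
    """Human-readable summary of the routing decision."""
    channel = choose_channel(environment, users_affected, at)
    return f"{environment} alert at {at.strftime('%H:%M')} goes to {channel}"
```

With these rules, a QA outage at 3:00 AM produces a chat message that the team sees in the morning, while a production problem that affects users pages someone right away.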

You may be saying to yourself at this point, "But I'm a software tester! It's not my job to set up logging, monitoring, and alerting for the company." The health of your application is the responsibility of everyone who works on the application, including you! While you might not have the clout to purchase server monitoring software, you still have the power to ask questions of your team, such as:

How can we troubleshoot user issues?
How do we know that we have enough servers to handle our application's load?
How will we know if our API is responding correctly?
How will we know if a DDoS attack is being attempted on our application?
How will we know if our end users are experiencing long wait times?
How will we know if we are running out of disk space?

Hopefully these questions will motivate you and your team to set up logging, monitoring, and alerting that will ensure the health and reliability of your application.

24 comments:

Lisa, April 8, 2019 at 11:30 AM
So glad to see you encourage testers to get involved with logging, monitoring and alerting. I would add to that, observability. We testers have good skills for spotting patterns in data and identifying risks; it's another way we can make valuable contributions to our team and product.

mneiferbag, April 9, 2019 at 5:24 AM
Monitoring is the new testing!