Availability

Your product has little or no value if its functionality isn't available to users. Bugs, crashes, and network outages are examples of what might make your product's functionality unavailable at times.

Product managers therefore typically attach an availability constraint (nonfunctional requirement) to each functional requirement of the product. If one of the functions of the product is to generate reports, for example, a product manager should specify how likely it should be at any particular time that a user will be able to use this functionality.

The question with nonfunctional requirements is always the metric - how you measure them. How do you measure availability? Here are some options:

mean-time-between-failures (MTBF) - the average amount of time elapsed between failures to deliver the functionality.
failure rate - the frequency that the product fails to deliver the functionality. Failure rate is the reciprocal of MTBF, and often is expressed in terms of failures per hour.
uptime - percentage of the time that the functionality is available.
downtime - percentage of the time that the functionality is not available.
mean-time-between-system-abort (MTBSA) - the average amount of time elapsed between complete "reboots" of the system.
mean-time-between-critical-failure (MTBCF) - distinguishes between critical and noncritical failures.
maintenance free operating period (MFOP) - the average amount of time that the functionality is available without any special intervention or system maintenance.

Of course, a prospective customer will always want 100% uptime, but such availability is typically not practical to achieve. If you base a contract on 100% uptime, you will almost certainly be in violation of your contract at some point.

UPDATE: Scott Sehlhorst adds a number of important observations in this entry's comments. One thing he notes is that I neglected to mention MTTR:

mean-time-to-repair (MTTR) - the average amount of time it takes to repair the system after its functionality becomes unavailable. For hardware products, it usually refers to the time to replace a module or part. For software products, it can refer to the amount of time it takes to reboot or restart the system.

Also, some people use "availability" to refer strictly to uptime, and consider all of these parameters to be "reliability" metrics.

Comments

Roger L. Cauvin said…

Sounds like you have some good experience with the various metrics, Scott. I did leave out MTTR, I am going to update the entry to include it.

I also do think it's important to provide context to these metrics. You can use a product in many different circumstances; the availability requirements should specify the possible circumstances to the extent practicable.

As I've stated before, just finding - or even exploring - the right combination of metrics is arguably more important than assigning all of the exact numbers and exhaustive circumstances. (When there's a contract involved, these other factors are obviously still very important.)

Mon May 08, 09:09:00 PM 2006

Cauvin

Search This Blog

Availability

Comments

Popular posts from this blog

Why Spreadsheets Suck for Prioritizing

5 Ways Companies Make Product Decisions

Stop Validating and Start Falsifying