Skip to main content

Availability

Your product has little or no value if its functionality isn't available to users. Bugs, crashes, and network outages are examples of what might make your product's functionality unavailable at times.

Product managers therefore typically attach an availability constraint (nonfunctional requirement) to each functional requirement of the product. If one of the functions of the product is to generate reports, for example, a product manager should specify how likely it should be at any particular time that a user will be able to use this functionality.

The question with nonfunctional requirements is always the metric - how you measure them. How do you measure availability? Here are some options:
  • mean-time-between-failures (MTBF) - the average amount of time elapsed between failures to deliver the functionality.
  • failure rate - the frequency that the product fails to deliver the functionality. Failure rate is the reciprocal of MTBF, and often is expressed in terms of failures per hour.
  • uptime - percentage of the time that the functionality is available.
  • downtime - percentage of the time that the functionality is not available.
  • mean-time-between-system-abort (MTBSA) - the average amount of time elapsed between complete "reboots" of the system.
  • mean-time-between-critical-failure (MTBCF) - distinguishes between critical and noncritical failures.
  • maintenance free operating period (MFOP) - the average amount of time that the functionality is available without any special intervention or system maintenance.
Of course, a prospective customer will always want 100% uptime, but such availability is typically not practical to achieve. If you base a contract on 100% uptime, you will almost certainly be in violation of your contract at some point.

UPDATE: Scott Sehlhorst adds a number of important observations in this entry's comments. One thing he notes is that I neglected to mention MTTR:
  • mean-time-to-repair (MTTR) - the average amount of time it takes to repair the system after its functionality becomes unavailable. For hardware products, it usually refers to the time to replace a module or part. For software products, it can refer to the amount of time it takes to reboot or restart the system.
Also, some people use "availability" to refer strictly to uptime, and consider all of these parameters to be "reliability" metrics.

Comments

Scott Sehlhorst said…
Hey Roger,

I don't think you can realistically use MTBF for measuring software product availability. I notice you didn't suggest that you could, just pointing that out in case anyone assumed it (like I did initially).

Availability requires more than MTBF to measure (My background was mechE prior to software) - you have to also include MTTR (mean time to reset the system.

Availability = MTBF/(MTBF + MTTR).

MTBF-based non-functional requirements should also provide additional information in order to be unambiguously tested.

When I was doing hardware design, we would establish an MTBF measured in operations (say 100,000). We would either present that data, or look at typical usage patterns (10 operations per hour) and express MTBF in terms of hours - 10,000. We never converted MTBF to availability.

For software, we don't have creep-based or other cyclic failure mechanisms. I'm not sure what the true distribution is, I would guess random, but it definitely isn't a Weibull distribution - the one most commonly associated with hardware failures that are a function of repetition.

Since software failures seem to correlate more strongly to circumstances than repetition, I would suggest that an MTBF-based availability calculation is both unmeasureable and ill-advised.

Looking at uptime/downtime and MFOP (as you describe) are more valuable approaches for software.

Thanks,
Scott
Roger L. Cauvin said…
Sounds like you have some good experience with the various metrics, Scott. I did leave out MTTR, I am going to update the entry to include it.

I also do think it's important to provide context to these metrics. You can use a product in many different circumstances; the availability requirements should specify the possible circumstances to the extent practicable.

As I've stated before, just finding - or even exploring - the right combination of metrics is arguably more important than assigning all of the exact numbers and exhaustive circumstances. (When there's a contract involved, these other factors are obviously still very important.)
Paul Young said…
Another important use of MTTR is in services. All the major Telco's use MTTR and at Cisco services, where I worked, we used MTTR heavily as a metric for our managed service.

That brings up another topic; there is a serious lack of resources out there for the "Product Manager" of a service. I wouldn't help me as much now that I've moved to the product side but there are a lot of PM's I know would benefit from it. Maybe we can collaborate on this?

Paul Young

Popular posts from this blog

Why Spreadsheets Suck for Prioritizing

The Goal As a company executive, you want confidence that your product team (which includes all the people, from all departments, responsible for product success) has a sound basis for deciding which items are on the product roadmap. You also want confidence the team is prioritizing the items in a smart way. What Should We Prioritize? The items the team prioritizes could be features, user stories, epics, market problems, themes, or experiments. Melissa Perri  makes an excellent case for a " problem roadmap ", and, in general, I recommend focusing on the latter types of items. However, the topic of what types of items you should prioritize - and in what situations - is interesting and important but beyond the scope of this blog entry. A Sad but Familiar Story If there is significant controversy about priorities, then almost inevitably, a product manager or other member of the team decides to put together The Spreadsheet. I've done it. Some of the mos

Use Case as a Black Box

Consider the following use case: Purchase Items Actor: Purchaser Precondition: Purchaser types at least thirty words per minute and has a web navigation efficiency rating of at least 40. Postcondition: For the average Purchaser acting at full efficiency, the number of seconds elapsed is no more than 30 + 20 * n, where n is the number of items purchased. The name of the use case represents a functional requirement. What does the product do, or enable the user to do? Purchase items. What are we to make of the preconditions and postconditions? What relationship do they have to the requirements for the product? Answer: the preconditions and postconditions are the nonfunctional requirements attached to the functional requirement . Another way of expressing the nonfunctional requirement would be as an attribute and associated constraint: Usability: For a Purchaser who types at least thirty words per minute and has a web navigation efficiency rating of at least 40, it shall take no

Henry Ford's "Faster Horse" Quote

You may have heard the ( apocryphal ) Henry Ford quote: If I'd asked customers what they wanted, they would have said "a faster horse". Over at the On Product Management blog , Saeed gives his take on this infamous quote. He "hates" it, and gives some compelling reasons. Saeed is spot on in his explanations. Personally, I think the quote is great, but it's a matter of interpretation. The valid point of the quote is not that it's a bad idea to facilitate a conversation with your market to better understand it. The valid points are: You must ask the right questions to get valuable answers. You must interpret the answers thoughtfully - often outside their direct meaning - to glean reliable information. Asking questions is not always the best way to "listen" to your market. (E.g., sometimes pure observational studies are more reliable.) Nonetheless, I find the quote is helpful to combat "armchair product management" in the