What Works When?
Using the Right Tools to Test Your Web Site’s Effectiveness

By Tema Frank, Web Mystery Shoppers International Inc.


Have you ever heard complaints about a web site you’ve worked on? Let’s rephrase: have you ever worked on a site that nobody complained about? Face it: no site is going to meet everybody’s goals perfectly. So when you are responsible for a site’s effectiveness, how do you make sure the changes you are making optimize the conflicting demands of management, marketers, programmers, designers, and end-users?

If the goal of the site is to help your organization communicate better or sell more, a crucial place to turn is to end users. Because if it doesn’t work for them, it won’t meet management’s goals, and if those aren’t met there will be no money to pay marketers, programmers and designers.

There are many ways to get user input, and this article discusses when you would want to use each technique. In an ideal world, you would use all of them at some point in your web site’s life cycle. But this isn’t an ideal world, and odds are you don’t have the time or the budget to do them all. So how do you decide?

The type of testing you do should depend on a combination of your stage in the web site life cycle and your goals for testing. Realistically, it will also be influenced by cost and time pressures.


Focus Groups

These typically involve 6 to 12 people, drawn from your existing customer base or recruited externally. They have a group discussion, led by a moderator, about the web site. If the site does not yet exist, they may be asked what they would like to see in such a site. They may be shown page mock-ups, drawings, or led through an exercise called “card-sorting”, which helps figure out how to organize the planned content.

Often the client is behind one-way mirror, and may be able to ask the moderator (who is wearing a headset) to ask specific follow-up questions.

Costs are based on a combination of recruitment costs, participant expenses, and consulting-related fees. Typical focus groups cost between $3,000 - $8,000 per group, which includes a fee of typically $50 - $100 per participant.

Advantages of Focus Groups
* Client gets to watch participant reactions, and may be able to have moderator ask follow-up questions.
* Participants can “play” with the content – for example, rearranging cards in an order that makes sense to them.
* Can be done early – before money has been spent on programming.

Drawbacks of Focus Groups
* User’s guesses as to how they would use the site are very different from reality.
* Will miss most navigational problems.
* Risk of “groupthink”; what people say is influenced by the others in the room.
* Risk of poor moderators. They must not bias the conversation, yet ensure no one dominates.
* Participants limited by geography, time and willingness to travel to the testing location.
* Often either no transcript available, or significant cost or delay to get a transcript.
* Poorly constructed participant group negates the value (i.e. they must be like your target customers, and not so familiar with your site or company that they are unable to view the concepts with fresh eyes).

Focus Groups Useful When:
* In very early stages -- conceptual work.
* For emotional reactions to colours or treatments.
* To get rough idea of desired content.

On-line Focus Groups

On-line focus groups participant log in from their own computers at a pre-arranged time, and the discussion uses a chat-room approach. In the more sophisticated set-ups, participants can view materials that the moderator displays on the screen, and possibly go back and forth between the chat room and the viewing screen, or see both at once.

Advantages of On-line Focus Groups
* Overcomes geographic limitations of traditional focus groups.
* Real-time transcript is available for analysis.

Drawbacks of On-line Focus Groups
* Cannot do physical activities such as card-sorting.
* Hard to follow the flow of conversation.
* No facial expressions or tone of voice.

On-line Focus Groups Useful When:
* Doing early-stage testing for a teen-oriented site. Teens tend to be the group that is most comfortable with on-line chat, and they may prefer the semi-anonymity of this format to a traditional focus group. For most other users, this is not an ideal format.


Lab-based Usability Testing

Users are brought into a testing lab, one at a time. A typical session lasts one to two hours with each user. The participants work on the lab’s computer and are asked to “think aloud” as they work.

The client can usually watch behind a one-way mirror. As with a focus group, there may be a “moderator” in the lab with the user, who can prod for further insight.

In some labs there is also a video-camera taping the participant’s facial reactions, and some form of clickstream monitoring (tracking of where the person clicks or scrolls on a page).

Given the intensity of this type of testing, there are usually only a few people tested. The traditional minimum was five testers, but current research suggests that this is far too few for today’s complex web sites. The problems identified by the first few users may not reflect the most important problems in need of repair.

A typical set of lab-based tests with 5 – 10 users will cost in the range of $20,000 - $100,000, depending on the degree of consulting that is included in the project and the market clout of the testing company.

Advantages of Lab-based Usability Testing
* Users are interacting with the real site, so you can learn about navigational problems.
* Can ask the users questions as they interact with the site, so surprising actions can be explained.
* Because you see the user’s facial expressions and hesitations, you get a sense of the frustration level and uncertainties experienced by the user.


Drawbacks of Lab-based Usability Testing
* A stressful, unnatural environment. Stress affects performance.
* Using an unfamiliar computer.
* Most lab-based computers are more finely-tuned and using more current software than most users’ real computers, so many errors users encounter in real life will be missed.
* Users need to be willing and available to come to lab. This causes a skew in who is willing to participate.
* Geographic limitations. Most labs are in major cities. But are all your users?
* Small number of users tested. Are the problems the first 5 – 10 people encounter typical?
* Often expensive, particularly if a transcript is prepared and analyzed after-the-fact.

Lab-based Usability Testing Useful When:
* Want to see facial reactions; hear vocalized thoughts.
* Want to be able to intervene with real-time questions.
* Site users are a homogeneous group, so it is possible to recruit a small, but representative sample.

Surveys

There are many companies that offer surveys, either mailed to site visitors after the fact, or that pop up on site while the visitor is using it. Mailed surveys are typically too long after the fact to yield any meaningful results for web site assessment. At most you will get vague, general feelings about the overall experience. (“It left me feeling frustrated.” Or “It was OK”.)

Pop-up surveys are programmed to either pop up for every visitor, or for every nth visitor, or for visitors to specific pages. Visitors are asked if they will participate in a survey, and sometimes offered a chance to win a prize as an incentive for participation. Generally, a maximum of about three minutes worth of questions and answers is the longest you can expect people to stick with such a survey.

Costs vary from small, free surveys offered by companies such as Zoomerang, to sophisticated surveys that change the questions asked depending on previous answers and have a lot of built-in analytic tools. The latter type of survey can easily cost in the $10,000 - $20,000 range. Because a heavily trafficked site will usually be able to collect a large number of responses very quickly in this way, pricing is often quoted on a per-respondent basis.


Advantages of Website (Pop-up) Surveys
* Large numbers can be surveyed
* Fast turn-around
* Low cost/respondent
* Actual site visitors responding
* Questions can be changed often
* No need to pay respondents

Drawbacks of Website (Pop-up) Surveys
* Survey fatigue: people are less willing than they used to be to spend time answering such surveys.
* Bias in who replies. Respondents are more likely to be new Internet users, as they are less likely to have become accustomed to being asked their opinion on web sites.
* Intrusive; it may annoy site users.
* Increasing prevalence of pop-up stoppers is making it harder to get results, especially from experienced internet users.
* Very limited number of questions feasible, so you cannot get in-depth responses.
* No tracking of the path taken and the customer’s reactions as they moved along that path.
* Requires special coding be placed on your site. This places demands on IT.

Website (Pop-up) Surveys Useful When:
* Want general reactions to site.
* Have lots of visitors.
* Want feedback quickly, but not in-depth.


Metrics/Analytics Software

Metrics or analytics software consists of sophisticated versions of server logs. Some can track clickstream details (what pages people clicked on and in what order) and analyse patterns and drop-off points. They can usually track which pages were viewed and the average length of time spent per page.

They are useful together with other methods, but you have to infer customer motivations. They may tell you, for instance, that 63% of your site visitors abandon your site at the first page of your order form. You have no way of knowing whether they are leaving at that point because the page isn’t displaying properly on the screens of Windows XP users, or if they couldn’t find information about shipping costs and were unwilling to continue without knowing that, or if they pressed the wrong button and then decided it wasn’t worth the hassle to go back and re-enter the information.

Costs range from the free basic statistics provided by many site hosting companies, to sophisticated programs in the tens or hundreds of thousands of dollars.

Advantages of Site Metrics/Analytics Programs
* Can provide detailed statistical information on your actual site visitors.
* Can help you identify problem pages.

Drawbacks of Site Metrics/Analytics Programs
* The volume of statistical data can be overwhelming. Many site owners find it impossible to interpret, and end up ignoring most of it.
* Does not explain why visitors are behaving as they are.

Site Metrics/Analytics Programs Useful When:
* You have the staff and budget to do sophisticated analysis of the data, and can combine it with qualitative information gained from other approaches.
* It is provided free along with your hosting agreement, if you use it in conjunction with other approaches.


Unsolicited Customer Feedback

Almost everybody gets some feedback on their site. From family and friends to site visitors who feel strongly enough that they’ll take the time to send you an e-mail or fill in a comments form, people will give you their opinions. If you have a call centre, you can also learn a lot about peoples’ reactions to your site from analyzing the help requests that relate to the web site.

Since you get it, use it. You may get some gems from this sort of feedback. The big mistake site owners often make, though, is assuming that what they hear from such channels is typical, and that they thus know all they need to know about what people think of their site. Wrong! The people who take the time to comment are not typical of the general public. They are a vocal minority, who tend to perceive things in more extreme terms than most, and who have enough time on their hands that they consider it worth the time to comment.

Advantages of Unsolicited Customer Feedback
* Minimal cost.
* Real users.
* Chance to repair relationship with dissatisfied customers who bother to comment.

Drawbacks of Unsolicited Customer Feedback
* Biased sample.
* Miss many issues.
* Analysis can be costly in large organizations.
* You must respond quickly and well, or your lack of correct response will make matters worse.
* Often you will not learn exactly where the problem occurred. The comments are often too vague to be useful.

Unsolicited Customer Feedback Useful when:
* Making minor adjustments to the site as you go along.
* Ensuring a good linkage between customer service staff and site promises.
* Input for redesign.

Remote Usability Testing

“Remote usability testing” takes two basic forms. One approach is to work one-on-one, as in a lab-based usability test, but with the testers working from their own computers and either talking by phone or through on-line chat with the interviewer while they work through the site. The advantages and drawbacks of “One-On-One Remote Usability Testing”. are similar to lab-based testing, except that you can use testers from any geographic region, but you cannot see their facial expressions.

Another approach is to have lots of people (typically 100 – 200) work independently on their own computers to try to complete tasks at the site, and report on their progress by answering survey forms as they go through the site. Since they are expected to devote much more time than is expected in a pop-up survey, they are usually paid $10 - $20 for their effort. We’ll call this “Large Sample Remote Usability Testing” (LSRUT).

Advantages of Large Sample Remote Usability Testing (LSRUT)
* No geographic or time-based limitations.
* Collect lots of detail as they go.
* Includes both qualitative and quantitative information.
* Realistic setting; low stress.
* Wide range of computer systems; uncover problems faced using buggy personal computers.
* No demands on IT department*
* Provides easy ability to benchmark – both against competitors and to compare present and future performance*.

* (Depending on system used to collect the information)

Drawbacks of LSRUT
* Can’t see facial expressions or ask spur-of-the-moment follow-up questions.
* Need reasonably literate testers.

LSRUT Useful when:
* Need either a wide range of testers (e.g. people from across North America) or a specific type of testers (e.g. rural Albertans aged 50+)
* A current site or a draft version of a new site is accessible on-line
* Preparing for redesign
* Competitive analysis*
* Benchmarking*
* Monitor and reward customer service staff*

* (Depending on system used to collect the information)

Which Techniques are Best at What Stages?

Concept development: Use focus groups, plus feedback from clients and current customers.

Once online (test or live server): Use lab-based testing if you need to see facial expressions; otherwise consider alternatives such as LSRUT.

Ongoing: Pop-ups if and when needed; web analytics/metrics, collect and analyze unsolicited feedback; use LSRUT to ensure not slipping and for competitive analysis.

Redesign: LSRUT before redesigning, so you don’t throw out the baby with the bathwater! Also use data from ongoing stage.



Tema Frank is the president of Web Mystery Shoppers International Inc., and a previous WebAwards judge. For more information on this and related topics, see the Research page at www.webmysteryshoppers.com. Tema can be reached toll-free at 1-866-211-7027 or at tema@webmysteryshoppers.com .





 

 

 

 

 

 

bullet Award Update
bullet Featured Article
bullet Featured Site
bullet Special Offers
bullet Feedback

bullet Advertising
bullet Books
bullet Conferences
bullet Direct Mail
bullet Email
bullet PR
bullet Trade Press
bullet Contact us
bullet Become a sponsor
bullet Contribute an article

bullet BURST! Media
bullet Venture Direct
bullet Small Army
bullet Line56 Media
bullet Perseus Software
bullet Velocity Micro

 

bullet Microsoft Office Live Meeting
bullet Vertical Response
bullet NewsUSA
bullet O’Dwyer’s PR Daily
bullet Burrelle’s/Luce
bullet OddCast
bullet The Wall Street Journal Online
bullet Barron’s Online