Experimental design: case study

Two enthusiastic computer science researchers, let's call them Bill Bodgett and Susan Scarper, have devised a new technique for information retrieval from the World-Wide Web. Their technique has a novel user interface, and employs very sophisticated techniques for cataloguing and prioritizing information. They are justly proud of their work, and after a year of refinement they write it up for publication in a prestigious journal.

Sadly the referees are not impressed; they assert that although they ideas sound good in practice, there is no evidence that the technique really improves the speed or ease of end-users' access to information. The work is of technical interest, they say, but not fully evaluated enough to merit publication at this stage.

Dismayed but not downhearted, Bodgett and Scarper round up a group of a half dozen or so of their friends and family, and sit them down in front of a computer. They ask each person to use the new software to carry out some information retrieval task, then ask them to fill in a questionnaire about the experience. The questionnaire asks the users to rate various aspects of their impressions of the system on a scale of 1-10, with `1' being awful and `10' excellent. On the whole the users rate most aspects of the system's performance as 8-10. The average overall score is 8.6.

Thus encouraged, Bodgett and Scarper re-write their paper and re-submit it. To their shock it is rejected again, this time because their experimental methodology was ill-founded. Now thoroughly discouraged, the researchers shelve the project and leave academic work completely. Both get jobs as programmers in a merchant bank and make enough money to retire at 40. But that's a different story.

In this workshop we will be examining ways in which Bodgett and Scarper could have performed their experiment better, and saved themselves from the disgrace of moving to the private sector.

©1994-2003 Kevin Boone, all rights reserved