William Y. Arms
CNRI
[email protected]
Preliminary Draft
January 2, 1998
The current phase of digital library research is highly empirical. A researcher who is developing a new concept implements software that incorporates the concept, demonstrates it with some trial set of data, reports observations on the results, and encourages others to build on the work. This is an effective method of working during the early stages of an experimental field, but as the field matures, we need a more systematic methodology.
For example, three of the current DLI projects are doing work in image recognition. Each is tackling a different aspect of the same problem: to be able to search collections for images that match specific criteria. However, the three projects are using their work in different applications and with different data. Therefore, any comparison of the three approaches is highly subjective.
There are two closely related needs:
Hopefully, the D-Lib Metrics working group will help the development of ways to measure the effectiveness of various aspects of digital library research. The next requirement is standard test data that researchers can use to evaluate their work.
I envisage a test suite that consists of a group of standard sets of test data that represent the major categories of material in digital libraries. The requirements for the test suite are demanding:>
wya
January 2, 1998