APPENDIX J
RETRIEVAL PERFORMANCE IN CHESHIRE
Appendix J contains precision and recall ratios for all search queries submitted to CHESHIRE throughout the experiment. The causes of search failures, if applicable, are also given. The figures in Column 1 refer to search query number (Q. no.); the full text of the query can be found in Appendix I.
Precision and recall ratios obtained before the relevance feedback searches are given in Columns 2 and 3, respectively. No precision and recall ratios are available for discontinued searches. Columns 4 through 9 give precision and recall ratios obtained after relevance feedback searches. No figures are available if a relevance feedback search was not performed for a given search query. Average precision and recall ratios are given in Columns 10 and 11, respectively.
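As a rough illustration of how the figures in these columns relate to one another, the sketch below uses the standard definitions of precision and recall and a plain arithmetic mean over the initial search and each feedback iteration. Both are assumptions: this appendix does not state how relevance counts were obtained or how the averages were computed. The function names are hypothetical. The mean reproduces the tabulated averages for rows such as queries 001 and 008, but a few rows (for example query 122) do not reproduce under it, so treat it as an observation about the data rather than the author's stated method.

```python
from statistics import mean


def precision(relevant_retrieved, retrieved):
    """Standard precision: share of retrieved records judged relevant."""
    return relevant_retrieved / retrieved if retrieved else 0.0


def recall(relevant_retrieved, relevant_total):
    """Standard recall: share of all relevant records that were retrieved."""
    return relevant_retrieved / relevant_total if relevant_total else 0.0


def average_ratios(pairs):
    """Assumed averaging rule: arithmetic mean of the (P, R) pairs.

    `pairs` holds one (precision, recall) tuple for the initial search
    and one for each relevance feedback iteration that was performed.
    """
    precisions = [p for p, _ in pairs]
    recalls = [r for _, r in pairs]
    return round(mean(precisions), 3), round(mean(recalls), 3)


# Values copied from the table: initial search, then feedback iterations.
q001 = [(0.000, 0.850), (0.166, 0.812)]
q008 = [(0.263, 0.312), (0.200, 0.273), (0.00, 0.250)]

avg_001 = average_ratios(q001)  # tabulated averages: .083 and .831
avg_008 = average_ratios(q008)  # tabulated averages: .154 and .278
```

Queries with no feedback search have a single pair, so under this rule their average simply repeats the initial ratios, which is what most such rows in the table show.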
Search effectiveness for each query is given in Column 12. No data is provided in this column for out-of-domain search queries or for queries that were discontinued for some reason.
The cause of search failure, if applicable, is briefly explained in Column 13.
P = precision ratio; R = recall ratio. Iterations refer to relevance feedback searches.
Q. no. | P before relevance feedback | R before relevance feedback | P, first iteration | R, first iteration | P, second iteration | R, second iteration | P, third iteration | R, third iteration | Average P | Average R | Search performance: effective (E) / ineffective (I) | Causes of search failure |
001 | .000 | .850 | .166 | .812 | | | | | .083 | .831 | E | |
002 | .150 | .750 | .055 | 1.00 | | | | | .105 | .875 | I | Collection failure |
003 | | | | | | | | | | | | No clusters selected; user did not like the clusters |
004 | | | | | | | | | | | | No clusters selected; user wanted to revise the query |
005 | .210 | .850 | .105 | .818 | | | | | .158 | .834 | E | |
006 | .5 | .231 | .00 | .00 | | | | | .167 | .077 | E | |
007 | .00 | .231 | | | | | | | .000 | .231 | I | Specific query |
008 | .263 | .312 | .200 | .273 | .00 | .250 | | | .154 | .278 | E | |
009 | | | | | | | | | | | | No clusters selected; user wanted to revise the query |
010 | .214 | .750 | .667 | 1.00 | | | | | .440 | .875 | E | |
011 | | | | | | | | | | | | Faulty cluster selection |
012 | .00 | .833 | .00 | .00 | | | | | .00 | .417 | E | |
013 | | | | | | | | | | | | Collection failure; zero retrieval |
014 | .00 | .400 | .461 | .500 | | | | | .230 | .450 | E | |
015 | .167 | .00 | | | | | | | .167 | .00 | I | Collection failure |
016 | .105 | .00 | .133 | .00 | .00 | .00 | | | .079 | .00 | I | Collection failure |
017 | | | | | | | | | | | | No clusters selected; telecommunication problems |
018 | | | | | | | | | | | | Cluster failure; no clusters selected as being relevant |
019 | .053 | .033 | | | | | | | .052 | .033 | I | Collection failure |
020 | | | | | | | | | | | | Collection failure; zero retrieval |
021 | | | | | | | | | | | | Collection failure; zero retrieval |
022 | .222 | .333 | | | | | | | .222 | .333 | E | |
023 | .316 | .333 | .200 | .333 | | | | | .258 | .333 | E | |
024 | .316 | .727 | .312 | .555 | | | | | .314 | .641 | E | |
025 | .00 | .100 | .00 | .00 | | | | | .00 | .500 | E | |
026 | .368 | .467 | .00 | .375 | | | | | .184 | .421 | E | |
027 | | | | | | | | | | | | No clusters selected; user interface problems |
028 | .067 | .850 | .00 | 1.00 | | | | | .033 | .925 | I | Search statement |
029 | .400 | .133 | .00 | .385 | | | | | .200 | .259 | I | User interface problems |
030 | .00 | .00 | | | | | | | .00 | .00 | I | Out of domain search query |
031 | .316 | .350 | .150 | .461 | .30 | 1.00 | | | .255 | .604 | E | |
032 | .667 | .417 | .100 | .286 | .00 | .00 | | | .255 | .234 | I | Vocabulary problem |
033 | | | | | | | | | | | | No clusters selected; out of domain search query |
034 | .00 | .00 | | | | | | | .00 | .00 | I | Collection failure |
035 | | | | | | | | | | | | No clusters selected; out of domain search query |
036 | .667 | .600 | | | | | | | .666 | .600 | E | |
037 | | | | | | | | | | | | Zero retrieval; call number search |
038 | .00 | .00 | | | | | | | .00 | .00 | I | Broad search query |
039 | .00 | .400 | .00 | .417 | | | | | .00 | .408 | E | |
040 | | | | | | | | | | | | No clusters selected as being relevant |
041 | | | | | | | | | | | | No clusters selected; user interface problems |
042 | .053 | .250 | .055 | .333 | .50 | 1.00 | | | .202 | .528 | E | |
043 | | | | | | | | | | | | Zero retrieval; help request |
044 | | | | | | | | | | | I | Help request |
045 | | | | | | | | | | | | Zero retrieval; "the" is a stop word |
046 | .00 | .00 | .00 | .500 | | | | | .00 | .250 | E | |
047 | .053 | .555 | | | | | | | .053 | .555 | E | |
048 | .273 | .643 | .00 | 1.00 | | | | | .136 | .821 | E | |
049 | .75 | .9 | .00 | 1.00 | | | | | .375 | .95 | E | |
050 | | | | | | | | | | | | No clusters selected; user wanted to revise the query |
051 | .053 | .25 | .091 | .667 | | | | | .072 | .458 | I | Collection failure |
052 | .267 | .25 | .00 | .00 | | | | | .133 | .125 | E | |
053 | | | | | | | | | | | | No clusters selected; user interface problems |
054 | .5 | .85 | .00 | .778 | | | | | .25 | .814 | E | |
055 | | | | | | | | | | | | No clusters selected; out of domain search query |
056 | .5 | .5 | .00 | .00 | .00 | .00 | | | .167 | .167 | I | Collection failure |
057 | .067 | 1.00 | | | | | | | .067 | 1.00 | E | |
058 | | | | | | | | | | | | No clusters selected; user wanted to revise the query |
059 | | | | | | | | | | | | No clusters selected; user wanted to revise the query |
060 | .263 | 1.00 | .053 | .00 | | | | | .158 | .5 | E | |
061 | .474 | .45 | .00 | .15 | .44 | .7 | | | .306 | .433 | E | |
062 | .00 | .00 | .00 | .00 | .00 | .00 | | | .00 | .00 | I | Collection failure; search statement |
063 | .1 | 1.00 | | | | | | | .1 | 1.00 | E | |
064 | | | | | | | | | | | | Zero retrieval; user entered gibberish characters |
065 | | | | | | | | | | | | Collection failure; no clusters selected |
066 | .00 | .25 | | | | | | | .00 | .25 | I | Collection failure; most recent items needed |
067 | | | | | | | | | | | | Scope failure; periodical literature search |
068 | | | | | | | | | | | | Cluster failure; user did not like the clusters |
069 | | | | | | | | | | | I | Scope failure; periodical literature search |
070 | | | | | | | | | | | I | User did not like clusters; no clusters selected as relevant |
071 | | | | | | | | | | | | Out of domain search query; zero retrieval |
072 | | | | | | | | | | | I | User interface problems; no clusters selected as being relevant |
073 | .00 | .00 | | | | | | | .00 | .00 | I | User interface problems; collection failure |
074 | | | | | | | | | | | I | Stemming algorithm; did not recognize "C"; zero retrieval |
075 | | | | | | | | | | | I | Stemming algorithm failure; no clusters selected as relevant |
076 | | | | | | | | | | | | No clusters selected; user wanted to revise her query |
077 | .00 | .850 | .00 | .650 | .40 | 1.00 | | | .133 | .833 | E | |
078 | .091 | 1.00 | .00 | .00 | | | | | .045 | .500 | I | Known-item search |
079 | | | | | | | | | | | | User typed "quit" in the query description screen |
080 | .667 | 1.00 | | | | | | | .667 | 1.00 | I | Collection failure |
081 | .500 | .200 | | | | | | | .500 | .200 | E | |
082 | .500 | .650 | | | | | | | .500 | .650 | E | |
083 | .579 | .850 | .250 | .450 | .11 | .100 | .00 | .00 | .235 | .350 | E | |
084 | | | | | | | | | | | | Out of domain search query |
085 | | | | | | | | | | | | No clusters selected; out of domain search query |
086 | .555 | .800 | | | | | | | .555 | .800 | E | |
087 | | | | | | | | | | | | Out of domain search query |
088 | .00 | .555 | | | | | | | .00 | .555 | E | |
089 | .800 | .650 | .833 | .454 | | | | | .817 | .552 | E | |
090 | | | | | | | | | | | | Author search; not supported in CHESHIRE; false drops |
091 | | | | | | | | | | | | Author search; not supported in CHESHIRE; zero retrieval |
092 | | | | | | | | | | | | Author search; not supported in CHESHIRE; zero retrieval |
093 | | | | | | | | | | | | Author/title search; not supported in CHESHIRE |
094 | | | | | | | | | | | | Quit entered in the query description screen |
095 | .00 | .00 | | | | | | | .00 | .00 | I | Collection failure |
096 | .368 | 1.00 | .100 | .00 | | | | | .234 | .500 | E | |
097 | | | | | | | | | | | | Author search; not supported in CHESHIRE |
098 | | | | | | | | | | | | No clusters selected as being relevant |
099 | .500 | .650 | .500 | .800 | | | | | .500 | .725 | E | |
100 | | | | | | | | | | | | Out of domain search query |
101 | .667 | .300 | .00 | .350 | | | | | .333 | .325 | E | |
102 | .500 | .00 | | | | | | | .500 | .00 | I | Collection failure |
103 | .210 | 1.00 | | | | | | | .210 | 1.00 | E | |
104 | .105 | 1.00 | .00 | 1.00 | | | | | .053 | 1.00 | E | |
105 | .263 | .500 | .00 | .00 | | | | | .131 | .250 | E | |
106 | | | | | | | | | | | | Out of domain search query |
107 | | | | | | | | | | | I | Collection failure; no clusters selected as being relevant |
108 | | | | | | | | | | | I | Collection failure; zero retrieval |
109 | .222 | .300 | .300 | .300 | .00 | .150 | | | .173 | .250 | E | |
110 | | | | | | | | | | | E | |
111 | | | | | | | | | | | I | Telecommunication problem; no cluster selected |
112 | .053 | .00 | .050 | .00 | .00 | .00 | | | .051 | .00 | I | Collection failure |
113 | | | | | | | | | | | I | False drops; no clusters selected as being relevant |
114* | .158 | .210 | .450 | .600 | .00 | .00 | .00 | .00 | .122 | .162 | E | |
115 | .500 | .00 | .00 | .00 | .05 | .00 | | | .183 | .00 | E | |
116 | .158 | .200 | .150 | .875 | .05 | 1.00 | | | .119 | .692 | E | |
117 | .263 | .263 | .105 | .143 | | | | | .184 | .203 | E | |
118 | | | | | | | | | | | I | Stemming algorithm; "r&d" not recognized; zero retrieval |
119 | .684 | .850 | .250 | .450 | | | | | .467 | .650 | E | |
120 | | | | | | | | | | | | Out of domain search query |
121 | | | | | | | | | | | | Out of domain search query; no clusters selected as relevant |
122 | .333 | .333 | .500 | 1.00 | | | | | .117 | .667 | I | Out of domain search query |
123 | .263 | .750 | .100 | .500 | .10 | .50 | | | .154 | .583 | E | |
124 | .00 | .800 | .00 | 1.00 | | | | | .00 | .900 | E | |
125 | | | | | | | | | | | I | Collection failure; no clusters selected as being relevant |
126 | .00 | 1.00 | | | | | | | .00 | 1.00 | E | |
127 | .00 | .400 | | | | | | | .00 | .400 | I | Library of Congress Subject Headings |
128 | .272 | .600 | .00 | .00 | | | | | .136 | .300 | I | Library of Congress Subject Headings |
129 | .00 | 1.00 | | | | | | | .00 | 1.00 | E | |
130 | | | | | | | | | | | | Out of domain search query; user wanted to revise the query |
131 | .00 | .500 | | | | | | | .00 | .500 | I | Collection failure |
132 | .00 | 1.00 | | | | | | | .00 | 1.00 | E | |
133 | | | | | | | | | | | | Out of domain search query |
134 | | | | | | | | | | | | Out of domain search query |
135 | | | | | | | | | | | | Out of domain search query; no clusters selected as relevant |
136 | | | | | | | | | | | | Out of domain search query; no clusters selected as relevant |
137 | | | | | | | | | | | | Out of domain search query; no clusters selected as relevant |
138 | | | | | | | | | | | | |
139 | | | | | | | | | | | | Out of domain search query; zero retrieval |
140 | .105 | 1.00 | .050 | .00 | .00 | .00 | .00 | .00 | .039 | .250 | I | Collection failure |
141 | | | | | | | | | | | I | Collection failure |
142 | .368 | .615 | .077 | .200 | | | | | .223 | .408 | E | |
143 | | | | | | | | | | | | No clusters selected; user wanted to revise her query |
144 | | | | | | | | | | | | No clusters selected; user wanted to revise her query |
145 | .00 | .615 | .00 | .00 | | | | | .00 | .308 | E | |
146 | .500 | .00 | | | | | | | .500 | .00 | I | Collection failure |
147 | .316 | .545 | .250 | 1.00 | .00 | .00 | | | .205 | .515 | E | |
148 | .500 | .769 | .00 | .00 | | | | | .250 | .385 | E | |
149 | .00 | .500 | | | | | | | .00 | .500 | E | |
150 | .158 | 1.00 | | | | | | | .158 | 1.00 | I | Collection failure |
151 | | | | | | | | | | | I | Cluster failure; no clusters selected as being relevant |
152 | .053 | .769 | .125 | .333 | .00 | .00 | | | .059 | .367 | E | |
153 | | | | | | | | | | | I | Collection failure |
154 | | | | | | | | | | | I | Collection failure; no clusters selected as being relevant |
155 | | | | | | | | | | | | Out of domain search query; zero retrieval |
156 | .00 | .900 | .00 | .400 | | | | | .00 | .650 | E | |
157 | | | | | | | | | | | I | No apparent reason; relevant clusters, but not selected |
158 | .00 | 1.00 | | | | | | | .00 | 1.00 | | Out of domain search query |
159 | | | | | | | | | | | | Out of domain search query |
160 | .00 | 1.00 | .00 | .00 | | | | | .00 | .500 | | Out of domain search query |
161 | | | | | | | | | | | | |
162 | .00 | .333 | .00 | .00 | | | | | .00 | .167 | I | Specific query |
163 | .053 | .100 | | | | | | | .053 | .100 | I | Collection failure |
164 | .00 | .667 | | | | | | | .00 | .666 | E | |
165 | .00 | .00 | | | | | | | | | I | Collection failure |
166 | | | | | | | | | | | I | Cluster failure |
167 | | | | | | | | | | | | No clusters selected; user wanted to revise his query |
168 | | | | | | | | | | | | No clusters selected; user wanted to revise his query |
169 | .526 | .950 | .00 | .437 | | | | | .263 | .694 | E | |
170 | .526 | .850 | .300 | .500 | | | | | .413 | .675 | E | |
171 | .143 | .500 | .050 | 1.00 | | | | | .096 | .750 | E | |
172 | | | | | | | | | | | | No clusters selected as being relevant |
173 | | | | | | | | | | | | No clusters selected as being relevant |
174* | .167 | .500 | .100 | .500 | .17 | .500 | .00 | 1.00 | .097 | .500 | I | Library of Congress Subject Headings |
175 | .263 | .00 | | | | | | | .263 | .00 | I | Collection failure |
176 | .538 | 1.00 | .267 | 1.00 | .10 | 1.00 | .00 | .00 | .226 | .750 | I | No apparent reason given |
177 | | | | | | | | | | | | Out of domain search query; no clusters selected as relevant |
178 | .500 | .429 | | | | | | | .500 | .429 | I | Search statement; abbreviation used |
179 | .053 | 1.00 | | | | | | | .053 | 1.00 | E | |
180 | .421 | .00 | .00 | .00 | | | | | .210 | .00 | I | Collection failure |
181 | .368 | .214 | .250 | .364 | .25 | .714 | | | .289 | .431 | I | Library of Congress Subject Headings |
182 | | | | | | | | | | | | Out of domain search query |
183 | | | | | | | | | | | | Out of domain search query |
184 | .158 | .250 | .250 | .210 | | | | | .204 | .230 | I | Search statement; truncated word not recognized |
185 | .200 | 1.00 | .00 | .00 | | | | | .100 | .500 | E | |
186 | .00 | .00 | .00 | .00 | | | | | .00 | .00 | I | Collection failure |
187 | | | | | | | | | | | | No clusters selected as being relevant |
188 | .00 | 1.00 | .00 | .00 | | | | | .00 | .500 | E | |
189 | .00 | .00 | | | | | | | .00 | .00 | I | User interface problems |
190 | .00 | .00 | .500 | .00 | | | | | .250 | .00 | I | User interface problems |
191 | | | | | | | | | | | | Collection failure; no clusters selected as being relevant |
192 | .214 | .571 | .062 | .333 | .00 | 1.00 | | | .092 | .635 | I | Library of Congress Subject Headings |
193 | | | | | | | | | | | | Collection failure |
194 | .053 | .200 | .00 | .00 | .00 | .00 | | | .017 | .067 | I | Search statement |
195 | .250 | .300 | .100 | .300 | | | | | .170 | .300 | I | Out of domain search query |
196 | | | | | | | | | | | | Out of domain query; no clusters selected as being relevant |
197 | | | | | | | | | | | | Out of domain search query |
198 | .00 | .00 | | | | | | | .00 | .00 | I | Collection failure |
199 | .263 | .818 | .050 | 1.00 | | | | | .156 | .910 | E | |
200 | | | | | | | | | | | | Title search; not supported in CHESHIRE |
201 | .100 | 1.00 | | | | | | | .100 | 1.00 | | Title search; not supported in CHESHIRE |
202 | | | | | | | | | | | | Title search; not supported in CHESHIRE |
203 | | | | | | | | | | | | Title search; not supported in CHESHIRE |
204 | | | | | | | | | | | | Title search; not supported in CHESHIRE |
205 | | | | | | | | | | | | Title search; not supported in CHESHIRE |
206 | | | | | | | | | | | | Title search; not supported in CHESHIRE |
207 | | | | | | | | | | | | |
208 | .00 | .00 | | | | | | | .00 | .00 | I | Collection failure |
209 | | | | | | | | | | | | Stemming algorithm failure; abbreviation ("e") not recognized |
210 | .789 | .750 | .700 | .867 | | | | | .744 | .808 | E | |
211 | .200 | .650 | .00 | .100 | | | | | .100 | .370 | E | |
212 | | | | | | | | | | | | Out of domain search query |
213 | | | | | | | | | | | | No clusters selected as being relevant |
214 | | | | | | | | | | | | No clusters selected as being relevant |
215 | .00 | .00 | | | | | | | .00 | .00 | | Out of domain search query |
216 | | | | | | | | | | | | Out of domain search query; no clusters selected as relevant |
217 | .947 | .900 | .800 | .800 | | | | | .874 | .850 | E | |
218 | | | | | | | | | | | | Out of domain search query; no clusters selected as relevant |
219 | | | | | | | | | | | I | No reason given; relevant clusters retrieved but not selected |
220 | | | | | | | | | | | | Out of domain search query; no clusters selected as relevant |
221 | .316 | .350 | .222 | .210 | | | | | .269 | .280 | E | |
222 | | | | | | | | | | | I | Collection failure; no clusters selected as being relevant |
223 | .053 | .00 | | | | | | | .053 | .00 | I | Search statement |
224 | .053 | .400 | .100 | .400 | .06 | .200 | .00 | .250 | .054 | .312 | I | Faulty cluster selection |
225 | .526 | .950 | .454 | .700 | .00 | .850 | | | .327 | .833 | I | Search statement |
226 | | | | | | | | | | | | Misspelling; zero retrieval |
227 | .105 | .625 | .00 | .333 | | | | | .053 | .479 | E | |
228 | .846 | .55 | | | | | | | .846 | .550 | E | |