APPENDIX C
TRANSACTION LOG RECORD FORMAT
This appendix describes the types of data captured for each search query in the transaction log record.
Field Name | Description |
User's password | Each user's password is captured during the logon process |
Logon date and time | Logon date and time (to the nearest second) are supplied by the system |
Full search query | The full search statement entered by the user |
Parsed search query | The full search statement is preparsed. Three pieces of data are recorded: 1) stemmed query terms; 2) term IDs; and 3) the collection weight of each search term |
Normalized call numbers | Normalized call numbers in user-selected clusters are recorded in the log file |
Top Ranked Clusters
(Up to 20 cluster records can be displayed)
Query id number | Query id number assigned by the system |
Cluster id number | Each cluster record that best represents all the records in a given cluster is assigned a unique number in advance |
Cluster rank | The rank of each retrieved cluster record (1st, 2d, etc.) among all the retrieved cluster records |
Cluster weight | Weight indicating how closely each cluster record matches the query entered by the user |
Action taken by the user | Code indicating whether the user displayed the cluster record and whether he or she selected it as relevant or not |
Normalized call number | Classification number that best represents all the bibliographic records in a given cluster |
Topical information | Broad topical information taken from the Library of Congress Classification (LCC) scheme (e.g., Bibliography Libraries . . .) |
Number of records | Number of bibliographic records represented by a given cluster |
Subject headings | The three most frequently encountered Library of Congress subject headings attached to the bibliographic records in a given cluster (along with their frequencies) |
Top Ranked Bibliographic Records
(Up to 20 bibliographic records can be displayed)
Query id number | Query id number assigned by the system |
Record id number | ID number of the record displayed |
Record rank | The rank of each retrieved bibliographic record (1st, 2d, etc.) among all the retrieved bibliographic records |
Record weight | Weight indicating how closely the bibliographic record matches the query entered by the user |
Feedback iteration number | Whether the record was retrieved during the original search or a relevance feedback search (users can perform more than one relevance feedback search for the same query) |
Action taken by the user | Code indicating whether the user displayed the bibliographic record and whether he or she selected it as relevant or not |
Bibliographic data | The full MARC data (author, title, imprint, subject headings, etc.) for each retrieved bibliographic record, including local call numbers |
Completion time | Date and time supplied by the system once the user completes the search |
Records Retrieved During Relevance Feedback Searches
(Up to 20 records can be displayed)
Should the user opt for a relevance feedback search after the first retrieval results, CHESHIRE revises the original query based on the user's relevance judgments and retrieves more bibliographic records. The same type of information as given above (under Top Ranked Bibliographic Records) is displayed for each bibliographic record retrieved during relevance feedback searches. The user can continue relevance feedback searches as many times as desired. Starting and ending times are supplied for each relevance feedback cycle.
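The record layout described in this appendix can be pictured as a nested data structure. The following Python sketch is illustrative only and is not taken from the CHESHIRE source: every class, attribute, and method name is hypothetical, the numbering of feedback iterations (0 for the original search, 1 and up for feedback cycles) is an assumption, and so is the reading that the limit of 20 displayed records applies separately to each search cycle.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import List, Optional, Tuple

# The appendix states that up to 20 records can be displayed.
# Applying the limit per search cycle is an assumption of this sketch.
MAX_DISPLAYED = 20


@dataclass
class ParsedTerm:
    stem: str                 # stemmed query term
    term_id: int              # term ID
    collection_weight: float  # collection weight of the search term


@dataclass
class ClusterEntry:
    query_id: int
    cluster_id: int
    rank: int
    weight: float
    action_code: str          # hypothetical code: displayed / selected as relevant
    normalized_call_number: str
    topical_information: str
    record_count: int
    # (subject heading, frequency) pairs; the appendix records the top three
    subject_headings: List[Tuple[str, int]] = field(default_factory=list)


@dataclass
class RecordEntry:
    query_id: int
    record_id: int
    rank: int
    weight: float
    feedback_iteration: int   # assumption: 0 = original search, 1+ = feedback
    action_code: str          # hypothetical code: displayed / selected as relevant
    marc_data: str            # full MARC data, including local call numbers


@dataclass
class TransactionLogRecord:
    password: str
    logon_time: datetime
    full_query: str
    parsed_query: List[ParsedTerm]
    selected_call_numbers: List[str]
    clusters: List[ClusterEntry] = field(default_factory=list)
    records: List[RecordEntry] = field(default_factory=list)
    # (start, end) times, one pair per relevance feedback cycle
    feedback_times: List[Tuple[datetime, datetime]] = field(default_factory=list)
    completion_time: Optional[datetime] = None

    def add_record(self, entry: RecordEntry) -> None:
        """Append a retrieved record, enforcing the assumed per-cycle limit."""
        same_cycle = [r for r in self.records
                      if r.feedback_iteration == entry.feedback_iteration]
        if len(same_cycle) >= MAX_DISPLAYED:
            raise ValueError("display limit reached for this search cycle")
        self.records.append(entry)

    def feedback_cycles(self) -> int:
        """Return the highest feedback iteration number seen so far."""
        return max((r.feedback_iteration for r in self.records), default=0)
```

Keeping the feedback iteration number on each record, rather than storing feedback results in a separate list, mirrors the statement above that the same type of information is kept for records retrieved during relevance feedback searches as for the original search.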