Golden Retriever
An information retrieval project by Peter Peerdeman and Timen Olthof
Information Retrieval Group 13
Project Progress: Introductory work
1. Read the assignment and tutorial
2. Downloaded and unzipped the data files
3. Used Lucene to index the dataset
4. Tried out Luke to examine the index
Project Progress: Assignment
1. Found a Lucene API tutorial online
2. Got the example code running (porting)
3. Adjusted the example code to fit the information needs of the assignment
Project Progress: Findings
• Lucene code structure
• Lucene API is easy to use
• Very different results for the plain and XML corpora, because of their different fields
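The effect of fields on results can be illustrated with a toy fielded index. This is only a sketch in plain Java, not Lucene code and not the project's implementation: the class `FieldedIndex` and the field names `contents`, `title` and `body` are hypothetical, chosen on the assumption that a plain-text document is indexed under a single field while an XML document is split into several.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical toy index for illustration only; Lucene's real index is far richer.
final class FieldedIndex {
    // field name -> term -> ids of the documents containing that term in that field
    private final Map<String, Map<String, Set<Integer>>> postings = new HashMap<>();

    // Tokenize the text on non-word characters, lower-case it, and record each term.
    void add(int docId, String field, String text) {
        Map<String, Set<Integer>> byTerm =
                postings.computeIfAbsent(field, f -> new HashMap<>());
        for (String term : text.toLowerCase(Locale.ROOT).split("\\W+")) {
            if (!term.isEmpty()) {
                byTerm.computeIfAbsent(term, t -> new TreeSet<>()).add(docId);
            }
        }
    }

    // Return the ids of documents whose given field contains the term (empty if none).
    Set<Integer> search(String field, String term) {
        return postings.getOrDefault(field, Map.of())
                       .getOrDefault(term.toLowerCase(Locale.ROOT), Set.of());
    }
}
```

With a plain-text document added under `contents` and an XML document added under `title` and `body`, the same term matches different documents depending on which field the query targets, which is one way the two corpora can end up returning very different results.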
Project Progress: Plans
• Add multiple-keyword input and stemming / case folding to the query
• Write a report on our topic exploration process and findings
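The planned query processing could look roughly like the sketch below. It is plain Java rather than Lucene code: the class `QueryNormalizer` is hypothetical, and the suffix stripping is a deliberately naive stand-in for a real stemmer such as Porter's, assumed here only to show where stemming would fit after splitting and case folding.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical helper for illustration; not part of Lucene or the project code.
final class QueryNormalizer {

    // Naive suffix stripping (assumption: stands in for a real stemming algorithm).
    static String stem(String term) {
        if (term.length() > 5 && term.endsWith("ing")) {
            return term.substring(0, term.length() - 3);
        }
        if (term.length() > 3 && term.endsWith("s") && !term.endsWith("ss")) {
            return term.substring(0, term.length() - 1);
        }
        return term;
    }

    // Split a multi-keyword query on whitespace, case-fold each keyword, then stem it.
    static List<String> normalize(String query) {
        List<String> terms = new ArrayList<>();
        for (String raw : query.trim().split("\\s+")) {
            if (raw.isEmpty()) {
                continue; // an empty or blank query yields no terms
            }
            terms.add(stem(raw.toLowerCase(Locale.ROOT)));
        }
        return terms;
    }
}
```

For example, `QueryNormalizer.normalize("Golden Retrievers Indexing")` yields the terms `golden`, `retriever` and `index`. The same normalization would have to be applied to the documents at indexing time, otherwise the stemmed query terms would not match the indexed terms.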