[1188] in magellan

home help back first fref pref prev next nref lref last post

Web Search Service October 2003 Status Report

daemon@ATHENA.MIT.EDU (Joanne M. Hallisey)
Fri Nov 21 09:51:01 2003

Mime-Version: 1.0
Message-Id: <p0523010bbbe3d3f82c6c@[18.152.2.178]>
Date: Fri, 21 Nov 2003 09:41:47 -0500
To: Magellan@mit.edu, search-engines@mit.edu
From: "Joanne M. Hallisey" <hallisey@MIT.EDU>
Content-Type: text/plain; charset="us-ascii" ; format="flowed"

October 2003 Status Report

Project Name: Web Search Service
Project Leader: Joanne Hallisey
Report Date: November 3, 2003
URL: http://web.mit.edu/is/discovery/search/

Accomplishments in October:
- Make test collections starting from the list of offices, and index 
URLs of the form http://web.mit.edu/{a,b,...}, stopping when the 
Google collection reaches close to our 300,000 url limit.
- Investigate how multiple urls pointing to a document impacts the 
Google search.
- Create test query pages and results look-and-feel for our user 
testing.  Remove vendor information from both, so that the testers 
won't know which search engine produces which results.
- analyze data from daily logs of search.mit.edu users.
- Begin financial comparison of Google service to Ultraseek service. 
Requested information on contract size of 1 million and 2 million 
documents for better comparison
- Begin draft of initial Discovery findings and comparisons.
- Continued comparison of Ultraseek and Google search capabilities. 
Noted that they have different algorithms which makes it difficult to 
compare.- Determined that 300,000 documents was not adequate for 
meeting MIT's needs.
- Continued "tests" of small collections to compare results. Found 
that Google "discovers" and crawls some directories we did not expect 
it to, such as "http://web.mit.edu/activity". We restricted the 
crawling to only crawl URLs of the form http://web.mit.edu/lockername
- Ran daily scripts for inktomilogs.
- Meet with Barbara Johnson to begin preparation for usability 
testing to compare the two search engines.
- Decided to test at least 3 in each of the following categories: 
students, non-MIT and staff.

Goals for November:
- Recruit testers.
- Prepare test environment.
- Prepare test materials/tasks.
- Schedule tests.
- Continue to refine requirements and begin thinking about the recommendation.
- Recruit someone from Training and Publications to assist with the 
documentation for the search service.
- Begin to draft a business plan.
- Begin to draft a support and service plan.

Ongoing tasks
- Ask Google about SSL capacity for later use - August
- Review rules for existing search service
- Determine final definition for data sets for new service
- Define rules for new service
- Contact Resources to determine how to appropriately thank Google
- Determine solution for conversion of query forms if required.

Next community milestone:
None scheduled.

Issues:
Comparing the two agreements is difficult as the terms are based on 
annual fees for Google licensing and a one time license fee for 
Ultraseek.

Key learnings:


Team dynamics:

Good.

Additional comments:

None

home help back first fref pref prev next nref lref last post