| Tasks | Traditional web searching | twURL web collection process |
|---|---|---|
| Search Engine Querying | Sequential queries to selected search engines, e.g. Altavista, then
Fast, then ... Typical collection: 10-50 URLs |
Use powerful desktop meta-searchers, WebFerret and Bullseye, to collect
URLs from more than a dozen search engines, concurrently. Typical Collection: 100-2000 URLs |
| Filtering duplicates and duds; Qualifying URLs relevance and authoritative | Selecting a few URLs with good descriptions, viewing pages, building a list of URLs | Using meta-searchers to double-check queried terms; twURL triage (keepers, losers, unsure) iteratively to sort, filter, rate URLs; off-line browsing of downloaded web pages for rapid viewing and authentication |
| Categorizing URLs | Text templates, questions, cut-and-paste of search engine result pages | Keywords from controlled vocabularies for concept clustering, links among pages to show most popular, hubs and authorities; Internet domain and site ordering to see distributions and types of content; outline and graph presentations of distribution data |
| Generating Reports | Annotated results of categorization | Multi-linked HTML pages with URL thumbnails, pie-chart and tables for navigating sites, keyword indexes to URLs |
| Updating and managing collections | Databases, text lists, HTML pages | URL bases that can be merged to produce unions or intersections of topics, audit trails of rating and selection decisions for repeated searches; expansions of URL bases by adding linked to URLs |
Time required is usually 3-8 hours, depending on amount of filtering required, search engine and net behavior, and parameters of report generation. Further refinement and quality assessment, including reading all web pages off-line, development of effective keyword categories, and analysis of trends may run from 2-5 days and may be negotiated at a project rate.
Deliverables include:
To order a twURL Web Brief or inquire about additional web searching
and collection services, contact Susan
Gerhart, Research Outlet and Integration,
281-486-8480 Houston, Texas.