-
Notifications
You must be signed in to change notification settings - Fork 0
search engine - indexer, crawler, query engine
codelurker/TinySearch-Engine
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
By: Ricky A. Melgares '11 The sources are arranged in the following directory: /shared/ html.h html.c (HTML parsers, where getNextUrl() hides) hash.h hash.c (hash functions) /data/ (where the data output from the crawler is stored) *****CRAWLER******** /crawler/src/ crawler.h crawler.c (the main entry, the crawler main logic is here) crawler_test.h (test script that tests crawler for arguments & equality of original and re-created index) Makefile (make file for compiling the indexer) temp (temporary file to store data) ==> To build the crawler goto /crawler/src and type "make" ==> To run the crawler ex) "./crawler http://www.cs.dartmouth.edu/ ../../data 3" ==> To test the crawler goto /crawler/src and type "./crawler_test.sh" This script will: 1 Test the argument checking of crawler. 2 Test for an invalid specified directory. 3 Test the termination of crawler. *****INDEXER**** /indexer/util (where the general purpose utilities file is located) input.c input.h (file access and input processing utilities are located) /indexer/src/ indexer.c (the main entry, the indexer main logic is here) dictionary.h dictionary.c (contains index data structure library & related functions) indexer_test.h (test script that tests crawler for arguments & equality of original and re-created index) Makefile (make file for compiling the indexer) ==> To build the indexer goto /lab6/indexer/src and type "make" ==> To run the indexer ex) "./indexer ../../data index.dat" ==> To test the indexer goto /lab6/indexer/src and type "./indexer_test.sh" This script will: 1 Test the argument checking of indexer. 2 Test for an invalid specified directory. 3 Test the termination of indexer. 4 Compare the output of the index.dat file and the re-created indexR.dat file *******QUERY******** /queryengine/util (where the general purpose utilities file is located) input.c input.h (file access and input processing utilities are located) util.c util.h (misc. functions for dealing with lists among other things) /queryengine/src/ query.c (the main entry, the indexer main logic is here) dictionary.h dictionary.c (contains index data structure library & related functions) queryengine_test.h (test script that query engine) logic.c logic.h (files for handling intersection, union of lists, etc.) hash.h hash.c (hash functions) data data.sort (temp data files for sorting) Makefile (make file for compiling the indexer) ==> To build the queryengine goto /lab6/queryengine/src and type "make" ==> To run the queryengine ex) "./query index.dat ../../data " ==> To test the queryengine harness goto /lab6/indexer/src and type: make -f Makefile_normal -This function will test the various functions in query engin
About
search engine - indexer, crawler, query engine
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published