Skip to content

codelurker/TinySearch-Engine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

By: Ricky A. Melgares '11

The sources are arranged in the following directory:
/shared/
	html.h html.c (HTML parsers, where getNextUrl() hides)
	hash.h hash.c (hash functions)

/data/   (where the data output from the crawler is stored)

*****CRAWLER********
/crawler/src/
	crawler.h crawler.c (the main entry, the crawler main logic is here)
	crawler_test.h (test script that tests crawler for arguments & equality of 
	                original and re-created index)
	Makefile (make file for compiling the indexer)
	temp (temporary file to store data)
	
	==> To build the crawler
	goto /crawler/src and type "make"

	==> To run the crawler
	ex) "./crawler http://www.cs.dartmouth.edu/ ../../data 3"

	==> To test the crawler
	goto /crawler/src and type "./crawler_test.sh"
	     This script will: 
	     1 Test the argument checking of crawler.
	     2 Test for an invalid specified directory.
	     3 Test the termination of crawler.

*****INDEXER****
/indexer/util (where the general purpose utilities file is located)
	input.c input.h (file access and input processing utilities are located)

/indexer/src/
	indexer.c (the main entry, the indexer main logic is here)
	dictionary.h dictionary.c (contains index data structure library & related functions)
	indexer_test.h (test script that tests crawler for arguments & equality of 
	                original and re-created index)
	Makefile (make file for compiling the indexer)
	
	==> To build the indexer
	goto /lab6/indexer/src and type "make"

	==> To run the indexer
	ex) "./indexer ../../data index.dat"

	==> To test the indexer
	goto /lab6/indexer/src and type "./indexer_test.sh"
	     This script will: 
	     1 Test the argument checking of indexer.
	     2 Test for an invalid specified directory.
	     3 Test the termination of indexer.
	     4 Compare the output of the index.dat file and the re-created indexR.dat file

*******QUERY********
/queryengine/util (where the general purpose utilities file is located)
	input.c input.h (file access and input processing utilities are located)
	util.c util.h (misc. functions for dealing with lists among other things)

/queryengine/src/
	query.c (the main entry, the indexer main logic is here)
	dictionary.h dictionary.c (contains index data structure library & related functions)
	queryengine_test.h (test script that query engine)
	logic.c logic.h (files for handling intersection, union of lists, etc.)
	hash.h hash.c (hash functions)
	data data.sort (temp data files for sorting)
	Makefile (make file for compiling the indexer)
	
	==> To build the queryengine
	goto /lab6/queryengine/src and type "make"

	==> To run the queryengine
	ex) "./query index.dat ../../data "

	==> To test the queryengine harness
	goto /lab6/indexer/src and type: make -f Makefile_normal
		-This function will test the various functions in query engin

About

search engine - indexer, crawler, query engine

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published