irSearch is a toy search engine written in C++ that indexes HTML pages and retrieves them using a few simple IR models.
This project was made for the course Information Retrieval - Web Search Engine, offered at UFMG. For more information about the course, see http://homepages.dcc.ufmg.br/~nivio/br/teaching-ri-14.php
irSearch is currently running with a 5 GB indexed collection. You can test it yourself at http://greenwich.lbd.dcc.ufmg.br/lizardo/ri/
The models implemented in irSearch are: Okapi BM25, Cosine, PageRank, Boolean and some combinations of them.
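As an illustration of one of the listed models, here is a minimal sketch of the textbook Okapi BM25 per-term score. It is not taken from the irSearch sources: the names `Bm25Params`, `bm25_idf`, and `bm25_term_score` are hypothetical, the values `k1 = 1.2` and `b = 0.75` are common defaults rather than the ones irSearch uses, and the IDF variant (with `1 +` inside the logarithm) is just one of several in circulation.

```cpp
#include <cmath>
#include <cstddef>

// Hypothetical parameter bundle. k1 and b are commonly used defaults,
// assumed here for illustration; irSearch may use different values.
struct Bm25Params {
    double k1 = 1.2;   // term-frequency saturation
    double b  = 0.75;  // strength of document-length normalization
};

// Inverse document frequency. The "1 +" inside the log keeps the value
// non-negative even for terms that occur in most documents.
double bm25_idf(std::size_t num_docs, std::size_t doc_freq) {
    const double n  = static_cast<double>(num_docs);
    const double df = static_cast<double>(doc_freq);
    return std::log(1.0 + (n - df + 0.5) / (df + 0.5));
}

// BM25 contribution of a single query term to a single document.
//   tf          : occurrences of the term in the document
//   doc_len     : length of the document (in terms)
//   avg_doc_len : average document length in the collection
//   num_docs    : number of documents in the collection
//   doc_freq    : number of documents containing the term
double bm25_term_score(double tf, double doc_len, double avg_doc_len,
                       std::size_t num_docs, std::size_t doc_freq,
                       const Bm25Params& p = Bm25Params()) {
    if (tf <= 0.0) {
        return 0.0;  // the term does not occur in this document
    }
    const double norm = p.k1 * (1.0 - p.b + p.b * doc_len / avg_doc_len);
    return bm25_idf(num_docs, doc_freq) * tf * (p.k1 + 1.0) / (tf + norm);
}
```

Under these assumptions, a document's score for a query would be the sum of `bm25_term_score` over the query terms. How irSearch combines BM25 with the other models is not described here, so that step is left out.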