International Journal of Computer Science and Security | |
A Vertical Search Engine - Based on Domain Classifier | |
Rajashree Shettar1  Rahul Bhuptani1  | |
[1] $$ | |
关键词: domain classifier; inverted index; page rank; relevance; vertical search; | |
DOI : | |
来源: Computer Science and Security | |
【 摘 要 】
The World Wide Web is growing exponentially and the dynamic, unstructured nature of the web makes it difficult to locate useful resources. Web Search engines such as Google and Alta Vista provide huge amount of information many of which might not be relevant to the users query. In this paper, we build a vertical search engine which takes a seed URL and classifies the URLs crawled as Medical or Finance domains. The filter component of the vertical search engine classifies the web pages downloaded by the crawler into appropriate domains. The web pages crawled is checked for relevance based on the domain chosen and indexed. External users query the database with keywords to search; The Domain classifiers classify the URLs into relevant domain and are presented in descending order according to the rank number. This paper focuses on two issues – page relevance to a particular domain and page contents for the search keywords to improve the quality of URLs to be listed thereby avoiding irrelevant or low-quality ones .
【 授权许可】
Unknown
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO201912040511440ZK.pdf | 433KB | download |