Implementation of a Hidden Web crawler

dc.contributor.author	MAHAMEDI, Soundous
dc.contributor.author	Supervisor: SAOUDI, Lalia
dc.date.accessioned	2023-05-24T08:26:03Z
dc.date.available	2023-05-24T08:26:03Z
dc.date.issued	2015-06-10
dc.description.abstract	Current-day crawlers retrieve content only from the publicly indexable Web, i.e., the set of web pages reachable purely by following hypertext links, ignoring search forms and pages that require authorization or prior registration. In particular, they ignore the tremendous amount of high quality content "hidden" behind search forms, in large searchable electronic databases. In this work, we provide a framework for addressing the problem of extracting content from this hidden Web, that is why we have built a task-specific hidden Web crawler called the Intelligent Hidden Web Crawler (IHiWC). We describe the architecture of IHiWC and present a number of new techniques that went into its design, approach and implementation. We also present results from experiments we conducted to test and validate our techniques.	en_US
dc.identifier.uri	http://dspace.univ-msila.dz:8080//xmlui/handle/123456789/38681
dc.language.iso	en	en_US
dc.publisher	University of M'sila	en_US
dc.subject	Deep crawler, Hidden Web Crawling, forms classification, forms submission	en_US
dc.title	Implementation of a Hidden Web crawler	en_US
dc.type	Thesis	en_US

Files

Now showing 1 - 1 of 1

Now showing 1 - 1 of 1