广东永利总站ylzz55平洲电子有限公司

教学科研

课程建设

当前位置: 网站首页 > 教学科研 > 专著教材 > 学术专著 > 正文

《Query Selection in Deep Web Crawling》

发布时间：2024年01月16日来源：浏览量：

Query Selection in Deep Web Crawling,2014-03,王焱.The deep web is the content that is dynamically generated from data sources such as databases or file system. Unlike surface web where web pages are collected by following the hyperlinks embedded inside collected pages， data from a deep web data source is guarded by a search interface and only can be retrieved by queries. The amount of data in deep web exceeds by far that of the surface web. This calls for deep web crawlers to excavate the data so that they can be used， indexed， and searched upon in an integrated environment. Crawling deep web is the process of collecting data from search interfaces by issuing queries. One of the major challenges in crawling deep web is the selection of the queries so that most of the data can be retrieved at a low cost. This work first comprehensively introduces the state-of-art work in query selection techniques for crawling， then in-depth analyzes the remaining problems， such as cold start problem and return limit problem，and finally presents a novel technique to address them.

上一页：《我国对美离岸服务外包影响因素与竞争力研究》
下一页：《基于企业知识生态系统的动态能力影响机制研究》

学院南路校区地址：北京市海淀区学院南路39号邮编：100081

沙河校区地址：北京市昌平区沙河高教园区邮编：102206 京ICP备05004636号京公网安备110402430071号