With the exponential growth of network information resources and the dynamic changes of network information resources, the information retrieval services provided by traditional search engines can not meet the growing demand for personalized services, and are facing great challenges. What kind of network access strategy to improve the search efficiency has become one of the main problems in the research of professional search engine web crawler in recent years.<br>Web crawler is an important part of search engine. At present, the popular search engines are Baidu, Google and so on. It can be said that without the existence of web crawler, there may be no search engine. For the consideration of commercial confidentiality, the technical insiders of crawler system used by various search engines are generally not open, and the existing literature is only an abstract introduction. To a large extent, web search engines, digital libraries and other web applications rely on the HTML document information obtained by web crawlers.<br>By crawling and analyzing the video data, this paper can provide decision support for users when watching movies.<br>
正在翻译中..