mediashare/crawler
Composer 安装命令:
composer require mediashare/crawler
包简介
Crawl urls from a webpage and provide a DomCrawler with Scraper Library
README 文档
README
💫 Crawl urls from a webpage and provide a DomCrawler with Scraper Library.
DomCrawler
Scraper use DomCrawler library. This is symfony component for DOM navigation for HTML and XML documents. You can retrieve Documentation Here.
Installation
composer require mediashare/crawler
Usage
<?php require 'vendor/autoload.php'; use Mediashare\Crawler\Crawler; $crawler = new Crawler("https://mediashare.fr"); $crawler->run(); dump($crawler);
With Config
<?php require 'vendor/autoload.php'; use Mediashare\Crawler\Crawler; use Mediashare\Crawler\Config; $config = new Config(); $config->setWebspider(true); // All website crawling $config->setVerbose(true); // Prompt progress bar $config->setPathRequires(['/Kernel/']); // Not crawl other path $config->setPathExceptions(['/CodeSnippet/']); // Not crawl this path $crawler = new Crawler("https://mediashare.fr", $config); $crawler->run(); dump($crawler);
统计信息
- 总下载量: 271
- 月度下载量: 0
- 日度下载量: 0
- 收藏数: 3
- 点击次数: 3
- 依赖项目数: 2
- 推荐数: 0
其他信息
- 授权协议: MIT
- 更新时间: 2019-12-22