sukohi/shellless
Composer 安装命令:
composer require sukohi/shellless
包简介
A PHP package to extract readable text from HTML.
README 文档
README
A PHP package to extract readable text from HTML.
Installation
Execute the next command.
composer require sukohi/shellless:1.*
Usage
use Sukohi\Shellless\Shellless;
$html = file_get_contents('http://example.com/');
$shellless = new Shellless();
$result = $shellless->extract($html);
echo $result->title; // Page title
echo $result->best_text; // The longest text
echo $result->full_text; // Joined text if more than 100 characters length.
print_r($result->all_texts, true);
Options
$shellless->setOptions([
'join_step' => 5,
'min_text_length' => 100
]);
Algorithm
- Join close texts if less than 5 HTML tags between them.
- Pick up texts if more than 100 characters length.
License
This package is licensed under the MIT License.
Copyright 2017 Sukohi Kuhoh
统计信息
- 总下载量: 30
- 月度下载量: 0
- 日度下载量: 0
- 收藏数: 0
- 点击次数: 0
- 依赖项目数: 0
- 推荐数: 0
其他信息
- 授权协议: MIT
- 更新时间: 2017-03-13