mirror of
https://git.FreeBSD.org/ports.git
synced 2024-11-23 00:43:28 +00:00
b22b3c01b6
- remove comments, or tags from HTML snippets - extract base url from HTML snippets - translate entites on HTML strings - encoding mulitpart/form-data - convert raw HTTP headers to dicts and vice-versa - construct HTTP auth header - converting HTML pages to unicode - RFC-compliant url joining - sanitize urls (like browsers do) - extract arguments from urls WWW: http://github.com/scrapy/w3lib Feature safe: yes
15 lines
477 B
Plaintext
15 lines
477 B
Plaintext
This is a Python library of web-related functions, such as:
|
|
|
|
- remove comments, or tags from HTML snippets
|
|
- extract base url from HTML snippets
|
|
- translate entites on HTML strings
|
|
- encoding mulitpart/form-data
|
|
- convert raw HTTP headers to dicts and vice-versa
|
|
- construct HTTP auth header
|
|
- converting HTML pages to unicode
|
|
- RFC-compliant url joining
|
|
- sanitize urls (like browsers do)
|
|
- extract arguments from urls
|
|
|
|
WWW: http://github.com/scrapy/w3lib
|