not-raspberry/aio_crawler

aio_crawler

Single site web crawler using aiohttp.
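
As a rough illustration of what "single site" could mean in practice, the sketch below keeps only links that resolve to the same host as the page they were found on. It assumes the crawler stays on the host of SITE_ADDRESS, which the README does not spell out. The function name `same_site_links` is hypothetical and not part of aio_crawler; only the standard library is used.

```python
from urllib.parse import urldefrag, urljoin, urlsplit


def same_site_links(page_url, hrefs):
    """Resolve hrefs against page_url and keep those on the same host.

    Hypothetical helper for illustration; not taken from aio_crawler.
    """
    site = urlsplit(page_url).netloc
    kept = []
    for href in hrefs:
        # Make the link absolute and drop any "#fragment" part.
        absolute, _fragment = urldefrag(urljoin(page_url, href))
        parts = urlsplit(absolute)
        is_http = parts.scheme in ("http", "https")
        if is_http and parts.netloc == site and absolute not in kept:
            kept.append(absolute)
    return kept
```

Comparing `netloc` treats `example.com` and `www.example.com` as different sites; a real crawler might want a looser rule.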

Usage

Install from source:

./setup.py install

Then list the available options:

$ aio_crawler --help
Usage: aio_crawler [OPTIONS] SITE_ADDRESS

Crawl the website and print results to stdout.

Options:
-c, --concurrency INTEGER  Number of parallel downloads.
-t, --timeout FLOAT        Timeout of each single request.
-v, --verbose
--help                     Show this message and exit.
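
The --concurrency and --timeout options suggest a bounded number of downloads in flight, each with its own time limit. The sketch below shows one common way to get that behaviour with plain asyncio. It is not necessarily how aio_crawler does it. `fake_fetch`, `fetch_limited`, `crawl_all` and `run` are hypothetical names, and `fake_fetch` merely sleeps where a real crawler would make an aiohttp request.

```python
import asyncio


async def fake_fetch(url, delay):
    """Stand-in for an HTTP download (hypothetical; sleeps instead of fetching)."""
    await asyncio.sleep(delay)
    return "body of " + url


async def fetch_limited(semaphore, url, delay, timeout):
    """Fetch one URL while holding a semaphore slot, giving up after `timeout` seconds."""
    async with semaphore:
        try:
            body = await asyncio.wait_for(fake_fetch(url, delay), timeout)
        except asyncio.TimeoutError:
            body = None  # mark the request as timed out
        return url, body


async def crawl_all(jobs, concurrency, timeout):
    """Run every (url, delay) job with at most `concurrency` running at once."""
    semaphore = asyncio.Semaphore(concurrency)
    tasks = [fetch_limited(semaphore, url, delay, timeout) for url, delay in jobs]
    return dict(await asyncio.gather(*tasks))


def run(jobs, concurrency=2, timeout=0.05):
    """Synchronous entry point that drives the coroutines on a fresh event loop."""
    loop = asyncio.new_event_loop()
    try:
        return loop.run_until_complete(crawl_all(jobs, concurrency, timeout))
    finally:
        loop.close()
```

In this sketch the timeout clock starts only once a request has acquired its semaphore slot, so time spent waiting in the queue does not count against it.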

Development

It's strongly advised to use a virtualenv.

Install dependencies and the CLI hook:

./setup.py develop

Install test dependencies:

pip install -e '.[tests]'

System requirements

Python 3.5.

If your OS does not ship Python 3.5, use pyenv. It's miserable but better than nothing.
