Skip to content

TuRanLisp/dryscrape

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

60 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Overview

Author: Niklas Baumstark

dryscrape is a lightweight web scraping library for Python. It uses a headless Webkit instance to evaluate Javascript on the visited pages. This enables painless scraping of plain web pages as well as Javascript-heavy “Web 2.0” applications like Facebook.

It is built on the shoulders of capybara-webkit's webkit-server. A big thanks goes to thoughtbot, inc. for building this excellent piece of software!

Installation, Usage, API Docs

Documentation can be found at dryscrape's ReadTheDocs page.

Contact, Bugs, Contributions

If you have any problems with this software, don't hesitate to open an
issue on Github or open a pull request or write a mail to niklas baumstark at Gmail.

About

A lightweight Python library that uses Webkit to enable easy scraping of dynamic, Javascript-heavy web pages

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published