htmlol

Utility for HTML scraping, DOM parsing, and content extraction.

npm install htmlol
2 downloads in the last week
2 downloads in the last month

htmlol

Utility for HTML scraping, DOM parsing, and content extraction. More features/tests to come.

Usage:

// Module
var htmlol = require('htmlol');

// Initialize crawler object
var crawler = new htmlol.Crawler();

// Let's get some content
crawler.get("http://i.imgur.com/LF4i8.png", console.log);
crawler.get("http://speckyboy.com/2012/11/26/dynamic-open-source-javascript-libraries/", console.log);
crawler.get("http://techcrunch.com/2012/11/26/twitter-ceo-dick-costolo-twitter-sees-a-billion-tweets-every-two-and-a-half-days-users-can-download-their-entire-archive-by-year-end/", console.log);
npm loves you