scrapingai
TypeScript icon, indicating that this package has built-in type declarations

1.0.1 • Public • Published

scarapingai

version license

Extract data from websites automatically with AI or build web scraping agents for bulk URL scraping.

Installation

Install it via npm:

npm i scarapingai

Highlights

  • Built-in residential proxies and captcha handling
  • Smart ad blocker, popup blocker for better performance
  • Accept cookie consent automatically to close cookie banners
  • Compatible with Puppeteer, Playwright for browser automation and testing.
  • Background jobs for bulk URL scraping with automatic retry & error handling.

Usage

Get your api key from here

const agenty = new Agenty(API_KEY);
const data = await agenty.browser.extract("https://example.com");
console.log(data);

Extract

To auto-extract product, jobs listing, SEO meta data, schema JSON etc from given URL

const data = await agenty.browser.extract("https://example.com");
console.log(data);

Scrape

To extract data from given CSS selector or custom jQuery function

const data = await agenty.browser.scrape("https://example.com");
console.log(data);

Screenshot

To capture a screenshot for given URL

const data = await agenty.browser.screenshot("https://example.com");
console.log(data);

PDF

To convert webpage into PDF.

const data = await agenty.browser.pdf("https://example.com");
console.log(data);

Content

To get HTML content from a URL.

const data = await agenty.browser.content("https://example.com");
console.log(data);

License

scrapingai is a project by Agenty, released under the MIT License.

Package Sidebar

Install

npm i scrapingai

Homepage

agenty.com/

Weekly Downloads

2

Version

1.0.1

License

MIT

Unpacked Size

50.4 kB

Total Files

24

Last publish

Collaborators

  • vickyrathee