@nuclia/extractor

1.0.0 • Public • Published

Web page extractor

Extracts the HTML content of a web page.

Objective

When you push regular links to your Nuclia Knowledge Box, Nuclia processing extracts the content of each page. That only works if the page is reachable from the internet, which may not be the case for your web pages. This extractor extracts the content of your web pages locally so that the Nuclia Sync Agent can push it to your Nuclia Knowledge Box.

It must be used together with the Nuclia Sync Agent and deployed on the same machine. It runs on port 8091.

Installation

npm install @nuclia/extractor

Usage

npm start
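
Once the extractor has been started, one way to confirm that something is listening on port 8091 is a plain TCP connection test from the same machine. The sketch below is only an illustration: `isPortOpen` is a hypothetical helper, not part of `@nuclia/extractor`, and the host `127.0.0.1` and the one-second timeout are assumptions. It checks that the port accepts connections, not that the process behind it is the extractor.

```typescript
import * as net from "node:net";

// Hypothetical helper (not part of @nuclia/extractor): resolves to true if a
// TCP connection to host:port succeeds within timeoutMs, false otherwise.
function isPortOpen(
  port: number,
  host: string = "127.0.0.1", // assumption: extractor and check run on the same machine
  timeoutMs: number = 1000 // assumption: arbitrary one-second limit
): Promise<boolean> {
  return new Promise((resolve) => {
    const socket = net.createConnection({ port, host });
    const finish = (open: boolean): void => {
      socket.destroy(); // always release the socket before reporting
      resolve(open);
    };
    socket.setTimeout(timeoutMs);
    socket.once("connect", () => finish(true));
    socket.once("timeout", () => finish(false));
    socket.once("error", () => finish(false));
  });
}

// 8091 is the port stated in this README; the messages are illustrative.
isPortOpen(8091).then((open) => {
  console.log(
    open
      ? "Something is listening on port 8091"
      : "Nothing is listening on port 8091"
  );
});
```

If the check reports that nothing is listening, the Nuclia Sync Agent on that machine would presumably not be able to reach the extractor either.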

License

MIT

Unpacked Size

4.89 kB

Total Files

4

Collaborators

  • ebrehault
  • ramon-nuclia
  • ramonnb