Search results

146 packages found

Extracts plain text from Markdown strings

published 1.0.2 2 months ago

A simple, Twitter-aware tokenizer.

published 7.1.0 2 months ago

Module for fast CSS selector tokenization from a string

published 0.9.5 7 years ago

Tokenizer for tokenizing sentences, for BERT or other NLP preprocessing.

published 1.0.2 5 years ago

Lemmatize and tokenize strings that contain Chinese and English words

published 1.0.0 5 years ago

A simple iterative lexer written in TypeScript

published 0.7.4 5 years ago

Uses snapdragon to tokenize a single JavaScript block comment into an object, with description, tags, and code example sections that can be passed to any other comment parsers for further parsing.

published 3.0.1 a year ago

Transform hypertext strings (e.g., HTML, Markdown) into plain text for natural language processing (NLP) normalization

published 1.0.0 2 months ago

Tokenizes utterances that contain a mix of English and Chinese words.

published 1.1.0 6 years ago

Module for fast element-to-selector matching (with custom :hover, :focus, etc. handling)

published 0.9.2 7 years ago

Personal tokenization utility

published 2.0.0 4 years ago

Lexing JSON

published 1.2.0 2 years ago

Tokenize Excel formulas

published 2.3.2 3 years ago

A complex string-based CSS management library

published 0.8.12 6 years ago

A small library for building toy lexers.

published 0.3.2 7 years ago

Used to interact with the OpenToken API.

published 1.0.1 7 years ago

String tokenizer for Node.js using ICU's BreakIterators

published 1.0.4 7 years ago

Uses esprima to extract line and block comments from a string of JavaScript. Also optionally parses code context (the next line of code after a comment).

published 1.1.0 5 years ago

A replacement library for JavaScript's standard JSON functions and more

published 0.1.5 3 years ago

Get word counts and frequencies on a per-speaker or per-category basis, or as an aggregate

published 0.0.1721 2 years ago