keywords:tokenize - npm search

stopmarkdown

Extracts plain text from Markdown strings

bent10

published 1.0.2 2 months ago

M

Q

P

happynodetokenizer

A simple, Twitter-aware tokenizer.

phughes

published 7.1.0 2 months ago

M

Q

P

selector-tokenizer

Module for fast css selector tokenization from string

eugeneford

published 0.9.5 7 years ago

M

Q

P

sentence-tokenization

Tokenizer for tokenizing sentences, for BERT or other NLP preprocessing.

huhaoyu

published 1.0.2 5 years ago

M

Q

P

imago-mixed-eng-chi-tokenizer

Lemmize and tokenize string which contains Chinese and English words

imago.ai

published 1.0.0 5 years ago

M

Q

P

tinylex

A simple iterative lexer written in TypeScript

jabney

published 0.7.4 5 years ago

M

Q

P

@nojaja/tokenize-comment

Uses snapdragon to tokenize a single JavaScript block comment into an object, with description, tags, and code example sections that can be passed to any other comment parsers for further parsing.

nojaja

published 3.0.1 a year ago

M

Q

P

nomark

Transform hypertext strings (e.g., HTML, Markdown) into plain text for natural language processing (NLP) normalization

bent10

published 1.0.0 2 months ago

M

Q

P

mixed-english-and-chinese-tokenizer

Tokenizes utterances that contain a mix of English and Chinese words.

anton-ivanov

published 1.1.0 6 years ago

M

Q

P

selector-checker

Module for fast element to selector matching (with custom :hover, :focus etc. handing)

eugeneford

published 0.9.2 7 years ago

M

Q

P

@kessler/tokenize

personal tokenize util

kessler

published 2.0.0 4 years ago

M

Q

P

json-lexer

lexing json

finnpauls

published 1.2.0 2 years ago

M

Q

P

@modelmap/excel-formula-tokenizer

Tokenize Excel formulas

ngs

published 2.3.2 3 years ago

M

Q

P

virtual-stylesheets

A complex string based CSS managment library

eugeneford

published 0.8.12 6 years ago

M

Q

P

scanty

A small library for building toy lexers.

kroogs

published 0.3.2 7 years ago

M

Q

P

opentoken-lib

Used to interact with the OpenToken API.

andy.gertjejansen

published 1.0.1 7 years ago

M

Q

P

node-icu-tokenizer

String Tokenizer for Node.js using ICU's BreakIterators

imojinima

published 1.0.4 7 years ago

M

Q

P

extract-comments

Uses esprima to extract line and block comments from a string of JavaScript. Also optionally parses code context (the next line of code after a comment).