Search results
146 packages found
Extracts plain text from Markdown strings
A simple, Twitter-aware tokenizer.
- tokenise
- tokenize
- tokenising
- tokenizing
- tokeniser
- tokenizer
- token
- NLP
- language
- text
- strings
- stanford
- dlatk
Module for fast css selector tokenization from string
Tokenizer for tokenizing sentences, for BERT or other NLP preprocessing.
Lemmize and tokenize string which contains Chinese and English words
A simple iterative lexer written in TypeScript
Uses snapdragon to tokenize a single JavaScript block comment into an object, with description, tags, and code example sections that can be passed to any other comment parsers for further parsing.
Transform hypertext strings (e.g., HTML, Markdown) into plain text for natural language processing (NLP) normalization
Tokenizes utterances that contain a mix of English and Chinese words.
- tokenizer
- chinese
- english
- language
- tokenize
- nltk
- nlp
- natural-language-processing
- chinese-tokenizer
- contains-chinese
Module for fast element to selector matching (with custom :hover, :focus etc. handing)
Tokenize Excel formulas
A complex string based CSS managment library
A small library for building toy lexers.
Used to interact with the OpenToken API.
String Tokenizer for Node.js using ICU's BreakIterators
Uses esprima to extract line and block comments from a string of JavaScript. Also optionally parses code context (the next line of code after a comment).
A replacement library for JavaScript's standard JSON functions and more
- javascript
- js
- nodejs
- node
- json
- Traverse
- HasPath
- FindName
- FindValue
- GetValue
- SetValue
- Stringify
- Tablify
- ToIniText
- View more
get word counts / frequencies on a per-speaker or per-category basis, or as an aggregate