A simple, fast string tokenizer for lexical scanning operations.
Want to see pretty graphs? Log in now!
npm install pstrscan
|14||downloads in the last week|
|41||downloads in the last month|
|Last Published By|
|Version||0.1.6 last updated 9 months ago|
|Keywords||regular, expression, tokenizer, text, string, scanner|
|Dependents||depak, exparser, rulez|
PStringScanner is a simple string tokenizer that provides for lexical scanning operations on a string.
The original Ruby version was written in C, and is very fast. This version, while not in C, is as fast on short strings (under 32 Kb of characters), and almost twice as fast on large strings (million+ characters) than the other ports.
npm install -g pstrscan
Scanning a string means keeping track of and advancing a position (a zero-based index into the source string) and matching regular expressions against the portion of the source string after the position.
var PStrScan = require("pstrscan"); var s = new PStrScan("This is a test"); s.scan(/\w+/); // = "This" s.scan(/\w+/); // = null s.scan(/\s+/); // = " " s.scan(/\s+/); // = null s.scan(/\w+/); // = "is" s.hasTerminated(); // = false s.scan(/\s+/); // = " " s.scan(/(\w+)\s+(\w+)/); // = "a test" s.getMatch(); // = "a test" s.getCapture(1); // = "a" s.getCapture(2); // = "test" s.hasTerminated(); // = true
- More documentation specific to this implementation.
- Add a more comprehensive