wordsoap-regex

0.1.1 • Public • Published

wordsoap-regex

Build Status NPM version

Regular expressions for cleaning up dirty HTML output from Microsoft Word.

module.exports = {
    // from http://tim.mackey.ie/CleanWordHTMLUsingRegularExpressions.aspx
    msoTags: /<[\/]?(font|span|xml|del|ins|[ovwxp]:\w+)[^>]*?>/,
    msoAttributes: /<([^>]*)(?:class|lang|style|size|face|[ovwxp]:\w+)=(?:'[^']*'|""[^""]*""|[^\s>]+)([^>]*)>/,
}

License

ISC © Raine Lourie

Package Sidebar

Install

npm i wordsoap-regex

Weekly Downloads

6

Version

0.1.1

License

ISC

Last publish

Collaborators

  • raine