HTML CLEANUP


http://www.regular-expressions.info

make each link start on a new line
(?!\r\n)<a href
\r\n<a href

\r\n\r\n
\r\n

lines with only spaces
r\n *\r\n
\r\n

links across multiple lines
<br>\r\n(\w*)</a
<br>\1</a

put newlines before headings
<([hH]\d)
\r\n\r\n<\1

multi line links with at least one space on the second line
<a href([^<]*)\r\n *(.*)</a
<a href\1 \2</a

multi line links with at no space on the second line
<a href([^<]*)\r\n *(.*)</a
<a href\1 \2</a

remove unnecessary link information [weren't many]
ADD_DATE="\d+"
LAST_VISIT="\d+"
LAST_MODIFIED="\d+"

br not at end of line [not many, manually fix]
<br>.+\r\n

remove unnecesssary font color info
<font color="#000000">
[empty]
</font>
[empty]

THIS IS NOW IN MACRO
within a heading, put each link on a new line
</?[pP]> *USE BLOCK SETTING*
[empty]
<br> *USE BLOCK SETTING*
[empty]
\r\n
<br>\r\n

^&nbsp;<br>\r\n
[empty]
^(&nbsp;){1,}(<br>)* *\r\n
[empty]

remove stupid icon info
icon=".*"

 

and for chrome bookmarks:

\x0D\x0A<DT>
<br>\x0D\x0A

ADD_DATE=".*"
[empty]

<DL>
[empty]

</DL>
[empty]

LAST_MODIFIED=".*"
[empty]



 

last updated:    Tue 2021-01-12 1:01 AM