HTML cleanup tool & simplifier

HTML Washer is a tool that reduces an HTML document (or fragment) to basic HTML tags and attributes, producing clean HTML.

Upload an HTML or Word file to wash it (clean it up).



What exactly does it do?

  • Fixes or removes malformed tags and attributes, and adds required attributes where they are missing (e.g. alt attributes on images)
  • Converts the markup to HTML5 (if it is XHTML for example)
  • Reduces the markup to: <a href>, <body>, <h1>, <h2>, <h3>, <h4>, <h5>, <h6>, <head>, <hr>, <html>, <i>, <img src width height alt>, <li>, <ol>, <p>, <ruby>, <strong>, <table>, <tbody>, <td colspan rowspan>, <th colspan rowspan>, <title>, <tr>, <ul>
  • Replaces <b> with <strong> and <div> with <p>
  • Reformats the HTML (line breaks, indents)

Example

Input:
<p class="funny" onclick="alert('LOL')">bla bla</p>

Output:
<p>bla bla</p>
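
The reduction and replacement rules listed above can be illustrated with a minimal Python sketch. This is not HTML Washer's actual implementation, which is not shown here. It assumes an allow-list approach built on the standard library's html.parser module. The names ALLOWED, REPLACE, VOID, SKIP_CONTENT, Washer and wash are hypothetical. The sketch also assumes that text inside a dropped tag is kept, except for <script> and <style>, whose content is discarded. It does not repair broken nesting or reformat the output.

```python
from html import escape
from html.parser import HTMLParser

# Allowed tags and the attributes kept on each (taken from the list above).
ALLOWED = {
    "a": {"href"}, "body": set(), "h1": set(), "h2": set(), "h3": set(),
    "h4": set(), "h5": set(), "h6": set(), "head": set(), "hr": set(),
    "html": set(), "i": set(), "img": {"src", "width", "height", "alt"},
    "li": set(), "ol": set(), "p": set(), "ruby": set(), "strong": set(),
    "table": set(), "tbody": set(), "td": {"colspan", "rowspan"},
    "th": {"colspan", "rowspan"}, "title": set(), "tr": set(), "ul": set(),
}
REPLACE = {"b": "strong", "div": "p"}  # tag substitutions from the list above
VOID = {"hr", "img"}                   # allowed tags that take no closing tag
SKIP_CONTENT = {"script", "style"}     # assumption: drop these with their content


class Washer(HTMLParser):
    """Hypothetical washer: keeps allowed tags/attributes, drops the rest."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.out = []
        self.skip = 0  # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_CONTENT:
            self.skip += 1
            return
        tag = REPLACE.get(tag, tag)
        if tag not in ALLOWED:
            return  # drop the tag itself; its text content still passes through
        kept = [(k, v or "") for k, v in attrs if k in ALLOWED[tag]]
        if tag == "img" and not any(k == "alt" for k, _ in kept):
            kept.append(("alt", ""))  # add a missing alt attribute
        attr_text = "".join(f' {k}="{escape(v)}"' for k, v in kept)
        self.out.append(f"<{tag}{attr_text}>")

    def handle_endtag(self, tag):
        if tag in SKIP_CONTENT:
            if self.skip:
                self.skip -= 1
            return
        tag = REPLACE.get(tag, tag)
        if tag in ALLOWED and tag not in VOID:
            self.out.append(f"</{tag}>")

    def handle_data(self, data):
        if not self.skip:
            self.out.append(escape(data, quote=False))


def wash(html):
    """Return the input reduced to the allowed tags and attributes."""
    washer = Washer()
    washer.feed(html)
    washer.close()
    return "".join(washer.out)


print(wash('<p class="funny" onclick="alert(\'LOL\')">bla bla</p>'))  # <p>bla bla</p>
```

An allow-list is used here rather than a block-list because anything not explicitly permitted, including unknown or misspelled attributes, is removed by default.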