The \"old\" HTML/XHTML standards have a DTD (Document Type Definition) defined for them:
HTML 4.01 http://www.w3.org/TR/html401/sgml/dtd.html XHTML 1.0 http://ww
I think they did away with the old DTDs, now we just start HTML pages with:
Maybe the W3C will come out with one eventually.