W3C

HTML 5 Reference

A Web Developer’s Guide to HTML 5

W3C Editor’s Draft 23 March 2009

This version:
http://www.w3.org/TR/2009/ED-html5-author-20090323
Latest version:
http://www.w3.org/TR/
Previous version:
No previous versions.
Editors:
Lachlan Hunt (Opera Software ASA) lachlan.hunt@lachy.id.au

[Copyright licence pending]


Abstract

This document illustrates how to write HTML 5 documents, focussing on simplicity and practical applications for beginners while also providing in-depth information for more advanced web developers.

Status of this document

This section describes the status of this document at the time of its publication. Other documents may supersede this document. A list of current W3C publications and the latest revision of this technical report can be found in the W3C technical reports index at http://www.w3.org/TR/.

This document is an Editor’s Draft of the “HTML 5 Reference” produced by the HTML Working Group, part of the HTML Activity. The working group is working on HTML 5 (see the HTML 5 Editor's draft). The appropriate forum for comments on this document is public-html-comments@w3.org (public archive) or public-html@w3.org (public archive).

Publication as a Working Group Note does not imply endorsement by the W3C Membership. This is a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite this document as other than work in progress.

This document was produced by a group operating under the 5 February 2004 W3C Patent Policy. W3C maintains a public list of any patent disclosures made in connection with the deliverables of the group; that page also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent which the individual believes contains Essential Claim(s) must disclose the information in accordance with section 6 of the W3C Patent Policy.

Table of contents

  1. 1 Introduction
  2. 2 Getting Started with HTML 5
    1. 2.1 A Basic Document
    2. 2.2 Understanding Semantics
  3. 3 The HTML and XHTML Syntax
    1. 3.1 Syntactic Overview
    2. 3.2 The Syntax
      1. 3.2.1 DOCTYPE Declaration
        1. 3.2.1.1 Historical Notes
      2. 3.2.2 Elements
      3. 3.2.3 Attributes
        1. 3.2.3.1 Empty Attributes
        2. 3.2.3.2 Unquoted Attribute Values
        3. 3.2.3.3 Double-Quoted Attribute Values
        4. 3.2.3.4 Single-Quoted Attribute Values
      4. 3.2.4 Comments
      5. 3.2.5 Text
      6. 3.2.6 CDATA Sections
      7. 3.2.7 Character References
    3. 3.3 Understanding MIME Types
    4. 3.4 Character Encoding
    5. 3.5 Choosing HTML or XHTML
    6. 3.6 Polyglot Documents
  4. 4 The HTML Vocabulary and APIs
    1. 4.1 Categories
      1. 4.1.1 Metadata Content
      2. 4.1.2 Flow content
      3. 4.1.3 Sectioning root
      4. 4.1.4 Sectioning content
      5. 4.1.5 Heading content
      6. 4.1.6 Phrasing content
      7. 4.1.7 Embedded content
      8. 4.1.8 Interactive content
      9. 4.1.9 Transparent Content Models
    2. 4.2 Global Attributes
    3. 4.3 The Elements
      1. 4.3.1 The Root Element
        1. 4.3.1.1 The html element
      2. 4.3.2 Document Metadata
        1. 4.3.2.1 The head element
        2. 4.3.2.2 The title element
        3. 4.3.2.3 The base element
        4. 4.3.2.4 The link element
        5. 4.3.2.5 The meta element
        6. 4.3.2.6 The style element
      3. 4.3.3 Scripting
        1. 4.3.3.1 The script element
        2. 4.3.3.2 The noscript element
      4. 4.3.4 Sections
        1. 4.3.4.1 The body element
        2. 4.3.4.2 The section element
        3. 4.3.4.3 The nav element
        4. 4.3.4.4 The article element
        5. 4.3.4.5 The aside element
        6. 4.3.4.6 The h1, h2, h3, h4, h5, and h6 elements
        7. 4.3.4.7 The header element
        8. 4.3.4.8 The footer element
        9. 4.3.4.9 The address element
      5. 4.3.5 Grouping Content
        1. 4.3.5.1 The p element
        2. 4.3.5.2 The hr element
        3. 4.3.5.3 The br element
        4. 4.3.5.4 The pre element
        5. 4.3.5.5 The dialog element
        6. 4.3.5.6 The blockquote element
        7. 4.3.5.7 The ol element
        8. 4.3.5.8 The ul element
        9. 4.3.5.9 The li element
        10. 4.3.5.10 The dl element
        11. 4.3.5.11 The dt element
        12. 4.3.5.12 The dd element
      6. 4.3.6 Text-Level Semantics
        1. 4.3.6.1 The a element
        2. 4.3.6.2 The q element
        3. 4.3.6.3 The cite element
        4. 4.3.6.4 The em element
        5. 4.3.6.5 The strong element
        6. 4.3.6.6 The small element
        7. 4.3.6.7 The mark element
        8. 4.3.6.8 The dfn element
        9. 4.3.6.9 The abbr element
        10. 4.3.6.10 The time element
        11. 4.3.6.11 The progress element
        12. 4.3.6.12 The meter element
        13. 4.3.6.13 The code element
        14. 4.3.6.14 The var element
        15. 4.3.6.15 The samp element
        16. 4.3.6.16 The kbd element
        17. 4.3.6.17 The sub and sup elements
        18. 4.3.6.18 The span element
        19. 4.3.6.19 The i element
        20. 4.3.6.20 The b element
        21. 4.3.6.21 The bdo element
        22. 4.3.6.22 The ruby element
        23. 4.3.6.23 The rt element
        24. 4.3.6.24 The rp element
      7. 4.3.7 Edits
        1. 4.3.7.1 The ins element
        2. 4.3.7.2 The del element
      8. 4.3.8 Embedded Content
        1. 4.3.8.1 The figure element
        2. 4.3.8.2 The img element
        3. 4.3.8.3 The iframe element
        4. 4.3.8.4 The embed element
        5. 4.3.8.5 The object element
        6. 4.3.8.6 The param element
        7. 4.3.8.7 The video element
        8. 4.3.8.8 The audio element
        9. 4.3.8.9 The source element
        10. 4.3.8.10 The canvas element
        11. 4.3.8.11 The map element
        12. 4.3.8.12 The area element
      9. 4.3.9 Tabular Data
        1. 4.3.9.1 The table element
        2. 4.3.9.2 The caption element
        3. 4.3.9.3 The colgroup element
        4. 4.3.9.4 The col element
        5. 4.3.9.5 The tbody element
        6. 4.3.9.6 The thead element
        7. 4.3.9.7 The tfoot element
        8. 4.3.9.8 The tr element
        9. 4.3.9.9 The td element
        10. 4.3.9.10 The th element
      10. 4.3.10 Forms
        1. 4.3.10.1 The form element
        2. 4.3.10.2 The fieldset element
        3. 4.3.10.3 The label element
        4. 4.3.10.4 The input element
        5. 4.3.10.5 The button element
        6. 4.3.10.6 The select element
        7. 4.3.10.7 The datalist element
        8. 4.3.10.8 The optgroup element
        9. 4.3.10.9 The option element
        10. 4.3.10.10 The textarea element
        11. 4.3.10.11 The output element
      11. 4.3.11 Interactive Elements
        1. 4.3.11.1 The details element
        2. 4.3.11.2 The command element
        3. 4.3.11.3 The bb element
        4. 4.3.11.4 The menu element
      12. 4.3.12 Miscellaneous Elements
        1. 4.3.12.1 The legend element
        2. 4.3.12.2 The div element
  5. 5 Index of Elements
    1. 5.1 Conforming Elements
    2. 5.2 Obsolete Elements
    3. 5.3 Comparison of HTML 4.01 and HTML 5 Elements
  6. 6 How to Read This Guide
    1. 6.1 Conventions
      1. 6.1.1 Notes, Tips and Warnings
      2. 6.1.2 Example Markup
        1. 6.1.2.1 Attributes
        2. 6.1.2.2 Void Elements
        3. 6.1.2.3 Namespaces

1 Introduction

This document serves as a reference guide for the HTML syntax, vocabulary and its associated DOM APIs. It is intended for web site and application developers, publishers, tutorial writers, and teachers and their students: that is, people who write documents using HTML, or who teach others to do so. This guide is structured into three major sections.

The first provides an introductory tutorial on writing HTML, explaining the basic structure and syntax of an HTML document, covering the fundamental techniques and best practices, encouraging the use of clean and valid markup, and the use of quality assurance tools.

The second section provides an in-depth look at the syntax of HTML and XHTML documents. It investigates both the similarities and differences between the two alternatives and provides guidance on choosing which to use for your own projects, depending on your needs. It also provides details about creating polyglot documents (that is, documents that conform to both HTML and XHTML simultaneously), including issues related to ensuring stylesheets and scripts work correctly under both conditions.

The third and final section provides a reference for the HTML vocabulary. Each element is described, providing details about its meaning, allowed attributes, content models and DOM APIs. Each is accompanied by clear examples illustrating how the element is designed to be used for a range of different use cases.

2 Getting Started with HTML 5

The most common format for publishing documents on the web and creating web applications is HTML. From its beginning as a relatively simple language primarily designed for describing scientific documents, it has grown and adapted to a wide variety of needs, ranging from publishing news and blogs to providing the foundation for full-blown applications for email, maps, word processing and spreadsheets.

As the uses of HTML have grown, the demands placed upon it by authors have increased and the limitations of HTML have become more pronounced. HTML 5 represents the next major step in the development of HTML, introducing a wide range of new features into the language. Authors who are familiar with previous versions of HTML are advised to familiarise themselves with the differences from HTML 4 [HTML4DIFF].

This section provides an introductory tutorial to help get you started with HTML, and is suitable for beginners. Experienced authors may choose to skip this section and proceed to the syntax overview and the element reference.

2.1 A Basic Document

The goal of this section is to walk people through creating example01.html.

To begin, we’re going to create a very basic HTML document, which will also serve as a useful template for future HTML documents. This document will simply contain a title and short paragraph.

Open a text editor and create a new, empty file. I suggest you save the file as example01.html.

All HTML documents need to begin with a DOCTYPE. The DOCTYPE is a remnant from the early days of the web. For historical reasons, it is needed to ensure that web browsers interpret the document correctly, rather than using a special compatibility mode designed to replicate the behaviour of older browsers.

In your text editor, type the following on the first line, and save the file.

<!DOCTYPE html>

Because this is required for all documents, it is good practice to get in the habit of always typing that as the first line in all new HTML documents you create, so that it never gets forgotten.

An HTML document is divided into two main sections: the head, which is used to contain document metadata, such as the title, stylesheets and scripts; and the body, which contains all of the page’s content. The markup itself forms a tree structure, as illustrated in the following diagram.
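
As a minimal sketch of how the finished example01.html might look, the following document adds a head and a body after the DOCTYPE. The title and paragraph text are placeholder assumptions, not content prescribed by this guide. The indentation is only there to make the nesting of the head and body inside the root html element easier to see.

```html
<!DOCTYPE html>
<html>
  <head>
    <!-- Document metadata: the title, plus any stylesheets and scripts -->
    <title>My First HTML Document</title>
  </head>
  <body>
    <!-- The content of the page -->
    <p>This is a short paragraph.</p>
  </body>
</html>
```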

2.2 Understanding Semantics

In general, the purpose of writing and publishing a document is to convey information to the readers. This could be any kind of information, such as telling a story, reporting news and current affairs or describing available products and services. Whatever the information is, it needs to be conveyed to the reader in a way that can be easily understood.

A typical document, such as a book, news article, blog entry or letter, is often grouped into different sections containing a variety of headings, paragraphs, lists, tables, quotes and various other typographical structures. All of these structures are important for more easily conveying information to the reader. HTML provides the means to clearly identify each of these structures in a way that can then be easily presented to the user. In essence, this is the purpose of markup, and HTML in particular.

Markup is a machine-readable language that describes aspects of a document such as its structure, semantics and/or style. Some markup languages are designed solely for the purpose of describing the presentation of the document, such as RTF (Rich Text Format). Others, such as HTML, are more generic: rather than focussing on describing the presentation, they are designed to focus on describing the meaning or purpose of the content and leave the presentation for another layer to deal with.

HTML provides a wide variety of semantic elements that can be used to mark up various common typographical structures. There are heading elements for marking up different levels of headings, a paragraph (p) element for paragraphs, various list elements for marking up different types of lists, and table elements for marking up tables.

It’s important to distinguish between the structure and semantics of content, which should be described using HTML, and its presentation. In one document, a heading may be presented visually in a large bold typeface with wide margins above and below to separate it from the surrounding content and make it stand out. In another document, a heading may be presented in a light coloured, italic, fancy script typeface. But regardless of the presentation, it’s still a heading and the markup can still use the same basic elements for identifying common structures.
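
The following sketch shows a few of these semantic elements used together. The content (a shopping guide) is invented purely for illustration; only the element names come from this guide.

```html
<!DOCTYPE html>
<html>
  <head>
    <title>Semantic Elements</title>
  </head>
  <body>
    <!-- Headings of two different levels -->
    <h1>Shopping Guide</h1>
    <h2>What to Buy</h2>
    <!-- A paragraph -->
    <p>The following items are needed this week.</p>
    <!-- An unordered list -->
    <ul>
      <li>Apples</li>
      <li>Bread</li>
    </ul>
    <!-- A table with one header row and one data row -->
    <table>
      <tr><th>Item</th><th>Quantity</th></tr>
      <tr><td>Apples</td><td>6</td></tr>
    </table>
  </body>
</html>
```

Nothing in this markup says how the headings, list or table should look; that is left to the presentation layer.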

3 The HTML and XHTML Syntax

It is useful to make a distinction between the vocabulary of an HTML document—the elements and attributes, and their meanings—and the syntax in which it is written.

HTML has a defined set of elements and attributes which can be used in a document, each designed for a specific purpose and with its own meaning. Consider this set of elements to be analogous to the list of words in a dictionary. This includes elements for headings, paragraphs, lists, tables, links, form controls and many other features. This is the vocabulary of HTML. Similarly, just as natural languages have grammatical rules for how different words can be used, HTML has rules for where and how each element and attribute can be used.

The basic structure of elements in an HTML document is a tree structure. Every element has exactly one parent element (except for the root element, which has none) and may have any number of child elements. This structure needs to be reflected in the syntax used to write the document.

3.1 Syntactic Overview

There are two syntaxes that can be used: the traditional HTML syntax, and the XHTML syntax. While these are similar, each is optimised for different needs and authoring habits. The former is more lenient in its design and handling requirements, and has a number of convenient shorthands for authors to use. The latter is based on XML and has much stricter syntactic requirements, designed to discourage the proliferation of syntactic errors.

The HTML syntax is loosely based upon the older, though very widely used, syntax from HTML 4.01. Although it is inspired by its SGML origins, in practice it shares only minor syntactic similarities with SGML. The HTML syntax features a range of shorthand syntaxes designed to make hand coding more convenient, such as allowing the omission of some optional tags and attribute values. Authors are free to choose whether or not they wish to take advantage of these shorthand features based upon their own personal preferences.

The following example illustrates a basic HTML document, demonstrating some shorthand syntax:

HTML Example:

<!DOCTYPE html>
<html>
 <head>
   <title>An HTML Document</title>
 </head>
 <body class=example>
   <h1>Example</h1>
   <p>This is an example HTML document.
 </body>
</html>

XHTML, however, is based on the much more strict XML syntax. While this too is inspired by SGML, this syntax requires documents to be well-formed, which some people prefer because of its stricter error handling, forcing authors to maintain cleaner markup.

XHTML Example:

<html xmlns="http://www.w3.org/1999/xhtml">
 <head>
   <title>An HTML Document</title>
 </head>
 <body class="example">
   <h1>Example</h1>
   <p>This is an example HTML document.</p>
 </body>
</html>

Note: The XHTML document does not need to include the DOCTYPE because XHTML documents that are delivered correctly using an XML MIME type, and are processed as XML by browsers, are always rendered in no quirks mode. However, the DOCTYPE may optionally be included, and should be included if the document uses the compatible subset of markup that is conforming in both HTML and XHTML and is ever expected to be used in text/html environments.

Due to the similarities of both the HTML and XHTML syntaxes, it is possible to mark up documents using a common subset of the syntax that is the same in both, while avoiding the syntactic sugar that is unique to each. This type of document is known as a polyglot document because it simultaneously conforms to both syntaxes and may be handled as either. There are a number of issues involved with creating such documents and authors wishing to do so should familiarise themselves with the similarities and differences between HTML and XHTML.

3.2 The Syntax

A number of basic components make up the syntax of HTML and are used throughout any document. These include the DOCTYPE declaration, elements, attributes, comments, text and CDATA sections.

3.2.1 DOCTYPE Declaration

The Document Type Declaration needs to be present at the beginning of a document that uses the HTML syntax. It may optionally be used within the XHTML syntax, but it is not required. The canonical DOCTYPE that most HTML documents should use is as follows:

<!DOCTYPE html>

For compatibility with legacy producers of HTML — that is, software that outputs HTML documents — an alternative DOCTYPE is available for use by systems that are unable to output the DOCTYPE given above. This limitation occurs in software that expects a DOCTYPE to include either a PUBLIC or SYSTEM identifier, and is unable to omit them. The canonical form of this DOCTYPE is as follows:

<!DOCTYPE html SYSTEM "about:legacy-compat">

Note: The term "legacy-compat" refers to compatibility with legacy producers only. In particular, it does not refer to compatibility with legacy browsers, which, in practice, ignore SYSTEM identifiers and DTDs.

In HTML, the DOCTYPE is case insensitive, except for the quoted string "about:legacy-compat", which must be written in lower case. This quoted string, however, may also be quoted with single quotes, rather than double quotes. In the examples below, the keywords DOCTYPE, html and SYSTEM are the parts that are case insensitive.

HTML Example:

<!DOCTYPE html>

<!DOCTYPE html SYSTEM "about:legacy-compat">

<!DOCTYPE html SYSTEM 'about:legacy-compat'>

The following are also valid alternatives in the HTML syntax:

HTML Example:

<!doctype html>

<!DOCTYPE HTML>

<!doctype html system 'about:legacy-compat'>

<!Doctype HTML System "about:legacy-compat">

For XHTML, it is recommended that the DOCTYPE be omitted because it is unnecessary. However, should you wish to use a DOCTYPE, note that the DOCTYPE is case sensitive, and only the canonical versions of these DOCTYPEs given above may be used.

XHTML Example:

<!DOCTYPE html>

<!DOCTYPE html SYSTEM "about:legacy-compat">

<!DOCTYPE html SYSTEM 'about:legacy-compat'>

However, there are no restrictions placed on the use of alternative DOCTYPEs in XHTML. You may, if you wish, use a custom DOCTYPE referring to a custom DTD, typically for validation purposes. Be advised, though, that DTDs have a number of limitations compared with other schema languages and validation techniques.

3.2.1.1 Historical Notes

This section needs revising and may be moved to an external document and simply referred to.

The DOCTYPE originates from HTML’s SGML lineage and, in previous levels of HTML, was originally used to refer to a Document Type Definition (DTD), a formal declaration of the elements, attributes and syntactic features that could be used within the document. Those who are familiar with previous levels of HTML will notice that there is no PUBLIC identifier present in this DOCTYPE, which was used to refer to the DTD. Also, note that the about: URI scheme in the SYSTEM identifier of the latter DOCTYPE is used specifically because it cannot be resolved to any specific DTD.

As HTML 5 is no longer formally based upon SGML, the DOCTYPE no longer serves this purpose, and thus no longer needs to refer to a DTD. However, due to legacy constraints, it has gained another very important purpose: triggering no quirks mode in browsers.

HTML 5 defines three modes: quirks mode, limited quirks mode and no quirks mode, of which only the last is considered conforming to use. The other modes exist for reasons of backwards compatibility. The important thing to understand is that there are some differences in the way documents are visually rendered in each of the modes, and to get the most standards-compliant rendering, it is important to ensure that no quirks mode is used.

3.2.2 Elements

Elements are marked up using start tags and end tags. Tags are delimited using angle brackets with the tag name in between. The difference between start tags and end tags is that the latter includes a slash before the tag name.

Example:

This example paragraph illustrates the use of start tags and end tags.

<p>The quick brown fox jumps over the lazy dog.</p>

In both tags, whitespace is permitted between the tag name and the closing right angle bracket; however, it is usually omitted because it’s redundant.

In XHTML, tag names are case sensitive and are usually defined to be written in lowercase. In HTML, however, tag names are case insensitive and may be written in all uppercase or mixed case, although the most common convention is to stick with lowercase. The case of the start and end tags does not have to be the same, but being consistent does make the code look cleaner.

HTML Example:

<DIV>...</DIV>

An empty element is any element that does not contain any content within it. In general, an empty element is just one with a start tag immediately followed by its associated end tag. In both HTML and XHTML syntaxes, this can be represented in the same way.

Example:

<span></span>

Some elements, however, are forbidden from containing any content at all. These are known as void elements. In HTML, the above syntax cannot be used for void elements. For such elements, the end tag must be omitted because the element is automatically closed by the parser. Such elements include, among others, br, hr, link and meta.

HTML Example:

<link type="text/css" rel="stylesheet" href="style.css">

In XHTML, the XML syntactic requirements dictate that the end of the element must be made explicit, using either an end tag, as in the span example above, or the empty element syntax. The latter is achieved by inserting a slash at the end of the start tag, immediately before the right angle bracket.

Example:

<link type="text/css" rel="stylesheet" href="style.css"/>

Authors may optionally choose to use this same syntax for void elements in the HTML syntax as well. Some authors also choose to include whitespace before the slash; however, this is not necessary. (Using whitespace in that fashion is a convention inherited from the compatibility guidelines in XHTML 1.0, Appendix C.)
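
To make the options concrete, the following sketch writes the same void element, br, in each of the forms described above. It is an illustration only; which form to prefer is left to the author.

```html
<!-- No slash: allowed in the HTML syntax only -->
<br>

<!-- Slash immediately before the right angle bracket: allowed in both syntaxes -->
<br/>

<!-- Optional whitespace before the slash: also allowed in both syntaxes -->
<br />
```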

3.2.3 Attributes

Elements may contain attributes that are used to set various properties of an element. Some attributes are defined globally and can be used on any element, while others are defined for specific elements only. All attributes have a name and a value and look like this.

Example:

This example illustrates how to mark up a div element with an attribute named class using a value of "example".

<div class="example">...</div>

Attributes may only be specified within start tags and must never be used in end tags.

Erroneous Example:

<section id="example">...</section id="example">

In XHTML, attribute names are case sensitive and most are defined to be lowercase. In HTML, attribute names are case insensitive, and so they could be written in all uppercase or mixed case, depending on your own preferences. It is conventional, however, to use the same case as would be used in XHTML, which is generally all lowercase.

HTML Example:

<div CLASS="example">

In general, the values of attributes can contain any text or character references, although depending on the syntax used, some additional restrictions apply, which are outlined below.

There are four slightly different syntaxes that may be used for attributes in HTML: empty, unquoted, single-quoted and double-quoted. All four syntaxes may be used in the HTML syntax, depending on what is needed for each specific attribute. However, in the XHTML syntax, attribute values must always be quoted using either single or double quotes.

3.2.3.1 Empty Attributes

An empty attribute is one where the value has been omitted. This is a syntactic shorthand for specifying the attribute with an empty value, and is commonly used for boolean attributes. This syntax may be used in the HTML syntax, but not in the XHTML syntax.

Note: In previous editions of HTML, which were formally based on SGML, it was technically an attribute’s name that could be omitted where the value was a unique enumerated value specified in the DTD. However, due to legacy constraints, this has been changed in HTML 5 to reflect the way implementations really work.

HTML Example:

<input disabled>

The previous example is equivalent to specifying the attribute with an empty string as the value.

<input disabled="">

Note: The previous example is semantically equivalent to specifying the attribute with the value "disabled", but it is not exactly the same.

Example:

<img src="decoration.png" alt>

The previous example is equivalent to specifying the attribute with an empty string as the value.

<img src="decoration.png" alt="">

3.2.3.2 Unquoted Attribute Values

In HTML, but not in XHTML, the quotes surrounding the value may also be omitted in most cases. An unquoted value may contain any characters except for space characters, single (') or double (") quotation marks, equals signs (=) or greater-than signs (>). If you need an attribute value to contain any of those characters, either escape them using character references or use a single- or double-quoted attribute value.
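
As a rough sketch of how this rule plays out, the following lines contrast values that can be left unquoted with values that need quotes. The attribute values themselves (username, search?q=html) are made up for the example.

```html
<!-- Unquoted: neither value contains any of the excluded characters -->
<input type=text name=username>

<!-- Quoted: the value contains spaces -->
<div class="example class names">...</div>

<!-- Quoted: the value contains an equals sign -->
<a href="search?q=html">Search</a>
```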

HTML Example:

<div class=example>

3.2.3.3 Double-Quoted Attribute Values

In both HTML and XHTML, attribute values may be surrounded with double quotes.

By quoting attributes, the value may contain the additional characters that can’t be used in unquoted attribute values, but for obvious reasons, these attributes cannot contain additional double quotation marks within the value.

Example:

<div class="example class names">...</div>

3.2.3.4 Single-Quoted Attribute Values

In both HTML and XHTML, attribute values may be surrounded with single quotes.

By quoting attributes, the value may contain the additional characters that can’t be used in unquoted attribute values, but for obvious reasons, these attributes cannot contain additional single quotation marks within the value.

Example:

<div class='example class names'>...</div>
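
When a value needs to contain a quotation mark, one approach is to surround it with the other kind of quote; another is to write the quotation mark as a character reference. The sketch below shows both, using the title attribute with invented text.

```html
<!-- A double-quoted value may contain a single quotation mark -->
<p title="The author's note">...</p>

<!-- A single-quoted value may contain double quotation marks -->
<p title='The "example" class'>...</p>

<!-- Double quotation marks written as character references inside a double-quoted value -->
<p title="The &quot;example&quot; class">...</p>
```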

3.2.4 Comments

...

3.2.5 Text

...

3.2.6 CDATA Sections

...

3.2.7 Character References

Discuss numeric and named character reference syntax. May link to the list of entity references in a separate document, rather than trying to list them all in here.
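
As a brief illustration of the two forms, the following sketch writes the less-than sign, greater-than sign and ampersand first as named character references and then as numeric character references, in both decimal and hexadecimal notation. These three characters are chosen only because they are the ones most often escaped.

```html
<!-- Named character references -->
<p>&lt; &gt; &amp;</p>

<!-- Numeric character references, decimal -->
<p>&#60; &#62; &#38;</p>

<!-- Numeric character references, hexadecimal -->
<p>&#x3C; &#x3E; &#x26;</p>
```

All three paragraphs produce the same text.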

3.3 Understanding MIME Types

Discuss text/html, application/xhtml+xml, etc.

3.4 Character Encoding

Overview of Unicode, character repertoires, encodings, etc. Declaring the encoding with the Content-Type header, BOM, meta, etc.
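
As one possible sketch of declaring the encoding within the document itself, the following uses a meta element in the head. It assumes the file is actually saved as UTF-8; the declaration only describes the encoding and does not convert the file. The equivalent HTTP header is shown in a comment for comparison.

```html
<!DOCTYPE html>
<html>
  <head>
    <!-- In-document declaration; the equivalent HTTP header would be
         Content-Type: text/html; charset=utf-8 -->
    <meta charset="utf-8">
    <title>Encoding Example</title>
  </head>
  <body>
    <p>This document is declared as UTF-8.</p>
  </body>
</html>
```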

3.5 Choosing HTML or XHTML

The choice of HTML or XHTML syntax is largely dependent upon a number of factors, including the needs of a given project, the skill set of the developers involved, and the level of support in browsers used by the site’s target audience; or it may simply be a matter of personal preference.

The important thing to understand is that there are valid reasons to choose both, and that authors are encouraged to make an informed decision.

Need to develop guidelines to help authors make this choice.

3.6 Polyglot Documents

A polyglot HTML document is a document that conforms to both the HTML and XHTML syntactic requirements, and which can be processed as either by browsers, depending on the MIME type used. This works by using a common subset of the syntax that is shared by both HTML and XHTML.

Polyglot documents are useful in situations where a document is intended to be served as either HTML or XHTML, depending on the support in particular browsers, or when it is not known at the time of creation which MIME type the document will ultimately be served as.

In order to successfully create and maintain polyglot documents, authors need to be familiar with both the similarities and differences between the two syntaxes. This includes not only syntactic differences, but also differences in the way stylesheets and scripts are handled, and the way in which character encodings are detected.

This section will provide the details about each of these similarities and differences, and provide guidelines on the creation of polyglot documents.
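
As a first, hedged sketch of what such a document might look like, the following combines features from the HTML and XHTML examples earlier in this guide: the DOCTYPE, the xmlns attribute on the root element, quoted attribute values, explicit end tags, and the slash syntax for void elements. It is not a complete checklist; scripts, stylesheets and character encoding detection raise further issues that this sketch does not address.

```html
<!DOCTYPE html>
<html xmlns="http://www.w3.org/1999/xhtml">
  <head>
    <title>A Polyglot Document</title>
    <!-- Void element written with the slash so that it is also well-formed XML -->
    <link rel="stylesheet" type="text/css" href="style.css"/>
  </head>
  <body class="example">
    <h1>Example</h1>
    <!-- End tags are never omitted -->
    <p>This is an example polyglot document.</p>
  </body>
</html>
```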

Base this on the HTML vs. XHTML article.

4 The HTML Vocabulary and APIs

4.1 Categories

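Each element in HTML falls into zero or more categories that group elements with similar characteristics together. The following categories are used in this guide: metadata content, flow content, sectioning root, sectioning content, heading content, phrasing content, embedded content and interactive content.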

Some elements have unique requirements and do not fit into any particular category.

These categories are related as follows:

Sectioning content, heading content, phrasing content, and embedded content are all types of flow content. Embedded content is also a type of phrasing content.

[Create and link to some sort of index of elements that lists each element in each category.]

4.1.1 Metadata Content

Metadata content includes elements for marking up document metadata, for marking up or linking to resources that describe the behaviour or presentation of the document, and for indicating relationships with other documents.

Metadata elements appear within the head of a document. Some common examples of metadata elements include: title, meta, link, script and style.
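
The following sketch shows these common metadata elements together in a head. The file names style.css and script.js, and the description text, are assumptions made for the example.

```html
<head>
  <title>Example Page</title>
  <!-- Document metadata -->
  <meta name="description" content="A short description of the page.">
  <!-- A link to an external resource describing the presentation -->
  <link rel="stylesheet" href="style.css">
  <!-- Presentation described within the document itself -->
  <style>
    body { font-family: sans-serif; }
  </style>
  <!-- Behaviour loaded from an external script -->
  <script src="script.js"></script>
</head>
```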

4.1.2 Flow content

Most of the elements used to mark up the main content in the body of documents and applications are categorised as flow content. In general, this includes elements that are presented visually as either block level or inline level.

Some common flow content includes elements like div, p, em and strong.

Elements categorised as heading content, phrasing content or embedded content are also considered to be flow content.

4.1.3 Sectioning root

[This description needs improving.]

These elements can have their own outlines, but the sections and headers inside these elements do not contribute to the outlines of their ancestors.

Some common sectioning root elements include, among others, body, blockquote and figure.

4.1.4 Sectioning content

Sectioning content is used for structuring a document into sections, each of which generally has its own heading. These elements provide a scope within which associated headers, footers and contact information apply.

Some common sectioning elements include, among others, section, article and nav.
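
As an illustrative sketch, the following body uses nav, article and section to divide a page into sections, each with its own heading. The headings and link targets are invented for the example.

```html
<body>
  <h1>Example News</h1>
  <!-- A section containing navigation links -->
  <nav>
    <h2>Navigation</h2>
    <ul>
      <li><a href="index.html">Home</a></li>
      <li><a href="archive.html">Archive</a></li>
    </ul>
  </nav>
  <!-- A self-contained article with a nested section -->
  <article>
    <h2>Article Title</h2>
    <p>...</p>
    <section>
      <h3>Background</h3>
      <p>...</p>
    </section>
  </article>
</body>
```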

Most sectioning elements, with the exception of the body element, are also classified as flow content.

4.1.5 Heading content

Heading content includes the elements for marking up headers. Headings, in conjunction with the sectioning elements, are used to describe the structure of the document.

Heading content includes the header element and the h1 to h6 elements.

Elements categorised as heading content are considered to be flow content.

4.1.6 Phrasing content

Phrasing content includes text and text-level markup. This is similar to the concept of inline level elements in HTML 4.01. Most elements that are categorised as phrasing content can only contain other phrasing content.

Some common examples of phrasing content elements include abbr, em, strong and span.

Elements categorised as phrasing content are considered to be flow content.

4.1.7 Embedded content

Embedded content includes elements that load external resources into the document. Such external resources include, for example, images, videos and Flash-based content. Some embedded content elements include img, object, embed and video.

Elements categorised as embedded content are considered to be phrasing content, and thus also considered to be flow content.

4.1.8 Interactive content

Interactive elements are those that the user can interact with or activate in some way. Depending on the user’s browser and device, this could be done using any kind of input device, such as a mouse, keyboard, touch screen or voice input.

Some common examples of interactive content include a, audio and video when used with the controls attribute, and most form controls using input.

4.1.9 Transparent Content Models

Some elements have transparent content models, meaning that their allowed content depends upon the parent element. They may contain any content that their parent element may contain, in addition to any other allowances or exceptions described for the element.

When the element has no parent, then the content model defaults to flow content.
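
To illustrate, consider the ins element, which HTML 5 defines as having a transparent content model. The following sketch shows the same element accepting different content depending on its parent. The text is illustrative only.

HTML Example:

<!-- The parent is a div, which allows flow content, so ins may contain a p element. -->
<div>
  <ins><p>A newly added paragraph.</p></ins>
</div>

<!-- The parent is a p, which allows only phrasing content, so ins is limited to phrasing content. -->
<p>Some text with <ins>a few <em>inserted</em> words</ins>.</p>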

4.2 Global Attributes

To be completed.

4.3 The Elements

Expect major changes to this section. Each of these needs longer descriptions and the elements should be divided into categories. The IDL for the DOM Interfaces is likely to be replaced by something a lot more reader-friendly in the future; consider it a placeholder for now. Attributes will likely be accompanied by brief descriptions within the summary box, in addition to fuller descriptions and examples afterwards.

4.3.1 The Root Element

4.3.1.1 The html element

The html element represents the root of an HTML document.

Start tag:
optional
End tag:
optional
Categories:
  • None.
Contained By:
  • As the root element of a document.
  • Wherever a subdocument fragment is allowed in a compound document.
Content Model:
  • A head element followed by a body element.
Attributes
DOM Interface
  • Uses HTMLElement.

The html element is the root element of a document. Every document must begin with this element, and it must contain both the head and body elements.

It is considered good practice to specify the primary language of the document on this element using the lang attribute.

HTML Example:

<!DOCTYPE html>
<html lang="en">
  <head>
    ...
  </head>
  <body>
    ...
  </body>
</html>

In the HTML syntax only, both the start and end tags are optional, and either may be omitted for convenience. If you wish to specify attributes on this element, however, at least the start tag needs to be included.

HTML Example:

<!DOCTYPE html>
<head>
  ...
</head>
<body>
  ...
</body>

In the XHTML syntax, the xmlns attribute needs to be specified on this element to declare that it is in the HTML namespace. You may use either the lang or xml:lang attribute to specify the language.

XHTML Example:

<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en">
  <head>
    ...
  </head>
  <body>
    ...
  </body>
</html>
manifest
The manifest attribute gives the address of the document’s application cache manifest, if there is one. If the attribute is present, the attribute’s value must be a valid URL.

Need to describe application cache manifests.

4.3.2 Document Metadata

4.3.2.1 The head element

The head element collects the document’s metadata.

Start tag:
optional
End tag:
optional
Categories:
  • None.
Contained By:
  • As the first element in an html element.
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.

The head element is the container for the document’s metadata. Metadata is information about the document itself, such as its title and author. Scripts and stylesheets may also be included within the head element. Every document must have a head element.

The following examples illustrate the typical usage of the head element in HTML and XHTML.

HTML Example

<!DOCTYPE html>
<html>
<head>
  <title>Example</title>
</head>
<body>
  <h1>Document</h1>
</body>
</html>

XHTML Example

<html xmlns="http://www.w3.org/1999/xhtml">
<head>
  <title>Example</title>
</head>
<body>
  <h1>Document</h1>
</body>
</html>
4.3.2.2 The title element

The title element represents the document’s title or name, and should be meaningful even when read out of context.

Start tag:
required
End tag:
required
Categories:
Contained By:
  • In a head element containing no other title elements.
Content Model:
  • Text.
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.2.3 The base element

The base element is for specifying a base URL against which relative links will be resolved, and the name of the default target for opening links and form submissions.

Start tag:
required
End tag:
empty
Categories:
Contained By:
  • In a head element containing no other base elements.
Content Model:
  • Empty.
Attributes
  • Global attributes
  • href
  • target
DOM Interface
  • interface HTMLBaseElement : HTMLElement {
               attribute DOMString href;
               attribute DOMString target;
    };
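
The following is a minimal sketch of the base element, assuming a hypothetical site at www.example.com. Relative URLs in the document are resolved against the href value, and links open in the browsing context named by target unless they specify their own.

HTML Example:

<!DOCTYPE html>
<html>
<head>
  <title>Base Example</title>
  <base href="http://www.example.com/news/" target="_blank">
</head>
<body>
  <!-- Resolves to http://www.example.com/news/archive.html -->
  <p><a href="archive.html">Archive</a></p>
</body>
</html>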

4.3.2.4 The link element

The link element is for linking to other resources, such as stylesheets, favicons and syndication feeds.

Start tag:
required
End tag:
empty
Categories:
Contained By:
Content Model:
  • Empty.
Attributes
  • Global attributes
  • href
  • rel
  • media
  • hreflang
  • type
  • sizes
  • Also, the title attribute has special semantics on this element.
DOM Interface
  • interface HTMLLinkElement : HTMLElement {
               attribute boolean disabled;
               attribute DOMString href;
               attribute DOMString rel;
      readonly attribute DOMTokenList relList;
               attribute DOMString media;
               attribute DOMString hreflang;
               attribute DOMString type;
               attribute DOMString sizes;
    };

    The LinkStyle interface must also be implemented by this element, the styling processing model defines how. [CSSOM]
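
The following sketch shows one link element for each of the uses mentioned in the introduction to this element: a stylesheet, a favicon and a syndication feed. All file names and the feed title are placeholders.

HTML Example:

<head>
  <title>Link Example</title>
  <link rel="stylesheet" href="style.css" media="screen">
  <link rel="icon" href="favicon.png" type="image/png" sizes="16x16">
  <link rel="alternate" href="feed.atom" type="application/atom+xml" title="News Feed">
</head>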

4.3.2.5 The meta element

The meta element is for providing various types of metadata, such as the application name, or for specifying the document’s character encoding.

Start tag:
required
End tag:
empty
Categories:
Contained By:
  • If the charset attribute is present, or if the element is in the Encoding declaration state: in a head element.
  • If the http-equiv attribute is present, and the element is not in the Encoding declaration state: in a head element.
  • If the http-equiv attribute is present, and the element is not in the Encoding declaration state: in a noscript element that is a child of a head element.
  • If the name attribute is present: where metadata content is expected.
Content Model:
  • Empty.
Attributes
  • Global attributes
  • name
  • http-equiv
  • content
  • charset
DOM Interface
  • interface HTMLMetaElement : HTMLElement {
               attribute DOMString content;
               attribute DOMString name;
               attribute DOMString httpEquiv;
    };
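
As a brief sketch, the following head declares the character encoding with the charset attribute and supplies two name/content pairs. The application name and description are invented for illustration.

HTML Example:

<head>
  <meta charset="UTF-8">
  <title>Meta Example</title>
  <meta name="application-name" content="Example Mail">
  <meta name="description" content="A hypothetical web mail application.">
</head>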
4.3.2.6 The style element

The style element allows authors to embed stylesheets, typically CSS, within their documents.

Start tag:
required
End tag:
required
Categories:
Contained By:
  • If the scoped attribute is absent: where metadata content is expected.
  • If the scoped attribute is absent: in a noscript element that is a child of a head element.
  • If the scoped attribute is present: where flow content is expected, but before any other flow content other than other style elements and inter-element whitespace.
Content Model:
  • Depends on the value of the type attribute.
Attributes
  • Global attributes
  • media
  • type
  • scoped
  • Also, the title attribute has special semantics on this element.
DOM Interface
  • interface HTMLStyleElement : HTMLElement {
               attribute boolean disabled;
               attribute DOMString media;
               attribute DOMString type;
               attribute boolean scoped;
    };

    The LinkStyle interface must also be implemented by this element, the styling processing model defines how. [CSSOM]

4.3.3 Scripting

4.3.3.1 The script element

The script element allows authors to include scripts, typically JavaScript, and data blocks in their documents.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
  • If there is no src attribute, depends on the value of the type attribute.
  • If there is a src attribute, the element must be either empty or contain only script documentation.
Attributes
  • Global attributes
  • src
  • async
  • defer
  • type
  • charset
DOM Interface
  • interface HTMLScriptElement : HTMLElement {
               attribute DOMString src;
               attribute boolean async;
               attribute boolean defer;
               attribute DOMString type;
               attribute DOMString charset;
               attribute DOMString text;
    };
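
The following sketch shows the two main forms of the script element: an external script referenced with the src attribute, and an inline script. The file name menu.js is hypothetical.

HTML Example:

<!-- External script; defer asks the browser to run it once the document has been parsed. -->
<script src="menu.js" defer></script>

<!-- Inline script; with no type attribute, the content is treated as JavaScript. -->
<script>
  document.title = "Updated by script";
</script>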
4.3.3.2 The noscript element

The noscript element is used to provide alternative content for users using browsers that do not support scripting or have it disabled.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
  • When scripting is disabled, in a head element: in any order, zero or more link elements, zero or more style elements, and zero or more meta elements.
  • When scripting is disabled, not in a head element: transparent, but there must be no noscript element descendants.
  • Otherwise: text that conforms to the requirements given in the prose.
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
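
As a minimal sketch in the HTML syntax, the following fragment pairs a script with fallback content. The id value and the message text are illustrative; users without scripting see the noscript content instead of the time.

HTML Example:

<p id="clock"></p>
<script>
  // Write the current local time into the paragraph above.
  document.getElementById("clock").textContent = new Date().toLocaleTimeString();
</script>
<noscript>
  <p>Enable scripting to see the current time.</p>
</noscript>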

4.3.4 Sections

4.3.4.1 The body element

The body element represents the main content of the document.

Start tag:
optional
End tag:
optional
Categories:
Contained By:
  • As the second element in an html element.
Content Model:
Attributes
  • Global attributes
  • onbeforeunload
  • onerror
  • onhashchange
  • onload
  • onmessage
  • onoffline
  • ononline
  • onpopstate
  • onresize
  • onstorage
  • onunload
DOM Interface
  • interface HTMLBodyElement : HTMLElement {
               attribute Function onbeforeunload;
               attribute Function onerror;
               attribute Function onhashchange;
               attribute Function onload;
               attribute Function onmessage;
               attribute Function onoffline;
               attribute Function ononline;
               attribute Function onpopstate;
               attribute Function onresize;
               attribute Function onstorage;
               attribute Function onunload;
    };
4.3.4.2 The section element

The section element represents a generic document or application section. A section, in this context, is a thematic grouping of content, typically with a header and possibly a footer.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.4.3 The nav element

The nav element represents a section of a page containing primary navigation links to other pages or to parts within the page.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.4.4 The article element

The article element represents an independent section of a document, page, or site. This could be a forum post, a magazine or newspaper article, a blog entry, a user-submitted comment, or any other independent item of content.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.4.5 The aside element

The aside element represents a section of a page that consists of content that is tangentially related to the content around the aside element, and which could be considered separate from that content. Such sections are often represented as sidebars in printed typography.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.4.6 The h1, h2, h3, h4, h5, and h6 elements

These elements define headers for their sections.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.4.7 The header element

The header element represents the header of a section, typically containing headings and subheadings, and other metadata about the section.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.

4.3.4.8 The footer element

The footer element represents a footer of a section, typically containing information such as who wrote it, links to related documents, and copyright notices.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.4.9 The address element

The address element represents the contact information for the section it applies to. If it applies to the body element, then it instead applies to the document as a whole.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
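
The following sketch, using invented content, shows how the header, footer and address elements might be combined within an article. Here the address gives contact information for the article, since that is the section it applies to.

HTML Example:

<article>
  <header>
    <h1>Release Notes</h1>
    <p>Version 1.0</p>
  </header>
  <p>This release fixes several bugs.</p>
  <footer>
    <p>Published by the Example Project.</p>
    <address>
      Contact: <a href="mailto:editor@example.com">editor@example.com</a>
    </address>
  </footer>
</article>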

4.3.5 Grouping Content

4.3.5.1 The p element

The p element represents a paragraph.

Start tag:
required
End tag:
optional
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.5.2 The hr element

The hr element represents a paragraph-level thematic break, e.g. a scene change in a story, or a transition to another topic within a section of a reference book.

Start tag:
required
End tag:
empty
Categories:
Contained By:
Content Model:
  • Empty.
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.5.3 The br element

The br element represents a line break.

Start tag:
required
End tag:
empty
Categories:
Contained By:
Content Model:
  • Empty.
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.5.4 The pre element

The pre element represents a block of preformatted text, in which structure is represented by typographic conventions rather than by elements.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.5.5 The dialog element

The dialog element represents a conversation.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
  • Zero or more pairs of one dt element followed by one dd element.
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.5.6 The blockquote element

The blockquote element represents a section that is quoted from another source.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
  • cite
DOM Interface
  • interface HTMLQuoteElement : HTMLElement {
               attribute DOMString cite;
    };

    The HTMLQuoteElement interface is also used by the q element.
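
    As a brief sketch, a blockquote may use the cite attribute to give the address of the source. The URL and quoted text here are placeholders.

HTML Example:

<blockquote cite="http://www.example.com/speech">
  <p>Quoted paragraph from the source document.</p>
</blockquote>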

4.3.5.7 The ol element

The ol element represents an ordered list.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
  • Zero or more li elements.
Attributes
  • Global attributes
  • reversed
  • start
DOM Interface
  • interface HTMLOListElement : HTMLElement {
               attribute boolean reversed;
               attribute long start;
    };
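
The following sketch uses both attributes together: start gives the number of the first item, and reversed indicates that the list counts down. In a browser that supports reversed, the items should be numbered 3, 2 and 1. The list content is illustrative.

HTML Example:

<ol start="3" reversed>
  <li>Third place</li>
  <li>Second place</li>
  <li>First place</li>
</ol>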
4.3.5.8 The ul element

The ul element represents an unordered list.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
  • Zero or more li elements.
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.5.9 The li element

The li element represents a list item.

Start tag:
required
End tag:
optional
Categories:
  • None.
Contained By:
  • Inside ol elements.
  • Inside ul elements.
  • Inside menu elements.
Content Model:
Attributes
  • Global attributes
  • If the element is a child of an ol element: value
DOM Interface
  • interface HTMLLIElement : HTMLElement {
               attribute long value;
    };
4.3.5.10 The dl element

The dl element introduces an association list (a description list) containing groups of terms and associated descriptions.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
  • Zero or more groups each consisting of one or more dt elements followed by one or more dd elements.
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.5.11 The dt element

The dt element represents the term, or name, part of a term-description group in a description list (dl element), and the talker, or speaker, part of a talker-discourse pair in a conversation (dialog element).

Start tag:
required
End tag:
optional
Categories:
  • None.
Contained By:
  • Before dd or dt elements inside dl elements.
  • Before a dd element inside a dialog element.
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.5.12 The dd element

The dd element represents the description, definition, or value, part of a term-description group in a description list (dl element), and the discourse, or quote, part in a conversation (dialog element).

Start tag:
required
End tag:
optional
Categories:
  • None.
Contained By:
  • After dt or dd elements inside dl elements.
  • After a dt element inside a dialog element.
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
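
As a sketch of the dl content model, the following list contains two groups. The second group has two dt elements sharing a single dd. The names are invented.

HTML Example:

<dl>
  <dt>Author</dt>
  <dd>Jane Doe</dd>
  <dt>Editor</dt>
  <dt>Reviewer</dt>
  <dd>John Smith</dd>
</dl>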

4.3.6 Text-Level Semantics

4.3.6.1 The a element

If the a element has an href attribute, then it represents a hyperlink.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
  • href
  • target
  • ping
  • rel
  • media
  • hreflang
  • type
DOM Interface
  • [Stringifies=href] interface HTMLAnchorElement : HTMLElement {
               attribute DOMString href;
               attribute DOMString target;
               attribute DOMString ping;
               attribute DOMString rel;
      readonly attribute DOMTokenList relList;
               attribute DOMString media;
               attribute DOMString hreflang;
               attribute DOMString type;
    };

    The Command interface must also be implemented by this element.

4.3.6.2 The q element

The q element represents a phrase quoted from another source.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
  • cite
DOM Interface
  • Uses the HTMLQuoteElement interface.
4.3.6.3 The cite element

The cite element represents the title of a work, such as an article, a book, a poem, a song, a film, or any other creative work.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.6.4 The em element

The em element represents stress emphasis of its contents.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.6.5 The strong element

The strong element represents strong importance for its contents.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.6.6 The small element

The small element represents small print (part of a document often describing legal restrictions, such as copyrights or other disadvantages), or other side comments.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.6.7 The mark element

The mark element represents a run of text in one document marked or highlighted for reference purposes, due to its relevance in another context.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.6.8 The dfn element

The dfn element represents the defining instance of a term, where its definition is provided nearby.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
  • Also, the title attribute has special semantics on this element.
DOM Interface
  • Uses HTMLElement.
4.3.6.9 The abbr element

The abbr element represents an abbreviation or acronym, optionally with its expansion.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
  • Also, the title attribute has special semantics on this element.
DOM Interface
  • Uses HTMLElement.
4.3.6.10 The time element

The time element represents a date and/or a time.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
  • datetime
DOM Interface
  • interface HTMLTimeElement : HTMLElement {
               attribute DOMString dateTime;
      readonly attribute Date date;
      readonly attribute Date time;
      readonly attribute Date timezone;
    };
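
A minimal sketch of the time element: the datetime attribute carries a machine-readable date, while the element's content is the human-readable form. The sentence is illustrative.

HTML Example:

<p>This draft was published on <time datetime="2009-03-23">23 March 2009</time>.</p>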
4.3.6.11 The progress element

The progress element represents the completion progress of a task.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
  • value
  • max
DOM Interface
  • interface HTMLProgressElement : HTMLElement {
               attribute float value;
               attribute float max;
      readonly attribute float position;
    };
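
As a brief sketch, the following progress element reports a task that is 30 percent complete. The text inside the element is fallback content for browsers that do not support progress, and the figures are invented.

HTML Example:

<p>Uploading: <progress value="30" max="100">30%</progress></p>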
4.3.6.12 The meter element

The meter element represents a scalar measurement within a known range, or a fractional value.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
  • value
  • min
  • low
  • high
  • max
  • optimum
DOM Interface
  • interface HTMLMeterElement : HTMLElement {
               attribute float value;
               attribute float min;
               attribute float max;
               attribute float low;
               attribute float high;
               attribute float optimum;
    };
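
The following sketch shows a meter describing disk usage on a hypothetical 8 GB disk. The low, high and optimum values are arbitrary choices for illustration; they mark usage under 2 GB as low, over 7 GB as high, and suggest that lower usage is preferable.

HTML Example:

<p>Disk usage: <meter value="6" min="0" max="8" low="2" high="7" optimum="1">6 of 8 GB</meter></p>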
4.3.6.13 The code element

The code element represents a fragment of computer code.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.6.14 The var element

The var element represents a variable, such as in a mathematical expression or programming context, or it could just be a term used as a placeholder in prose.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.6.15 The samp element

The samp element represents (sample) output from a program or computing system.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.6.16 The kbd element

The kbd element represents user input (typically keyboard input, although it may also be used to represent other input, such as voice commands).

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.6.17 The sub and sup elements

The sup element represents a superscript and the sub element represents a subscript.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.6.18 The span element

The span element doesn’t mean anything on its own, but can be useful when used together with other attributes, e.g. class, lang, or dir.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.6.19 The i element

The i element represents a span of text in an alternate voice or mood, or otherwise offset from the normal prose, such as a taxonomic designation, a technical term, an idiomatic phrase from another language, a thought, a ship name, or some other prose whose typical typographic presentation is italicized.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.6.20 The b element

The b element represents a span of text to be stylistically offset from the normal prose without conveying any extra importance, such as key words in a document abstract, product names in a review, or other spans of text whose typical typographic presentation is boldened.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.6.21 The bdo element

The bdo element allows authors to override the Unicode bidi algorithm by explicitly specifying a direction override. [BIDI]

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
  • Also, the dir global attribute has special semantics on this element.
DOM Interface
  • Uses HTMLElement.
4.3.6.22 The ruby element

The ruby element allows one or more spans of phrasing content to be marked with ruby annotations. Ruby annotations are short runs of text presented alongside base text, primarily used in East Asian typography as a guide for pronunciation or to include other annotations. In Japanese, this form of typography is also known as furigana.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
  • One or more groups of: phrasing content followed either by a single rt element, or an rp element, an rt element, and another rp element.
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.6.23 The rt element

The rt element marks the ruby text component of a ruby annotation.

Start tag:
required
End tag:
required
Categories:
  • None.
Contained By:
  • As a child of a ruby element.
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.6.24 The rp element

The rp element can be used to provide parentheses around a ruby text component of a ruby annotation, to be shown by user agents that don’t support ruby annotations.

Start tag:
required
End tag:
required
Categories:
  • None.
Contained By:
  • As a child of a ruby element, either immediately before or immediately after an rt element.
Content Model:
  • If the rp element is immediately after an rt element that is immediately preceded by another rp element: a single character from Unicode character class Pe.
  • Otherwise: a single character from Unicode character class Ps.
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
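
The following sketch annotates the Japanese word 漢字 (kanji) with its reading, one group per character. Each rt is wrapped in a pair of rp elements, so a browser without ruby support would be expected to show the readings in parentheses after each character.

HTML Example:

<ruby>
  漢<rp>(</rp><rt>かん</rt><rp>)</rp>
  字<rp>(</rp><rt>じ</rt><rp>)</rp>
</ruby>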

4.3.7 Edits

4.3.7.1 The ins element

The ins element represents an addition to the document.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
  • cite
  • datetime
DOM Interface
  • Uses the HTMLModElement interface.
4.3.7.2 The del element

The del element represents a removal from the document.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
  • cite
  • datetime
DOM Interface
  • Uses the HTMLModElement interface.
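
As a sketch of the two edit elements used together, the following sentence records that one word was replaced by another. The datetime values give the time of the change and the cite attribute points to an explanation of it. The timestamp and URL are placeholders.

HTML Example:

<p>The meeting is on
  <del datetime="2009-03-20T10:00:00Z">Monday</del>
  <ins datetime="2009-03-20T10:00:00Z" cite="http://www.example.com/minutes">Tuesday</ins>.
</p>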

4.3.8 Embedded Content

4.3.8.1 The figure element

The figure element represents some flow content, optionally with a caption, which can be moved away from the main flow of the document without affecting the document’s meaning.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.8.2 The img element

An img element represents an image.

Start tag:
required
End tag:
empty
Categories:
Contained By:
Content Model:
  • Empty.
Attributes
  • Global attributes
  • alt
  • src
  • usemap
  • ismap
  • width
  • height
DOM Interface
  • [NamedConstructor=Image(),
     NamedConstructor=Image(in unsigned long width),
     NamedConstructor=Image(in unsigned long width, in unsigned long height)]
    interface HTMLImageElement : HTMLElement {
               attribute DOMString alt;
               attribute DOMString src;
               attribute DOMString useMap;
               attribute boolean isMap;
               attribute unsigned long width;
               attribute unsigned long height;
      readonly attribute boolean complete;
    };
4.3.8.3 The iframe element

The iframe element introduces a new nested browsing context.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
  • Text that conforms to the requirements given in the prose.
Attributes
  • Global attributes
  • src
  • name
  • sandbox
  • seamless
  • width
  • height
DOM Interface
  • interface HTMLIFrameElement : HTMLElement {
               attribute DOMString src;
               attribute DOMString name;
               attribute DOMString sandbox;
               attribute boolean seamless;
               attribute DOMString width;
               attribute DOMString height;
    };

    Objects implementing the HTMLIFrameElement interface must also implement the EmbeddingElement interface defined in the Window Object specification. [WINDOW]

4.3.8.4 The embed element

The embed element represents an integration point for an external (typically non-HTML) application or interactive content.

Start tag:
required
End tag:
empty
Categories:
Contained By:
Content Model:
  • Empty.
Attributes
  • Global attributes
  • src
  • type
  • width
  • height
  • Any other attribute that has no namespace (see prose).
DOM Interface
  • interface HTMLEmbedElement : HTMLElement {
               attribute DOMString src;
               attribute DOMString type;
               attribute DOMString width;
               attribute DOMString height;
    };

    Depending on the type of content instantiated by the embed element, the node may also support other interfaces.

4.3.8.5 The object element

The object element can represent an external resource, which, depending on the type of the resource, will either be treated as an image, as a nested browsing context, or as an external resource to be processed by a plugin.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
  • data
  • type
  • name
  • usemap
  • form
  • width
  • height
DOM Interface
  • interface HTMLObjectElement : HTMLElement {
               attribute DOMString data;
               attribute DOMString type;
               attribute DOMString name;
               attribute DOMString useMap;
      readonly attribute HTMLFormElement form;
               attribute DOMString width;
               attribute DOMString height;
    };

    Objects implementing the HTMLObjectElement interface must also implement the EmbeddingElement interface defined in the Window Object specification. [WINDOW]

    Depending on the type of content instantiated by the object element, the node may also support other interfaces.

4.3.8.6 The param element

The param element defines parameters for plugins invoked by object elements.

Start tag:
required
End tag:
empty
Categories:
  • None.
Contained By:
Content Model:
  • Empty.
Attributes
  • Global attributes
  • name
  • value
DOM Interface
  • interface HTMLParamElement : HTMLElement {
               attribute DOMString name;
               attribute DOMString value;
    };
4.3.8.7 The video element

A video element represents a video or movie.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
  • If the element has a src attribute: transparent.
  • If the element does not have a src attribute: one or more source elements, then transparent.
Attributes
  • Global attributes
  • src
  • poster
  • autobuffer
  • autoplay
  • loop
  • controls
  • width
  • height
DOM Interface
  • interface HTMLVideoElement : HTMLMediaElement {
               attribute DOMString width;
               attribute DOMString height;
      readonly attribute unsigned long videoWidth;
      readonly attribute unsigned long videoHeight;
               attribute DOMString poster;
    };
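
The content model above can be sketched as follows: because the video element has no src attribute, it starts with source elements and ends with fallback content. The file names and MIME types are placeholders, and the formats a given browser can play vary.

```html
<!-- Hypothetical file names; codec support differs between browsers -->
<video poster="preview.png" controls width="640" height="360">
  <source src="movie.ogv" type="video/ogg">
  <source src="movie.mp4" type="video/mp4">
  <p>Your browser cannot play this video. <a href="movie.mp4">Download it instead</a>.</p>
</video>
```
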
4.3.8.8 The audio element

An audio element represents a sound or audio stream.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
  • If the element has a src attribute: transparent.
  • If the element does not have a src attribute: one or more source elements, then transparent.
Attributes
  • Global attributes
  • src
  • autobuffer
  • autoplay
  • loop
  • controls
DOM Interface
  • [NamedConstructor=Audio(),
     NamedConstructor=Audio(in DOMString src)]
    interface HTMLAudioElement : HTMLMediaElement {
      // no members
    };
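
As a brief sketch, an audio element that uses the src attribute directly might look like this. The file name theme.ogg is a placeholder for the example.

```html
<!-- theme.ogg is a hypothetical file name -->
<audio src="theme.ogg" controls loop>
  <p>Your browser cannot play this audio. <a href="theme.ogg">Download it instead</a>.</p>
</audio>
```
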
4.3.8.9 The source element

The source element allows authors to specify multiple media resources for media elements.

Start tag:
required
End tag:
empty
Categories:
  • None.
Contained By:
Content Model:
  • Empty.
Attributes
  • Global attributes
  • src
  • type
  • media
DOM Interface
  • interface HTMLSourceElement : HTMLElement {
               attribute DOMString src;
               attribute DOMString type;
               attribute DOMString media;
    };
4.3.8.10 The canvas element

The canvas element represents a resolution-dependent bitmap canvas, which can be used for rendering graphs, game graphics, or other visual images on the fly.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
  • width
  • height
DOM Interface
  • interface HTMLCanvasElement : HTMLElement {
               attribute unsigned long width;
               attribute unsigned long height;
    
      DOMString toDataURL([Optional] in DOMString type, [Variadic] in any args);
    
      Object getContext(in DOMString contextId);
    };
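
The following sketch shows one way the interface above might be used from script. The id chart is invented for the example, and "2d" is assumed as the context identifier because it is the one most widely implemented.

```html
<canvas id="chart" width="200" height="100">
  <p>Fallback text for browsers without canvas support.</p>
</canvas>
<script>
  // "chart" is a hypothetical id used only for this example.
  var canvas = document.getElementById('chart');
  // getContext returns a drawing context object for the given identifier.
  var context = canvas.getContext('2d');
  context.fillStyle = 'rgb(0, 102, 204)';
  context.fillRect(10, 10, 80, 50);
  // Called without arguments, toDataURL serialises the bitmap as a data: URL.
  var url = canvas.toDataURL();
</script>
```
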
4.3.8.11 The map element

The map element, in conjunction with any area element descendants, defines an image map.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
  • name
DOM Interface
  • interface HTMLMapElement : HTMLElement {
               attribute DOMString name;
      readonly attribute HTMLCollection areas;
      readonly attribute HTMLCollection images;
    };
4.3.8.12 The area element

The area element represents either a hyperlink with some text and a corresponding area on an image map, or a dead area on an image map.

Start tag:
required
End tag:
empty
Categories:
Contained By:
Content Model:
  • Empty.
Attributes
  • Global attributes
  • alt
  • coords
  • shape
  • href
  • target
  • ping
  • rel
  • media
  • hreflang
  • type
DOM Interface
  • interface HTMLAreaElement : HTMLElement {
               attribute DOMString alt;
               attribute DOMString coords;
               attribute DOMString shape;
               attribute DOMString href;
               attribute DOMString target;
               attribute DOMString ping;
               attribute DOMString rel;
      readonly attribute DOMTokenList relList;
               attribute DOMString media;
               attribute DOMString hreflang;
               attribute DOMString type;
    };
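
As a sketch of how map and area elements work together, the image below refers to a map by name, and each area describes one region. The image, link targets and coordinates are invented for the example.

```html
<!-- Hypothetical image, coordinates and link targets -->
<img src="floorplan.png" alt="Office floor plan" usemap="#floorplan" width="300" height="200">
<map name="floorplan">
  <area shape="rect" coords="0,50,150,200" href="kitchen.html" alt="Kitchen">
  <area shape="circle" coords="225,100,50" href="lounge.html" alt="Lounge">
  <!-- An area without an href attribute is a dead area -->
  <area shape="rect" coords="0,0,300,40">
</map>
```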

4.3.9 Tabular Data

4.3.9.1 The table element

The table element represents data with more than one dimension (a table).

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
  • In this order: optionally a caption element, followed by zero or more colgroup elements, followed optionally by a thead element, followed optionally by a tfoot element, followed by either zero or more tbody elements or one or more tr elements, followed optionally by a tfoot element (but there can only be one tfoot element child in total).
Attributes
  • Global attributes
DOM Interface
  • interface HTMLTableElement : HTMLElement {
               attribute HTMLTableCaptionElement caption;
      HTMLElement createCaption();
      void deleteCaption();
               attribute HTMLTableSectionElement tHead;
      HTMLElement createTHead();
      void deleteTHead();
               attribute HTMLTableSectionElement tFoot;
      HTMLElement createTFoot();
      void deleteTFoot();
      readonly attribute HTMLCollection tBodies;
      HTMLElement createTBody();
      readonly attribute HTMLCollection rows;
      HTMLElement insertRow([Optional] in long index);
      void deleteRow(in long index);
    };
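
The ordering described in the content model can be sketched as follows. The figures are invented, and this example places the tfoot element before the tbody element, which is one of the two positions the content model permits.

```html
<table>
  <caption>Quarterly sales (hypothetical figures)</caption>
  <colgroup><col><col span="2"></colgroup>
  <thead>
    <tr><th scope="col">Region</th><th scope="col">Q1</th><th scope="col">Q2</th></tr>
  </thead>
  <tfoot>
    <tr><th scope="row">Total</th><td>30</td><td>50</td></tr>
  </tfoot>
  <tbody>
    <tr><th scope="row">North</th><td>10</td><td>20</td></tr>
    <tr><th scope="row">South</th><td>20</td><td>30</td></tr>
  </tbody>
</table>
```
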
4.3.9.2 The caption element

The caption element represents the title of the table that is its parent, if it has a parent and that is a table element.

Start tag:
required
End tag:
required
Categories:
  • None.
Contained By:
  • As the first element child of a table element.
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.
4.3.9.3 The colgroup element

The colgroup element represents a group of one or more columns in the table that is its parent, if it has a parent and that is a table element.

Start tag:
optional
End tag:
optional
Categories:
  • None.
Contained By:
Content Model:
  • Zero or more col elements.
Attributes
  • Global attributes
  • span
DOM Interface
  • interface HTMLTableColElement : HTMLElement {
               attribute unsigned long span;
    };
4.3.9.4 The col element

If a col element has a parent and that is a colgroup element that itself has a parent that is a table element, then the col element represents one or more columns in the column group represented by that colgroup.

Start tag:
required
End tag:
empty
Categories:
  • None.
Contained By:
  • As a child of a colgroup element that doesn't have a span attribute.
Content Model:
  • Empty.
Attributes
  • Global attributes
  • span
DOM Interface
  • Uses HTMLTableColElement, as defined for the colgroup element.
4.3.9.5 The tbody element

The tbody element represents a block of rows that consist of a body of data for the parent table element, if the tbody element has a parent and it is a table.

Start tag:
optional
End tag:
optional
Categories:
  • None.
Contained By:
Content Model:
  • Zero or more tr elements.
Attributes
  • Global attributes
DOM Interface
  • interface HTMLTableSectionElement : HTMLElement {
      readonly attribute HTMLCollection rows;
      HTMLElement insertRow([Optional] in long index);
      void deleteRow(in long index);
    };

    The HTMLTableSectionElement interface is also used for thead and tfoot elements.

4.3.9.6 The thead element

The thead element represents the block of rows that consist of the column labels (headers) for the parent table element, if the thead element has a parent and it is a table.

Start tag:
optional
End tag:
optional
Categories:
  • None.
Contained By:
Content Model:
  • Zero or more tr elements.
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLTableSectionElement, as defined for the tbody element.
4.3.9.7 The tfoot element

The tfoot element represents the block of rows that consist of the column summaries (footers) for the parent table element, if the tfoot element has a parent and it is a table.

Start tag:
optional
End tag:
optional
Categories:
  • None.
Contained By:
Content Model:
  • Zero or more tr elements.
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLTableSectionElement, as defined for the tbody element.
4.3.9.8 The tr element

The tr element represents a row of cells in a table.

Start tag:
required
End tag:
optional
Categories:
  • None.
Contained By:
Content Model:
  • Zero or more td or th elements.
Attributes
  • Global attributes
DOM Interface
  • interface HTMLTableRowElement : HTMLElement {
      readonly attribute long rowIndex;
      readonly attribute long sectionRowIndex;
      readonly attribute HTMLCollection cells;
      HTMLElement insertCell([Optional] in long index);
      void deleteCell(in long index);
    };
4.3.9.9 The td element

The td element represents a data cell in a table.

Start tag:
required
End tag:
optional
Categories:
Contained By:
  • As a child of a tr element.
Content Model:
Attributes
  • Global attributes
  • colspan
  • rowspan
  • headers
DOM Interface
  • interface HTMLTableDataCellElement : HTMLTableCellElement {};
4.3.9.10 The th element

The th element represents a header cell in a table.

Start tag:
required
End tag:
optional
Categories:
  • None.
Contained By:
  • As a child of a tr element.
Content Model:
Attributes
  • Global attributes
  • colspan
  • rowspan
  • headers
  • scope
DOM Interface
  • interface HTMLTableHeaderCellElement : HTMLTableCellElement {
               attribute DOMString scope;
    };

4.3.10 Forms

4.3.10.1 The form element

The form element represents a collection of form-associated elements, some of which can represent editable values that can be submitted to a server for processing.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
  • accept-charset
  • action
  • autocomplete
  • enctype
  • method
  • name
  • novalidate
  • target
DOM Interface
  • [Callable=namedItem]
    interface HTMLFormElement : HTMLElement {
               attribute DOMString acceptCharset;
               attribute DOMString action;
               attribute boolean autocomplete;
               attribute DOMString enctype;
               attribute DOMString method;
               attribute DOMString name;
               attribute boolean novalidate;
               attribute DOMString target;
    
      readonly attribute HTMLFormControlsCollection elements;
      readonly attribute long length;
      [IndexGetter] any item(in unsigned long index);
      [NameGetter=OverrideBuiltins] any namedItem(in DOMString name);
    
      void submit();
      void reset();
      boolean checkValidity();
    
      void dispatchFormInput();
      void dispatchFormChange();
    };
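
A minimal sketch of a form that submits a single value is shown below. The /search URL and the control name q are assumptions made for the example.

```html
<!-- /search is a hypothetical URL -->
<form action="/search" method="get" accept-charset="utf-8" name="search">
  <label>Search terms: <input type="text" name="q"></label>
  <input type="submit" value="Search">
</form>
```
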
4.3.10.2 The fieldset element

The fieldset element represents a set of form controls grouped under a common name.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
  • disabled
  • form
  • name
DOM Interface
  • interface HTMLFieldSetElement : HTMLElement {
               attribute boolean disabled;
      readonly attribute HTMLFormElement form;
               attribute DOMString name;
    
      readonly attribute DOMString type;
    
      readonly attribute HTMLFormControlsCollection elements;
    
      readonly attribute boolean willValidate;
      readonly attribute ValidityState validity;
      readonly attribute DOMString validationMessage;
      boolean checkValidity();
      void setCustomValidity(in DOMString error);
    };
4.3.10.3 The label element

The label element represents a caption in a user interface. The caption can be associated with a specific form control, known as the label element’s labeled control.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
  • form
  • for
DOM Interface
  • interface HTMLLabelElement : HTMLElement {
      readonly attribute HTMLFormElement form;
               attribute DOMString htmlFor;
      readonly attribute HTMLElement control;
    };
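
The two ways of associating a label with its labeled control can be sketched as follows, here grouped inside a fieldset. The names and ids are invented for the example.

```html
<fieldset name="contact">
  <legend>Contact details</legend>
  <!-- Association through the for attribute, which refers to the control's id -->
  <p><label for="email">Email address</label> <input type="text" id="email" name="email"></p>
  <!-- Association by nesting the control inside the label -->
  <p><label>Telephone <input type="text" name="telephone"></label></p>
</fieldset>
```
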
4.3.10.4 The input element

The input element represents a typed data field, usually with a form control to allow the user to edit the data.

Start tag:
required
End tag:
empty
Categories:
Contained By:
Content Model:
  • Empty.
Attributes
  • Global attributes
  • accept
  • action
  • alt
  • autocomplete
  • autofocus
  • checked
  • disabled
  • enctype
  • form
  • height
  • list
  • max
  • maxlength
  • method
  • min
  • multiple
  • name
  • novalidate
  • pattern
  • placeholder
  • readonly
  • required
  • size
  • src
  • step
  • target
  • type
  • value
  • width
DOM Interface
  • interface HTMLInputElement : HTMLElement {
               attribute DOMString accept;
               attribute DOMString action;
               attribute DOMString alt;
               attribute boolean autocomplete;
               attribute boolean autofocus;
               attribute boolean defaultChecked;
               attribute boolean checked;
               attribute boolean disabled;
               attribute DOMString enctype;
      readonly attribute HTMLFormElement form;
               attribute DOMString height;
               attribute boolean indeterminate;
      readonly attribute HTMLElement list;
               attribute DOMString max;
               attribute long maxLength;
               attribute DOMString method;
               attribute DOMString min;
               attribute boolean multiple;
               attribute DOMString name;
               attribute boolean noValidate;
               attribute DOMString pattern;
               attribute DOMString placeholder;
               attribute boolean readOnly;
               attribute boolean required;
               attribute unsigned long size;
               attribute DOMString src;
               attribute DOMString step;
               attribute DOMString target;
               attribute DOMString type;
               attribute DOMString defaultValue;
               attribute DOMString value;
               attribute Date valueAsDate;
               attribute float valueAsNumber;
      readonly attribute HTMLOptionElement selectedOption;
               attribute DOMString width;
    
      void stepUp(in long n);
      void stepDown(in long n);
    
      readonly attribute boolean willValidate;
      readonly attribute ValidityState validity;
      readonly attribute DOMString validationMessage;
      boolean checkValidity();
      void setCustomValidity(in DOMString error);
    
      readonly attribute NodeList labels;
    
      void select();
               attribute unsigned long selectionStart;
               attribute unsigned long selectionEnd;
      void setSelectionRange(in unsigned long start, in unsigned long end);
    };
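
As an illustrative sketch, the form below combines several of the attributes listed above. The /book URL, the control names and the limits are placeholders, and support for the newer input types and constraints varies between browsers.

```html
<!-- /book is a hypothetical URL; the enctype is needed for the file upload -->
<form action="/book" method="post" enctype="multipart/form-data">
  <p><label>Name <input type="text" name="name" required placeholder="Jane Doe" maxlength="50" autofocus></label></p>
  <p><label>Guests <input type="number" name="guests" min="1" max="10" step="1" value="2"></label></p>
  <p><label>Photo <input type="file" name="photo" accept="image/*"></label></p>
  <p><label><input type="checkbox" name="news" checked> Send me news</label></p>
  <p><input type="submit" value="Book"></p>
</form>
```
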
4.3.10.5 The button element

The button element represents a button. If the element is not disabled, then the user agent should allow the user to activate the button.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
  • action
  • autofocus
  • disabled
  • enctype
  • form
  • method
  • name
  • novalidate
  • target
  • type
  • value
DOM Interface
  • interface HTMLButtonElement : HTMLElement {
               attribute DOMString action;
               attribute boolean autofocus;
               attribute boolean disabled;
               attribute DOMString enctype;
      readonly attribute HTMLFormElement form;
               attribute DOMString method;
               attribute DOMString name;
               attribute boolean noValidate;
               attribute DOMString target;
               attribute DOMString type;
               attribute DOMString value;
    
      readonly attribute boolean willValidate;
      readonly attribute ValidityState validity;
      readonly attribute DOMString validationMessage;
      boolean checkValidity();
      void setCustomValidity(in DOMString error);
    
      readonly attribute NodeList labels;
    };
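
The three common values of the type attribute can be sketched like this. The /save URL and the name and value pair are assumptions made for the example.

```html
<!-- /save is a hypothetical URL -->
<form action="/save" method="post">
  <button type="submit" name="op" value="save">Save</button>
  <button type="reset">Reset</button>
  <!-- A disabled button cannot be activated by the user -->
  <button type="button" disabled>Unavailable</button>
</form>
```
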
4.3.10.6 The select element

The select element represents a control for selecting amongst a set of options.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
  • autofocus
  • disabled
  • form
  • multiple
  • name
  • size
DOM Interface
  • [Callable=namedItem]
    interface HTMLSelectElement : HTMLElement {
               attribute boolean autofocus;
               attribute boolean disabled;
      readonly attribute HTMLFormElement form;
               attribute boolean multiple;
               attribute DOMString name;
               attribute unsigned long size;
    
      readonly attribute DOMString type;
    
      readonly attribute HTMLOptionsCollection options;
               attribute unsigned long length;
      [IndexGetter] any item(in unsigned long index);
      [NameGetter] any namedItem(in DOMString name);
      void add(in HTMLElement element, in HTMLElement before);
      void add(in HTMLElement element, in long before);
      void remove(in long index);
    
      readonly attribute HTMLCollection selectedOptions;
               attribute long selectedIndex;
               attribute DOMString value;
    
      readonly attribute boolean willValidate;
      readonly attribute ValidityState validity;
      readonly attribute DOMString validationMessage;
      boolean checkValidity();
      void setCustomValidity(in DOMString error);
    
      readonly attribute NodeList labels;
    };
4.3.10.7 The datalist element

The datalist element represents a set of option elements that represent predefined options for other controls. The contents of the element represent fallback content for legacy user agents, intermixed with option elements that represent the predefined options. In the rendering, the datalist element represents nothing and it, along with its children, should be hidden.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • interface HTMLDataListElement : HTMLElement {
      readonly attribute HTMLCollection options;
    };
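
A minimal sketch of a datalist supplying suggestions to a text field is shown below. The id browsers and the suggested values are invented for the example; the input element refers to the datalist through its list attribute.

```html
<label>Browser <input type="text" name="browser" list="browsers"></label>
<datalist id="browsers">
  <option value="Firefox">
  <option value="Opera">
  <option value="Safari">
</datalist>
```
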
4.3.10.8 The optgroup element

The optgroup element represents a group of option elements with a common label.

Start tag:
required
End tag:
optional
Categories:
  • None.
Contained By:
  • As a child of a select element.
Content Model:
Attributes
  • Global attributes
  • disabled
  • label
DOM Interface
  • interface HTMLOptGroupElement : HTMLElement {
               attribute boolean disabled;
               attribute DOMString label;
    };
4.3.10.9 The option element

The option element represents an option in a select element, or one of the suggestions listed in a datalist element.

Start tag:
required
End tag:
optional
Categories:
  • None.
Contained By:
Content Model:
  • Text.
Attributes
  • Global attributes
  • disabled
  • label
  • selected
  • value
DOM Interface
  • [NamedConstructor=Option(),
     NamedConstructor=Option(in DOMString text),
     NamedConstructor=Option(in DOMString text, in DOMString value),
     NamedConstructor=Option(in DOMString text, in DOMString value, in boolean defaultSelected),
     NamedConstructor=Option(in DOMString text, in DOMString value, in boolean defaultSelected, in boolean selected)]
    interface HTMLOptionElement : HTMLElement {
               attribute boolean disabled;
      readonly attribute HTMLFormElement form;
               attribute DOMString label;
               attribute boolean defaultSelected;
               attribute boolean selected;
               attribute DOMString value;
    
      readonly attribute DOMString text;
      readonly attribute long index;
    };
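
The select, optgroup and option elements can be combined as in the following sketch. The control name and option values are invented for the example.

```html
<select name="fruit">
  <optgroup label="Citrus">
    <option value="orange" selected>Orange</option>
    <option value="lemon">Lemon</option>
  </optgroup>
  <optgroup label="Berries">
    <option value="strawberry">Strawberry</option>
    <!-- A disabled option is shown but cannot be chosen -->
    <option value="blackcurrant" disabled>Blackcurrant</option>
  </optgroup>
</select>
```
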
4.3.10.10 The textarea element

The textarea element represents a multiline plain text edit control for the element’s raw value. The contents of the control represent the control’s default value.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
  • Text.
Attributes
  • Global attributes
  • autofocus
  • cols
  • disabled
  • form
  • maxlength
  • name
  • readonly
  • required
  • rows
  • wrap
DOM Interface
  • interface HTMLTextAreaElement : HTMLElement {
               attribute boolean autofocus;
               attribute unsigned long cols;
               attribute boolean disabled;
      readonly attribute HTMLFormElement form;
               attribute long maxLength;
               attribute DOMString name;
               attribute boolean readOnly;
               attribute boolean required;
               attribute unsigned long rows;
               attribute DOMString wrap;
    
      readonly attribute DOMString type;
               attribute DOMString defaultValue;
               attribute DOMString value;
    
      readonly attribute boolean willValidate;
      readonly attribute ValidityState validity;
      readonly attribute DOMString validationMessage;
      boolean checkValidity();
      void setCustomValidity(in DOMString error);
    
      readonly attribute NodeList labels;
    
      void select();
               attribute unsigned long selectionStart;
               attribute unsigned long selectionEnd;
      void setSelectionRange(in unsigned long start, in unsigned long end);
    };
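
As a short sketch, the text between the tags below becomes the control's default value. The control name and the limits are placeholders.

```html
<textarea name="comment" rows="5" cols="40" maxlength="500" required>Type your comment here.</textarea>
```
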
4.3.10.11 The output element

The output element represents the result of a calculation.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
  • for
  • form
  • name
DOM Interface
  • interface HTMLOutputElement : HTMLElement {
               attribute DOMString htmlFor;
      readonly attribute HTMLFormElement form;
               attribute DOMString name;
    
      readonly attribute DOMString type;
               attribute DOMString defaultValue;
               attribute DOMString value;
    
      readonly attribute boolean willValidate;
      readonly attribute ValidityState validity;
      readonly attribute DOMString validationMessage;
      boolean checkValidity();
      void setCustomValidity(in DOMString error);
    };
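
One possible use of the output element is sketched below: a script updates the result whenever either number changes. The control names a, b and result are invented for the example, and the sketch assumes a browser that fires the input event on the form and exposes the form's controls by name inside the event handler.

```html
<!-- Assumes input events reach the form and that controls are reachable by name -->
<form oninput="result.value = Number(a.value) + Number(b.value)">
  <input type="number" name="a" id="a" value="0"> +
  <input type="number" name="b" id="b" value="0"> =
  <output name="result" for="a b">0</output>
</form>
```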

4.3.11 Interactive Elements

4.3.11.1 The details element

The details element represents additional information or controls which the user can obtain on demand.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
  • open
DOM Interface
  • interface HTMLDetailsElement : HTMLElement {
               attribute boolean open;
    };
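
A minimal sketch of a details element follows. In line with the legend element's definition in this draft, the caption is given by a legend element as the first child; the text is invented, and browser support for this element may be limited.

```html
<!-- The open attribute makes the additional information visible initially -->
<details open>
  <legend>Download information</legend>
  <p>File size: 2 MB. Format: PDF.</p>
</details>
```
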
4.3.11.2 The command element

The command element represents a command that the user can invoke.

Start tag:
required
End tag:
empty
Categories:
Contained By:
Content Model:
  • Empty.
Attributes
  • Global attributes
  • type
  • label
  • icon
  • disabled
  • checked
  • radiogroup
  • default
  • Also, the title attribute has special semantics on this element.
DOM Interface
  • interface HTMLCommandElement : HTMLElement {
               attribute DOMString type;
               attribute DOMString label;
               attribute DOMString icon;
               attribute boolean disabled;
               attribute boolean checked;
               attribute DOMString radiogroup;
               attribute boolean default;
      void click(); // shadows HTMLElement.click()
    };

    The Command interface must also be implemented by this element.

4.3.11.3 The bb element

The bb element represents a user agent command that the user can invoke.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
  • type
DOM Interface
  • interface HTMLBrowserButtonElement : HTMLElement {
               attribute DOMString type;
      readonly attribute boolean supported;
      readonly attribute boolean disabled;
    };

    The Command interface must also be implemented by this element.

4.3.11.4 The menu element

The menu element represents a list of commands.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
  • type
  • label
DOM Interface
  • interface HTMLMenuElement : HTMLElement {
               attribute DOMString type;
               attribute DOMString label;
    };
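
As a sketch only, a toolbar menu built from command elements might look like this. The labels, the icon file name and the radiogroup name are invented, the type values shown are assumed from the HTML 5 draft, and browser support for these elements is likely to be limited.

```html
<!-- Hypothetical labels and icon; type values assumed from the HTML 5 draft -->
<menu type="toolbar" label="View">
  <command type="command" label="Refresh" icon="refresh.png">
  <command type="checkbox" label="Show hidden files" checked>
  <command type="radio" radiogroup="layout" label="List view" checked>
  <command type="radio" radiogroup="layout" label="Grid view">
</menu>
```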

4.3.12 Miscellaneous Elements

4.3.12.1 The legend element

The legend element represents a title or explanatory caption for the rest of the contents of the legend element’s parent element.

Start tag:
required
End tag:
required
Categories:
  • None.
Contained By:
  • As the first child of a fieldset element.
  • As the first child of a details element.
  • As a child of a figure element, if there are no other legend element children of that element.
Content Model:
Attributes
  • Global attributes
DOM Interface
  • interface HTMLLegendElement : HTMLElement {
      readonly attribute HTMLFormElement form;
    };
4.3.12.2 The div element

The div element represents nothing at all. It can be used with the class, lang/xml:lang, and title attributes to mark up semantics common to a group of consecutive elements.

Start tag:
required
End tag:
required
Categories:
Contained By:
Content Model:
Attributes
  • Global attributes
DOM Interface
  • Uses HTMLElement.

5 Index of Elements

5.1 Conforming Elements

Element Start Tag End Tag Short Description Notes
a required required Hyperlink
abbr required required Abbreviation
address required required Contact information
area required empty Image map region
article required required Independent section
aside required required Auxiliary section
audio required required Audio stream
b required required Bold text
base required empty Document base URI
bb required required Browser button
bdo required required Bi-directional text override
blockquote required required Long quotation
body optional optional Main content
br required empty Line break
button required required Push button control
canvas required required Bitmap canvas
caption required required Table caption
cite required required Citation
code required required Code fragment
col required empty Table column
colgroup optional optional Table column group
command required empty Command that a user can invoke
datagrid required required Interactive tree, list or tabular data
datalist required required Predefined control values
dd required optional Description description
del required required Deletion
details required required Additional information
dfn required required Defining instance of a term
dialog required required Conversation
div required required Generic division
dl required required Description list
dt required optional Description term
em required required Stress emphasis
embed required empty Embedded application
fieldset required required Form control group
figure required required Figure with a caption
footer required required Section footer
form required required Form
h1 required required Heading level 1 The heading level is also affected by sectioning elements
h2 required required Heading level 2
h3 required required Heading level 3
h4 required required Heading level 4
h5 required required Heading level 5
h6 required required Heading level 6
head optional optional Document head
header required required Section header
hr required empty Separator
html optional optional Document root
i required required Italic text
iframe required required Inline frame
img required empty Image
input required empty Form control
ins required required Insertion
kbd required required User input
label required required Form control label
legend required required Explanatory title or caption
li required optional List item
link required empty Link to resources
map required required Client-side image map
mark required required Marked or highlighted text
menu required required Command menu
meta required empty Metadata
meter required required Scalar measurement
nav required required Navigation
noscript required required Alternative content for no script support
object required required Generic embedded resource
ol required required Ordered list
optgroup required optional Option group
option required optional Selection choice
output required required Output control
p required optional Paragraph
param required empty Plugin parameter
pre required required Preformatted text
progress required required Progress of a task
q required required Inline quotation
rp required required Ruby parenthesis
rt required required Ruby text
ruby required required Ruby annotation
samp required required Sample output
script required required Linked or embedded script
section required required Document section
select required required Selection control
small required required Small print
source required empty Media resource
span required required Generic inline container
strong required required Strong importance
style required required Embedded stylesheet
sub required required Subscript
sup required required Superscript
table required required Table
tbody optional optional Table body
td required optional Table cell
textarea required required Multi-line text control
tfoot optional optional Table footer
th required optional Table header cell
thead optional optional Table head
time required required Date and/or time
title required required Document title
tr required optional Table row
ul required required Unordered list
var required required Variable
video required required Video or movie

5.2 Obsolete Elements

These elements are obsolete and should not be used by authors. They are documented here because browsers still support them, and each entry notes conforming alternatives that may be used instead.

This list may be incomplete. Please report any missing elements.

Element Start Tag End Tag Short Description Notes
acronym required required Acronym Use the abbr element.
applet required required Java applet Use the object element.
basefont required empty Base font style This has limited support in browsers. Use CSS instead.
bgsound required empty Use the audio element.
big required required Use a semantically appropriate element with CSS for style.
blink required required CSS provides an alternative with limited browser support, but note that blinking text is annoying.
center required required Use a semantically appropriate element with CSS for style.
dir required required Use the ul element.
font required required Font style Use a semantically appropriate element with CSS for style.
frame required empty Consider using CSS layouts or the iframe element.
frameset required required Consider using CSS layouts or the iframe element.
isindex required empty Use a form with a text input and submit button.
listing required required Preformatted text Use the pre element.
marquee required required Scripting or CSS animations can be used to simulate scrolling text.
nobr required required Use a semantically appropriate element with CSS for style.
noembed required required
noframes required required
plaintext required required Preformatted text Use the pre element.
s required required Consider using the del element, if appropriate, or another semantically appropriate element with CSS for style.
spacer required required Use CSS layout techniques.
strike required required Consider using the del element, if appropriate, or another semantically appropriate element with CSS for style.
tt required required Teletype Consider using the code element, if appropriate, or another semantically appropriate element with CSS for style.
u required required Use a semantically appropriate element with CSS for style.
wbr required empty
xmp required required Preformatted text Use the pre element.

5.3 Comparison of HTML 4.01 and HTML 5 Elements

Element     HTML 4.01/XHTML 1.0  HTML 5  Short Description
a           strict               yes     Hyperlink
abbr        strict               yes     Abbreviation
acronym     strict               -       Acronym
address     strict               yes     Contact information
applet      transitional         -       Java applet
area        strict               yes     Image map region
article     -                    yes     Independent section
aside       -                    yes     Auxiliary section
audio       -                    yes     Audio stream
b           strict               yes     Bold text
base        strict               yes     Document base URI
basefont    transitional         -       Base font style
bb          -                    yes     Browser button
bdo         strict               yes     Bi-directional text override
bgsound     -                    -
big         strict               -
blink       -                    -
blockquote  strict               yes     Long quotation
body        strict               yes     Main content
br          strict               yes     Line break
button      strict               yes     Push button control
canvas      -                    yes     Bitmap canvas
caption     strict               yes     Table caption
center      transitional         -
cite        strict               yes     Citation
code        strict               yes     Code fragment
col         strict               yes     Table column
colgroup    strict               yes     Table column group
command     -                    yes     Command that a user can invoke
datagrid    -                    yes     Interactive tree, list or tabular data
datalist    -                    yes     Predefined control values
dd          strict               yes     Description description
del         strict               yes     Deletion
details     -                    yes     Additional information
dfn         strict               yes     Defining instance of a term
dialog      -                    yes     Conversation
dir         transitional         -
div         strict               yes     Generic division
dl          strict               yes     Description list
dt          strict               yes     Description term
em          strict               yes     Stress emphasis
embed       -                    yes     Embedded application
fieldset    strict               yes     Form control group
figure      -                    yes     A figure with a caption.
font        transitional         -       Font style
footer      -                    yes     Section footer
form        strict               yes     Form
frame       frameset             -
frameset    frameset             -
h1          strict               yes     Heading level 1
h2          strict               yes     Heading level 2
h3          strict               yes     Heading level 3
h4          strict               yes     Heading level 4
h5          strict               yes     Heading level 5
h6          strict               yes     Heading level 6
head        strict               yes     Document head
header      -                    yes     Section header
hr          strict               yes     Separator
html        strict               yes     Document root
i           strict               yes     Italic text
iframe      transitional         yes     Inline frame
img         strict               yes     Image
input       strict               yes     Form control
ins         strict               yes     Insertion
isindex     transitional         -
kbd         strict               yes     User input
label       strict               yes     Form control label
legend      strict               yes     Explanatory title or caption
li          strict               yes     List item
link        strict               yes     Link to resources
listing     -                    -       Preformatted text
map         strict               yes     Client-side image map
mark        -                    yes     Marked or highlighted text
marquee     -                    -
menu        transitional         yes     Command menu
meta        strict               yes     Metadata
meter       -                    yes     Scalar measurement
nav         -                    yes     Navigation
nobr        -                    -
noembed     -                    -
noframes    frameset             -
noscript    strict               yes     Alternative content for no script support
object      strict               yes     Generic embedded resource
ol          strict               yes     Ordered list
optgroup    strict               yes     Option group
option      strict               yes     Selection choice
output      -                    yes     Output control
p           strict               yes     Paragraph
param       strict               yes     Plugin parameter
plaintext   -                    -       Preformatted text
pre         strict               yes     Preformatted text
progress    -                    yes     Progress of a task
q           strict               yes     Inline quotation
rp          -                    yes     Ruby parenthesis
rt          -                    yes     Ruby text
ruby        -                    yes     Ruby annotation
s           transitional         -
samp        strict               yes     Sample output
script      strict               yes     Linked or embedded script
section     -                    yes     Document section
select      strict               yes     Selection control
small       strict               yes     Small print
source      -                    yes     Media resource
spacer      -                    -
span        strict               yes     Generic inline container
strike      transitional         -
strong      strict               yes     Strong importance
style       strict               yes     Embedded stylesheet
sub         strict               yes     Subscript
sup         strict               yes     Superscript
table       strict               yes     Table
tbody       strict               yes     Table body
td          strict               yes     Table cell
textarea    strict               yes     Multi-line text control
tfoot       strict               yes     Table footer
th          strict               yes     Table header cell
thead       strict               yes     Table head
time        -                    yes     Date and/or time
title       strict               yes     Document title
tr          strict               yes     Table row
u           transitional         -
ul          strict               yes     Unordered list
var         strict               yes     Variable
video       -                    yes     Video or movie
wbr         -                    -
xmp         -                    -       Preformatted text

6 How to Read This Guide

This section needs major revision and may be dropped.

6.1 Conventions

To ease readability and improve understanding, this document uses a number of conventions.

6.1.1 Notes, Tips and Warnings

Notes are used throughout this document to provide additional information. Tips are used to provide useful hints and suggestions. Warnings are used to point out common authoring errors and highlight important issues to be aware of.

[Need to provide examples of these]

6.1.2 Example Markup

Example markup is provided for both HTML and XHTML. In some cases the markup is the same and only one example is needed, but in others there are syntactic differences. Where HTML and XHTML differ, separate examples are given, each clearly labelled.

HTML Example:

<!DOCTYPE html>
<html lang="en">
<head>
  <title>HTML Example</title>
</head>
<body>
  <p>This is a sample HTML document.
</body>
</html>

XHTML Example:

<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en">
<head>
  <title>XHTML Example</title>
</head>
<body>
  <p>This is a sample XHTML document.</p>
</body>
</html>

Sometimes, erroneous examples are included. This is usually done to illustrate common authoring errors, bad practices and other issues to be cautious of.

Erroneous Example:

<p>This markup contains a <em><strong>mistake</em></strong></p>
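
A corrected version of the markup above, given here only as an illustration, closes the elements in the reverse of the order in which they were opened, so that strong is closed before em. The same markup is acceptable in both HTML and XHTML.

```html
<p>This markup contains a <em><strong>correction</strong></em></p>
```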

6.1.2.1 Attributes

Unless explicitly stated otherwise for a specific purpose, all attribute values in examples are quoted using double quotes. In HTML examples, boolean attributes are written in their minimised form and in XHTML examples, they are written in expanded form.

HTML Example:

<input type="checkbox" checked>

XHTML Example:

<input type="checkbox" checked="checked"/>
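
As a rough illustration of what this quoting convention excludes, HTML syntax also permits single-quoted attribute values and, for simple values such as a single word, unquoted ones. The alternative forms below are shown only for comparison and are not used elsewhere in this guide; XHTML requires attribute values to be quoted.

```html
<!-- Double quotes: the form used in this guide's examples -->
<input type="checkbox" checked>

<!-- Single quotes: also permitted in HTML and XHTML -->
<input type='checkbox' checked>

<!-- Unquoted: permitted in HTML for simple values, but not in XHTML -->
<input type=checkbox checked>
```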

6.1.2.2 Void Elements

In XHTML examples, due to the XML Well-Formedness requirements, void elements are always marked up using the trailing slash.

XHTML Example:

<img src="image.png" alt="example"/>

In HTML, however, the trailing slash is optional and, unless explicitly stated otherwise, is omitted from the examples in this guide.

HTML Example:

<img src="image.png" alt="example">

6.1.2.3 Namespaces

Some XHTML examples make use of XML namespaces. In such cases, the following prefixes are assumed to be defined even if there are no xmlns attributes in the fragment of code.

xml   http://www.w3.org/XML/1998/namespace
html  http://www.w3.org/1999/xhtml
math  http://www.w3.org/1998/Math/MathML
svg   http://www.w3.org/2000/svg

XHTML Example:

<html xml:lang="en">
...
</html>

XHTML Example:

<div>
<svg:svg><svg:circle r="50" cx="50" cy="50" fill="green"/></svg:svg>
</div>
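
To illustrate what these assumed prefixes stand for, the fragment above could be written out with its declarations as in the following sketch. Placing the xmlns attributes on the div element is an assumption made for this illustration; in a complete document they could equally appear on an ancestor such as the root element. The xml prefix is bound by the XML Namespaces specification itself and needs no declaration.

```html
<!-- The default (XHTML) namespace and the svg prefix are declared explicitly -->
<div xmlns="http://www.w3.org/1999/xhtml"
     xmlns:svg="http://www.w3.org/2000/svg">
<svg:svg><svg:circle r="50" cx="50" cy="50" fill="green"/></svg:svg>
</div>
```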