HTML for the English Major: A Tutorial

This document presents a few bare essentials of HTML. HTML is a method for marking documents up to make their structures explicit, so that machines can read them. HTML has gone through many versions since its invention a little over a decade ago. This tutorial presents the latest and best version of HTML that is in widespread use, called XHTML 1.0 Strict. This tutorial is written for nontechnical readers and introduces a handful of the most important markup elements. This document also demonstrates all of its own principles: everything mentioned in it is put to use in it, and, conversely, all the structural elements it contains are explained in its content. In order to keep things simple, it does not have much graphic styling. Use of Cascading Style Sheets (CSS) to improve the appearance of a document will be covered in a separate tutorial. A tiny bit of CSS will be introduced near the end of this tutorial, however, in order to help make sense out of some HTML features.

Concept of HTML in a Nutshell
Getting Started
Block-Level Container Elements
Empty (Standalone) Elements
Inline Container Elements
<div> and <span>
Comments:
Head (<head> ... </head>) and Body (<body> .... </body>)
Document Type Declaration
Character Entities
Writing HTML
Validate!
"How Shall the World be Served?"
Resources for Further Information
Thanks

Concept of HTML in a Nutshell

HTML stands for Hypertext Markup Language. It is not a computer programming language. Rather, HTML is a small set of symbols used in a text document to clarify the structure of the document in such a way that software such as a Web browser can present that structure appropriately. For instance, a paragraph is a structural element of a document. Most browsers will, by default, present a paragraph with wordwrap until the last line and with a line of vertical space above and below the paragraph. Other document elements include headers, lists, and tables.

HTML cannot control how a document appears in a Web browser window, because browsers, media, hardware, connection speeds, and users' needs vary widely. Instead, HTML clarifies the structure of a document, and it is up to the individual browser to present that structure suitably.

Browsers ignore any extra space you have in your HTML code, such as tabs or extra lines, so you can format your HTML code any way you want without affecting the appearance of the document in the browser window. The browser uses the HTML symbols alone to determine how to display the content. Spacing is good to use in your code for your own purposes, such as to help you and other coders find elements when revising the document.

HTML should be considered an extension of punctuation. Over the centuries, punctuation has been developed to get across in writing structures of language that cannot be represented with letters. A capital letter signals the beginning of a sentence and a period the end. Commas designate clauses and parenthetic phrases. But, beyond the sentence level, writing relies heavily on spacing and font selection to express structural relationships. These things would vary greatly from computer to computer, depending on the size of the screen, memory capacity, graphical resources, etc. Hence, HTML does not use spaces to express structure, but, instead, symbolic markup, which each computer can interpret appropriately. And, just as Spanish punctuation lets you know where a question or exclamation begins as well as ends by enclosing it in ¡ ... ! or ¿ ... ?, likewise HTML encloses structural elements in pairs of symbols, one at the beginning and one at the end. HTML is conventional and consistent enough for computers to read it mechanically.

Elements can be divided up into two groups: containers and empty, or standalone, elements. Most elements of a document are containers — they contain text and certain other elements. For instance, a paragraph contains plain text and may contain emphatic or strong text or even an image. Other containers include tables, ordered and unordered lists, the list items that those lists contain, and emphatic text, which contains the text to be emphasized. Empty elements include images, horizontal rules (horizontal dividing lines), and line breaks that may occur within paragraphs for special purposes such as verse or addresses.

There are rules for what can contain what. Paragraps may not contain paragraphs within them. Table cells and list items may contain paragraphs and anything paragraphs contain, but paragraphs may not contain tables or lists. Lists may only contain list items. The list items, in turn, may contain various other elements. (This is explained better below, under "Ordered Lists.")

The HTML symbols that designate structural elements of a document are called tags. Elements that contain text or other elements begin with an opening tag and end with a closing tag. The opening tag consists of a symbol for the element enclosed in the < and > characters: <p> for a paragraph, <table> for a table. The closing tag has an additional / character to distinguish it from the opening tag: </p> and </table>. So a paragraph begins with the symbol <p> and ends with the symbol </p>, with its content coming between the two tags. Empty elements, such as images and horizontal rules, are handled differently. Empty elements consist of single tags that open with the < character and end with a space (or line break in your code, which is interpreted as a space), followed by />:

    <hr />

    <img
        src="fun.gif"
        title="Image of us having fun."
        alt="We have been having fun."
    />

Any opening or standalone tag may contain, after the name of the element (such as p or img) one or more modifying attributes. An attribute specifies some ... attribute of the element. In the second example above, the img element has three attributes: src, title, and alt. The src attribute, in this case, provides the name (and perhaps the path, or location) of the file containing the image code. The title attribute provides text to appear in case the image does not arrive in a graphical browser, or when the user rolls the mouse so that the pointer hovers over the image. The alt attribute provides text to be rendered in a nongraphical browser, such as a text browser, a braille browser, or an audio browser.

Note that the attribute is always specified by name with an equals sign and then the value of the attribute, in quotation marks. There are no exceptions.

HTML document elements can also be divided up between those that define an area of the document and those that surround a word or phrase. The first type of element is called a block-level element. Paragraphs, lists, and tables are block-level elements. The other type of element is called an inline element. Examples of inline elements are emphatic phrases, strong-appearing phrases, book titles, and links. Inline elements must always be contained within block-level container elements. For instance, emphatic text must always appear within a paragraph or list item or some such larger structure.

Name	E-Mail	Telephone
Amittai Aviram	avirama@gwm.sc.edu	777-2058
Stan Dubinsky	dubinsk@vm.sc.edu	777-2056
Judith James	jamesj@gwm.sc.edu	777-5063
William Richey	richeyw@gwm.sc.edu	777-2054

HTML for the English Major: A Tutorial

Contents

Concept of HTML in a Nutshell

Getting Started

Start copying (below this line).

Stop copying (right above this line).

Block-Level Container Elements

Paragraphs

Headers

Ordered Lists and List Items

Unordered Lists and List Items

Tables, Table Rows, and Table Cells (or "Table Data")

Block Quotations

Empty (Standalone) Elements

Images

(Incomplete markup — do not copy this example!)

Horizontal Rules

Line Breaks

Markup

Rendered

Inline Container Elements

Emphatic Text, Strong Text, and Title Citations

The Holdover <b>, The Still-Useful <i>, and a Glance at Cascading Style Sheets

Code:

Result:

Links ("Anchors")

Abbreviations and Acronyms

<div> and <span>

Centered Text

Comments

Head and Body

Document Type Declaration

Character Entities

Writing HTML

Validate!

"How Shall the World Be Served?"

Resources for Further Information

Thanks