../ -- html.html

HTML Style Guides

updated 2007-02-19.

"Everything I know about HTML and CGI" . .

Contents:

how I set up my web pages
tutorials
What colors should I set? (and a surprising answer)
Why write a FAQ ? Or put anything else on the web, for that matter ?
reference
validation tools
design advice
DOCTYPEs #doctypes
About CGI
- ampersands, semicolons, and CGI scripts
every document has metadata
backlinks are a nifty tool that you might consider adding to your web pages.
General User Interface tips and traps
misc
unsorted

how I set up my web pages

My goal is to make stuff on my web site maximally visible. (If this isn't your goal, why bother to put anything up ?)

My major goals for this site are

to help people (including myself) (by collecting content, and formatting it to make it easy for people (including myself) to use).
Why ? ``Buy the truth, and sell it not: also wisdom, and instruction, and understanding.'' -- Proverbs 23:23.
to make it easy for other people to help me. (My email address is *visible* at the bottom of *every* web page, so when people read my page and get a idea for improving the page, they can immediately tell me about it.)

I wanted to

make it easy for anyone to see a complete list of all my documents
make it obvious which documents have and haven't been viewed yet
make it easy to view any document listed above.

To do this, I

make my root directory contain *only* "index.html" and "html/" and "image/" and "mirror/" subdirectories.
make that the *only* "index.html" file anywhere on my sub-web.

[um, actually, I have a few other directories, but I *intend* to move everything to these directories]

Since all my pages (except "index.html") are in my "html/" subdirectory, it's easy for most browsers to get a exhaustive list of *all* my documents by viewing http://www.rdrop.com/~cary/html/ . (And a exhaustive list of all my images by looking in my "image/" directory).

With a book, it's obvious how large it is and how much left there is to view. With a web page, I'm often left wondering, "OK, I've seen 3 interesting pages. Is that it ? Is there just one more page ? Or does this site go on and on for a few thousand more pages ?"

I know I find it frustrating when I'm wandering around some interesting site, getting that deja-vu feeling from seeing the same documents over and over, wondering if I missed some fascinating page somewhere. I try to mention every sub-folder in index.html, letting users know they exist and letting them get there with a simple click rather than manually editing a URL.

While I really like the "previous", "up", "next" concepts recommended in the Style Guide for Online Hypertext and will put them on my pages Real Soon Now, I'm not exactly sure what to do on the last page. I could

"disable" that link, making the last page "special" and different from all the other leaf pages, or
make the "next" link and the "up" link point the same place, so starting from that page hitting "next" repeatedly goes back up to the index, then down to the first leaf (?), cycles through all the leafs, back up to the index,...

( Joshua Kaufman http://unraveled.com/joshua/ has some rants on good and bad "prev" and "next" links )

I'm thinking that, since it's that annoying "index.html" file that is getting in the way of maximum visibility, that I should eliminate it entirely -- or perhaps segregate it as "index/index.html".

For stuff specific to my own site, I start at the root URL http://rdrop.com/ and select "Documentation" | "Authoring web pages on Agora".

Everything I know about HTML (including advice on how to write "good" HTML) comes from these sources:

tutorials

Tutorials and information on the HTML standard (You may not need this if you use a WYSIWYG web page editor like "GNN Press" or "Microsoft Front Page Express", or "export HTML" from Microsoft Word ), but you definitely need to know this if you're going to look at the raw code in a text editor:

Webmonkey for Kids http://www.hotwired.com/webmonkey/kids/
How Can You Create a Web Page for Yourself? Eight Tips on using HTML and constructing Web pages http://www.ithaca.edu/library/training/htmlhints.html
Webmonkey has a bunch of tutorial crash courses http://www.hotwired.com/webmonkey/collections/crash_courses.html in web development, including PERL, Dynamic HTML, Cascading stylesheets, JavaScript, and Web Database. ... http://hotwired.lycos.com/webmonkey/design/ looks very helpful if you want to make websites that are (in my opinion) full of excessive eye-candy. ... http://hotwired.lycos.com/webmonkey/ [FIXME: lots of stuff here I haven't read yet]
Crash course on writing documents for the Web http://www.zdnet.com/pcweek/eamonn/crash_course.html
Crash Course in HTML http://www.webhelp.org/ (pretty good ... except for the bad advice on nbsp).
the Five-Minute HTML Quick Start http://www.projectcool.com/developer/tips/quickstart/ "The most important thing to remember? This is fun! Don't let anyone scare you away or psyche you out by telling you how difficult it is or what you can do wrong. You can't break anything! And as long as you like your pages, that's what counts. "
Tutorials @ W3C http://www.w3.org/2002/03/tutorials.html links to
- "Getting started with HTML" by Dave Raggett http://www.w3.org/MarkUp/Guide/ brief guide to writing HTML using a simple text editor such as NotePad on Windows.
the "NCSA (at UIUC) Beginner's Guide to HTML." (has a link to HTML validation service) which in turn points to an online listing of HTML editors (organized by platform).
"So You Wanna Be A WebMaster" was not very helpful when I last visited (1997 Dec 18) http://www.sable.com/homepage101/web-schl.shtml
Use of ALT texts in IMGs http://ppewww.ph.gla.ac.uk/~flavell/alt/alt-text.html. This is important if you want your page to be indexed properly by the indexing robots (and other reasons, such as "Do not curse the deaf or put a stumbling block in front of the blind" -- Lev. 19:14).
Learn to Program HTML in 21 Minutes by Philip Greenspun http://www.arsdigita.com/books/panda/html.html (old version: http://philip.greenspun.com/wtr/dead-trees/53003.htm ) [FIXME: to read]

color

The default background on many popular web browsers is a ugly grey. There are 2 ways to fix this:

Let your reader set the default background to the reader's own favorite colors.
As the author, set *all* the page colors with something like
```
<html>
...
<STYLE type="text/css">
	BODY { background: black; color: white}
	A:link { color: yellow } 
	A:visited { color: fuchsia }
	A:active { color: white }
</STYLE>
</head>
```
(setting *only* the background color is a bad idea according to http://www.w3.org/TR/REC-html40/types.html#h-6.5.1 ). You must do this if you set a background image.

I recommend option 1 for all but the most ultra-graphics-intensive "eye candy" pages. Some people like black on yellow, other like bright green on black. I personally prefer black text on a white background. Wouldn't you rather read text on your own favorite background and foreground colors ? I still don't understand why some authors believe that every reader has exactly the same favorite colors. Some people *like* lots and lots of tiny little letters filling the screen. Other people *like* using very large fonts. Why not give people what they want ?

Every time I move to a new machine, I set up my favorite colors.

In FireFox, I choose

[T]ools | [O]ptions... | Content | [C]olors...

then set the colors, then

OK

In Microsoft IE 4.0, I choose

[V]iew | Internet [O]ptions... | General | C[o]lors

then set the colors, then

OK | OK

Why write a FAQ ?

Q: Why would anyone put time and effort into a FAQ, then give it away for free ? Or put anything else on the web, for that matter ?

"Abigail's Dream" This page has stirred stronger emotions in me than any other web page I've ever seen. And it does it without any graphics or mood-setting music. Mirrored at http://public.logica.com/~stepneys/int/abigail.htm and at http://www.foad.org/~abigail/WWW/dream.html /* was http://cthulhu.mandrake.net/~abigail/ | http://cthulhu.mandrake.net/~abigail/WWW/dream.html . */
"Fluff and other whistle and bells never impressed Abigail. She wants the web to be a library, a meeting place, a communication channel. All the content free blitz only distracts, it does not contribute. Join her in her crusade! Help her to make the Web a better place." -- Abigail

Abigail says it better than DAV's original attempt an an explanation:

A1: Pseudo-rational explanation:
Game Theory shows that "write up an answer and send it to the FAQ Maintainer" is a better strategy than "hoard this knowledge to myself".
The proof is outside the scope of this FAQ. (It saves an "expert" time by allowing him to answer common questions once and for all by giving a newbie a copy of the appropriate FAQ, rather than patiently explain, for the umpteenth time, "What is PCMCIA ?" or "How can I tell if my Pentium has the FDIV bug?" or "Can I fake a keyboard so my computer will boot without it?" or "How do I rotate a 3D point?".)
A2: Metaphysical explanation:
I know of no religious/theological/ethical literature that mentions PC Cards specifically. The FAQ-making process, however, has roots in the 2nd command (Lev.19:18), knowledge (Prov.23:23), apathy (James.4:17), and the "answer a fool" paradox of Prov. 26:4,5.
Or perhaps I have no free will, I am doomed to produce this thing -- see the Minkowski block-universe explanation in _Time Machines_ by Paul J. Nahin.
A3: Psychological explanation:
David's ego is stroked by responses that say "thanks", "well done".
David's ego believes he's the only entity in the universe intelligent enough to write a FAQ on PC Cards.
A4: Psychotic explanation:
tHe vOiCeS iN mY hEaD tOlD mE tO dO iT. yOu mUsT oPeN yOUr mInd tO uS. rEsIsTanCe iS fUtIlE. hUmOr iS pOwEr.
Q: Why do I get the impression you're not taking me seriously ?

reference

Exhaustive references to the HTML standard. These are useful as a reference after you've read one of the above tutorials. All the picky little details. ( "character entity references" si_metric_faq.html#iso8859 , how to display greek and other symbols ...) These generally have pointers to validation tools and other related links.

A HTML quick reference sheet http://www.cc.ukans.edu/~acs/docs/other/HTML_quick.shtml /* was http://kuhttp.cc.ukans.edu/lynx_help/HTML_quick.html */
Of course, http://www.w3.org/ is the definitive reference for HTML.
http://www.htmlhelp.com/ has a "HTML Help BBS" where novices and experts can ask questions. (also links to CSSCheck, a Cascading Style Sheets lint, and other validators).
Style Sheets http://www.w3.org/Style/ is the definitive reference for "Style Sheets".

HTML validation tools

I need to occasionally check my web site for problems. These tools make it easy to find the most common problems with my web page:

occasionally I accidentally type in a something that makes my page completely unreadable in most web browsers, except the one I happen to use today.
Often, a page I link to moves or disappears.
occasionally I mis-spell a word. You have perfect spelling, right ?

You can skip this section if you never make mistakes.

Bobby, http://www.cast.org/bobby/ "Bobby is a free web-based service that will help you make web pages accessible to people with disabilities. It will also find HTML compatibility problems that prevent pages from displaying correctly on different web browsers."
Doctor HTML, a Web page analysis tool http://www2.imagiware.com/RxHTML/ | http://www2.imagiware.com/RxHTML/htdocs/single.html
``You Call That Web Site Testing?'' article by Adrian Roselli http://www.evolt.org/article/You_Call_em_That_em_Web_Site_Testing/25/2396 has a good list of tools, and some scathing commentary.
a spell checker for WWW documents. Enter a URL, and this program will retrieve the document and spell check it. Any HTML markup is automatically ignored. http://www.goldendome.net/Tools/WebSter/
http://siteinspector.linkexchange.com/ not only checks HTML Validity, Spelling and validates links going out *from* your page, but also looks for backlinks *to* your page.
HTML Validation Tools http://www.weblint.org/links.html (points to most of the tools I list here) http://www.weblint.org/ /* was http://www.cre.canon.co.uk/~neilb/weblint.html */
Weblint Gateways http://www.weblint.org/gateways.html (includes a German interface)
W3C HTML Validation Service http://validator.w3.org/
W3C CSS Validation Service http://jigsaw.w3.org/css-validator/
http://www.tetrion.com/htmlvalidator.html
http://www.opposite.com
http://www.spyglass.com/products/validator
http://www.khoros.unm.edu/staff/neilb/weblint/gateways.html
http://www.webtechs.com/html-val-svc
NetMechanic: checks your site for broken links, bad tags, and poor response time. http://www.netmechanic.com/
Kinder, Gentler HTML Validator http://ugweb.cs.ualberta.ca/~gerald/validate/ .
more HTML validation tools at ftp://ftp.math.utah.edu/pub/sgml .
http://www.yahoo.com/Computers_and_Internet/Software/Data_Formats/HTML/Validation_Checkers/ is a even longer list like this one.
WWW Test Pattern (a few compliance tests and online test tools) http://www.uark.edu/~wrg/
Validators and Document Checkers http://www.htmlhelp.com/links/validators.htm a long list, similar to this list.
HTML validator http://www.crossmyt.com/hc/htmlchek/htmlchek.html /* was http://uts.cc.utexas.edu/~churchh/htmlchek.html */ Has freely usable code, but hasn't been updated since 1995.
not exactly a validator ... Once you've uploaded your web page, you can use this tool to see what it looks like on the Mac "Safari" web browser. "useful for troubleshooting CSS and other cross-browser querks." http://danvine.com/icapture/

design advice

DAV: I encourage you to take all the useful information you know and stick it on the web first, before you even *think* about making it look "pretty".

Don't get so caught up in making it look pretty that you forget why you make web pages .

Then make it easy for people to comment on your page by putting your email address on that page (if you're worried about spam, put your email address in a .png picture so it can only be read by actual humans), and by properly linking your page to backlinks #backlinks , and perhaps a guest log, a relevant wiki, and/or a feedback form feedback.html .

Also make it easy for yourself or future maintainers to remember information about your page: Embed metadata #metadata into the page (perhaps in comments).

Once you put your own information up, you should provide links to other related information -- such as people with the same name as your own name #same_name

Only after that information is online should you even think about the following design advice.

It seems that most sites have these pages:

"Home", the root page ("index.html"): the company logo
about ("about us"): information about the company: the name of the president, when the company started, a summary of what the company does, the company motto;
contact ("contact us"): physical address(es) with a map of how to get there; phone numbers and fax numbers; email addresses or feedback forms .
careers: job postings. How exactly do you want people looking for a job to contact you: Do you want resumes in Word Perfect format ? Sometimes this is a specialized form to make sure that person fills in all the blanks. Personal web sites and non-profit organizations typically have a more informal "How you can help me" page.
site map (typically there's a search tool here and on the root page)
services
products : Paul Cary: "If you ever make a web page for a product, somewhere on the page you should have a button to a page that tells you exactly how to buy it and what the current price is."
privacy policy. Many people are told ( http://support.netdoor.com/email/spam.html ) "if you can't find their privacy statement, don't enter your address.". So if you don't have one on your website those people will refuse to tell you their email address. Every page that asks for an email address should link directly to this privacy policy / anti-spam statement. (see periodical.html#bad_things )

... will not send spam ... will not give or sell the addresses I collect to anyone who may send spam ... ... will use any collected email addresses a maximum of X times before deleting them, so if you simply do not respond to any email from us, you will receive a maximum of X messages from us ...
... opt-in ... I get far too much spam myself. I refuse to support any more spam. ...
"server load status" page. (optional, unless you're trying to sell advertising space).

The following links have some really good ideas (design principles)(style)(advice): for "giving your readers a pleasant viewing experience".

"Content-centered Web design" by Jorn Barger July 2000 http://robotwisdom.com/web/
"Patterns for Personal Web Sites" by Mark L. Irons http://rdrop.com/~half/Creations/Writings/Web.patterns/
the "Cranky User" series by Peter Seebach http://herd.plethora.net/~seebs/ops/ibm/ has some articles -- such as "How not to make your site accessible" "Curbing JavaScript dependency" -- on web site design.
"How To Market Your Website" http://www.getmoredone.com/tips8.html has good tips. I'm not sure about the "no links" suggestion.
"Checklist of WWWeb Design Errors" by Jorn 1996-01-05 http://www.robotwisdom.com/web/checklist.html
mozilla.org style guide by Jamie Zawinski http://www.mozilla.org/README-style.html some good advice for making web pages in general. pretty short.
NYPL: Style Guide http://www.nypl.org/styleguide/ "This Style Guide for the Branch Libraries of the New York Public Library explains the markup and design requirements for all Branch Libraries web projects, along with various standards and best practices. ... projects must be authored in structural XHTML 1.0 Transitional. ... In a perfect world, the library's website would be authored in XHTML 1.0 Strict ..."
web style guides http://www.steptwo.com.au/columntwo/archives/000456.html ??? [FIXME: read]
Before re-organizing your namespace, please read this. Often a person moves a file to a new URI (or deletes the file) without realizing how annoying it is to others when their links fail.
- ``Cool URIs don't change'' article by Tim Berners-Lee 1998 http://www.w3.org/Provider/Style/URI
- Campaign for Permanency in HTML http://www.geocities.com/Athens/6398/metacamp.htm
``a checklist you can use to evaluate a website.'' http://www.us-israel.org/jsource/eval.html (from the point-of-view of web site users)
``Thinking Critically about World Wide Web Resources'' by Esther Grassian, UCLA College Library http://www.library.ucla.edu/libraries/college/help/critical/ (from the point-of-view of web site users)
Dmitry's Design Lab http://www.webreference.com/dlab/ Graphics and design advice, some of it specifically applicable to web pages.
A HTML Pattern Language http://www.anamorph.com/docs/patterns/
MSDN Online Design Area http://msdn.microsoft.com/workshop/design/ "The Design area is the MSDN Online Web Workshop's online resource for creative professionals, with information on Web technologies, tools, color management, typography, and Web design." /* was http://www.microsoft.com/workshop/design/ */
http://www.webreference.com/
My principles of web design http://www.ckdhr.com/ckd/web/design/principles.html by Christopher Davis /* was Principles of Web Design http://www.kei.com/homepages/ckd/web-design/principles.html */
http://www.quadzilla.com/ nice tutorials on tables and style sheets. a long list of WYSIWYG HTML editors. Pointers to pre-written CGI programs and JavaScript programs.
http://members.tripod.com/~Rubea/ has a long list of other style guides.
http://www.projectcool.com/
HTML Bad Style Page http://www.earth.com/bad-style/
Composing Good HTML http://www.cs.cmu.edu/~tilt/cgh/ /* was at http://www.willamette.edu/html-composition/strict-html.html */
CERN's style guide for online hypertext (http://www.w3.org/pub/WWW/Provider/Style/Overview.html)
by Tim Berners-Lee http://www.w3.org/People/Berners-Lee/ includes links to Internet FAQ lists, RFCs (Request For Comments -- Internet standards etc), something called "Information by subject ". It has a wide variety of links much like this html.html page.
Guide to Web Publishing http://www.willamette.edu/wits/docs/official/htmlguide.html /* was http://www.willamette.edu/wits/docs/webguide.html */
Basic information on Web graphics design:
- http://www.webreference.com/dev/graphics/ .
- the-light.com http://the-light.com/netcol.html .
- /* was http://www.servtech.com/public/dougg/graphics/ */
- /* http://www.adobe.com/studio/tipstechniques/GIFJPGchart/main.html now offline ? */
lots of info on building web pages (including using forms to create interactive web pages). Pointers to lots of icon collections. "So to maximize the number of browsers that will show your images the way you want them seen, you should both fill your non-rectangular images with grey-192 (for those browsers that don't understand transparency) and set grey-192 as the transparent color for your GIFs. "
Web design tips http://scrtec.org/bright_sites/ offline ?
a step by step guide for using Homepage to set up a web page that uses information from a Filemaker database. http://scrtec-ne.unl.edu/SCRTECNE/TechTopics/tutorials/
the international "I Hate Frames Club" http://www.wwwvoice.com/hatefrm.html
Eric S. Raymond's *very* opinionated HTML style guide http://www.ccil.org/~esr/html-hell.html
http://www.interface-design.net/links-web.htm has a list of most of these links... just reference it ? user interface.

see also computer_graphics_tools.html#web for tools to help you make images, icons, buttons, etc. suitable for the Web.

let people use their own preferences

Let people use their favorite browser.

Let people use whatever width window they like.

Of course, this page is Best Viewed With Any Browser http://www.anybrowser.org/campaign/ . (which further points to lots more HTML advice pages). It quotes

"Anyone who slaps a 'this page is best viewed with Browser X' label on a Web page appears to be yearning for the bad old days, before the Web, when you had very little chance of reading a document written on another computer, another word processor, or another network."
-- Tim Berners-Lee in Technology Review, July 1996
"This page optimized for ... arguing with customers" by Jahn Rentmeister http://www.jahns-home.de/rentmei/html/opti.html describes why counting "hits" can be wildly inaccurate. Also hits the "Then we would have to provide and maintain a second version" myth. "If you don't tell people what's wrong, they won't be able to fix it." | mirror http://linuxmafia.com/faq/Web/opti.html /* was http://linuxmafia.com/~rick/opti.html */

"liquid design", "flowed"

Rather than force things to be a specific size and arrangement, some people prefer it when things automatically reflow to fit their screen.

"Liquid Design for the Web" Adrian Roselli http://www.evolt.org/article/Liquid_Design_for_the_Web/20/15177 instead of leaving users at low resolutions with a scroll bar at the bottom of their screen (requiring them to constantly scroll left-to-right-to-left to read content or see ads), or leaving users at high resolutions with large amounts of white space outside of your content, consider building pages that scale to fit the user. There are many advantages to this liquid design approach. (some browser-version-specific table information ... obsolete ?)
http://c2.com/cgi/wiki?CoordinateVersusNestedGui

General User Interface tips and traps

[FIXME: don't I have more stuff scattered elsewhere to put here ?]

Here I list user-interface ideas specific to web pages.

See also user_interface.html has general user-interface tips for all software (including web browsing).

Alan Cooper http://www.cooper.com/
http://www.osopinion.com/Opinions/ToddBurgess/ToddBurgess1.html
http://www.iarchitect.com/mshame.htm
Vincent Flanders http://www.webpagesthatsuck.com/ includes the Daily Sucker http://www.webpagesthatsuck.com/dailysucker/
Bruce Tognazzini http://www.asktog.com/

Bruce Tognazzini ... a recognized leader in human/computer interaction design. ... at Sun where he led the Starfire Project ... he founded the Apple Human Interface Group and acted as Apple's Human Interface Evangelist.
[FIXME: lots of articles here; read a few more.]
My thesis in Computer Science, published in 1967, argued that computers should be all-graphic, that we should eliminate character generators and create characters graphically and in various fonts, that what you see on the screen should be what you get, and that the human interface was more important than mere considerations of algorithmic efficiency and compactness.
...
Before creating the Mac project, I was Manager of Publications at Apple, and so for the Mac I was careful to insist that the excellence of the product extend, to use Horn's words, to "the unpacking instructions, the profusely-illustrated and beautifully-written manuals, ... tastefully packaged." Packaging was another major concern of mine ... ...
... Of course, he might well be correct when speaking from a programmer's point of view. However, I've always been more concerned with users. Programmers do their work but once, while users are saddled with it ever thereafter.
...
With regard to my thesis, its formal title was, "A Hardware-Independent Computer Drawing System Using List-Structured Modeling: The Quick-Draw Graphics System" Pennsylvania State University, 1967. ...
... Weinberg's ground-breaking "The Psychology of Computer Programming" was published in 1971. ...
...
A number of people asked for permission to redistribute my notes on the history of the Mac. Yes, so long as you are a not-for-profit organization or club and say "Copyright 1996 by Jef Raskin. Used by permission." If you make money from my writing, I should, too.
Copyright (c) 1996 by Jef Raskin. Used by permission.
-- Jef Raskin http://mxmora.best.vwh.net/JefRaskin.html
http://developer.apple.com/ has Apple Human Interface Guidelines [FIXME: re-read]
"Site Design." http://www.hesperian.co.uk/ia/ia_msc_chap5.asp " Student.Manchester, An Interactive Guide to the city, targeted at the sub 30yr. old demographic. An Insight into Web site design." by Martin John Allen, 1998. ... ??? http://www.hesperian.co.uk/sitemap.asp

every document has metadata

Every document has metadata associated with it. I suppose you could just memorize it -- but then, I suppose you could just memorize the contents of the entire file. Given that you're going to write it down somewhere, it's generally a very good idea (especially with program source code) to embed this metadata into the document itself, making a "self-describing file".

book.html#citation

David Cary plans to put *most* of these items of metadata into every document he creates.

``The system can learn a lot about each document just by keeping its eyes and ears open. If the associative retrieval system remembered some of this information, much of the setup burden on the user would be made unnecessary. The program could, for example, easily remember such things as

The program that created the document
The type of document: words, numbers, tables, graphics
The program that last opened the document
If the document is exceptionally large or small
If the document has been untouched for a long time
The length of time the document was last open
The amount of information that was added or deleted during the last edit
Whether the document has been edited by more than one type of program
Whether the document contains embedded objects from other programs
Whether the document was created from scratch or cloned from another
If the document is frequently edited
If the document is frequently viewed but rarely edited
Whether the document has been printed/faxed/emailed and where/to whom.
How often the document has been printed, and whether changes were made to it each time immediately before printing The retrieval system could find documents for the user based on these facts without the user ever having to explicitly record anything in advance. Can you think of other useful attributes the system could remember ?

''

-- p.106 _About Face_ (1995) book by Alan Cooper

"A full understanding of a program ... code ... [and] numerous idems of metadata describing the context in which a program was created and is used. Unlike comments, which usually describe a piece of a program, these metadata refer to the entire program. A partial list of program metadata:

Title of program
Author(s)
Further developer(s)
Maintainer(s)
Owner(s)
Publisher(s)
User(s)
names, faces, affiliations, postal and network addresses, and telephone and fax numbers for the above individuals
Location of source code (machine, directory, file(s) )
Version, revision number
Date and time of this version or revision
Date and time that the current listing was created.

Related to but distinct from the metadata are longer texts that describe the program, such as an abstract, statement of purpose, and history." -- p. 121, _Human Factors and Typography for More Readable Programs_ (1990) by Baecker and Marcus.

For documents that list addresses, each individual address should be dated as to when it was last confirmed. -- DAV

[FIXME: add information about file format header considerations computer_graphics_tools.html#file_formats here -- or link to discussion elsewhere] start compressed file with name, date, compression program, etc.

More about metadata:

"Metadata is nothing new" by Ned Batchelder http://nedbatchelder.com/text/metadata-is-nothing-new.html
make sure you capture with every document
- its acceptable distribution, [is it confidential ?]
- its creation date and ideally
- its expiry date.
Keep this metadata.
> -- Tim Berners-Lee 1998 http://www.w3.org/Provider/Style/URI.html
"Madhavan K. Nayar ... argues that a universal standard is needed to ensure that any compilation of data is complete and current and that its origin and accuracy can be traced. ... For a primer on the issues that might be addressed by an information integrity standard, go to http://chicagotribune.com/tech ."
How To Use Meta Tags http://searchenginewatch.com/webmasters/meta.html A very nice, short, explanation on how to use the 2 most important meta tags ("description" and "keywords") and a bunch of links to more information on other tags.
http://www.w3.org/Metadata/
http://mall-net.com/se_report/ discusses how to use "title" and meta tags to help people find your web pages from google.
W3 (at http://www.w3.org/TR/REC-html40/charset.html#h-5.2.2 and again at http://www.w3.org/International/O-charset.html ) recommends labeling every document with the character encoding used. For example, <META http-equiv="Content-Type" content="text/html; charset=EUC-JP"> or <meta http-equiv="Content-Type" content="text/html; charset=US-ASCII"> or <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> . See si_metric_faq.html#iso8859 for rants about the character set David uses.
http://whatis.com/meta.htm
http://www.pcwebopaedia.com/TERM/m/metadata.html
http://wombat.doc.ic.ac.uk/foldoc/foldoc.cgi?meta+data
http://www.geo.ed.ac.uk/agidexe/term?205
LINGO OF THE DAY
DECEMBER 24, 1999
metadata - Also known as file metadata - Also known as file attributes.
Includes information like the type of file, the file size, number of hard links to the file, inode number, timestamps (time of last access, time of last modification, and time of last attribute modification), mode flags, file ownership user and group Ids, and file permissions. When performing backups, it is often as critical to preserve metadata as it is to preserve file contents.
Source: Linuxcare reader Rob Hartly
http://wwhttp://www.linuxcare.com/news_columns/w.linuxcare.com/news_columns/ | mirror http://www.linuxcare.com/news_columns/lingo/archive_99december.epl
Dublin Core Metadata Initiative http://purl.org/DC/ has standardized a minimal set of metadata for HTML pages roughly equivalent to a library catalog card.
"METADATA is a registered trademark (Nos. 1,409,260 and 2,185,504)" -- http://cartome.org/metadata-domain.htm
"geo-sensitive meta tags" http://gigablast.com/tagsdemo.html has some interesting ideas for meta tags:
```
<meta name="zipcode"        content="87112,87113,87114">
<meta name="city"           content="albuquerque, abq, rio rancho">
<meta name="state"          content="new mexico">
<meta name="country"        content="usa, united states of america">
<meta name="classification" content="products,product">
```
However, the "author" and the "language" tag seem redundant -- what does <meta name="language" content="english"> give me that <html lang="en-US"> does not ? What does <meta name="author" content="matt wells"> give me that
David Cary feedback.html
d.cary+72@ieee.org.
does not ?

backlinks

backlinks are a nifty tool that you might consider adding to your web pages.

Using "view source", you can copy and paste these "forms" into your own pages.

The Backlinks Page Web Enhancement Project http://www.foresight.org/WebEnhance/backlinks.news.html suggests that it might be cool if every page had backlinks:

known by AltaVista

known by Yahoo!

known by infoseek
Link Popularity http://www.linkpopularity.com/ also checks who links to your web pages.

DOCTYPEs

The tip "Don't forget to add a doctype" http://www.w3.org/2001/06tips/Doctype points to

"Fixing Your Site With the Right DOCTYPE" article by Zeldman 12 April 2002 _A List Apart_ http://www.alistapart.com/stories/doctype/ explains "Why use a DOCTYPE?" and has a nice list of "DOCTYPEs that work"
List of valid DTDs you can use in your document http://www.w3.org/QA/2002/04/valid-dtd-list.html

All the following are listed on the 2nd reference ...

Currently I use something like

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
	"http://www.w3.org/TR/html4/strict.dtd"
>

I'm thinking about switching to

<!DOCTYPE html PUBLIC
    "-//W3C//DTD XHTML 1.1 plus MathML 2.0 plus SVG 1.1//EN"
    "http://www.w3.org/2002/04/xhtml-math-svg/xhtml-math-svg.dtd">
to give me XHTML + MathML + SVG.

But it's so annoying to have to add all those </p> paragraph-end tags that http://www.w3.org/TR/2000/REC-xhtml1-20000126/ says is required under XML.

CSS: cascading style sheets

CSS and printing to paper

NYPL: Style Guide: CSS http://www.nypl.org/styleguide/css/ "... CSS ... Steal These Style Sheets! Style Sheets for your use ..."
CSS beyond the browser: Going to Print http://www.alistapart.com/stories/goingtoprint/ looks good for stuff that will be printed (resumes ...) *and* has a nifty way to make every link visible [FIXME: this would save me time ... perhaps make this optional like http://www.meyerweb.com/ui/setup.html ]

other CSS

[FIXME: ]

[FIXME: read http://www.usemod.com/cgi-bin/mb.pl?CascadingStyleSheets and http://www.usemod.com/cgi-bin/mb.pl?UserStyleSheets ]
css Zen Garden: The Beauty in CSS Design http://csszengarden.com/ some very pretty examples of the power of CSS. (DAV first heard about this from Tia Toh http://www.pixelsandwidgets.com/weblogs/archives/000171.shtml )
http://csstheme.manilasites.com/ which links to
- CSS Layout Techniques http://glish.com/css/ has a good explaination of how to use CSS to get many different column styles (*without* any tables). The simplest column style seems to be 2 columns ( fluid left column, fixed-width right column ), or a nested, floated menu in the upper right ( does menu have to be fixed-width ? rest of page fluid ).
  Also points to CSS tutorials, CSS validators, CSS reference pages, CSS web tips.
- A List Apart http://www.alistapart.com/ The ``previous'' issues discuss lots of usability issues, including
  10 Tips on Writing the Living Web http://www.alistapart.com/stories/writeliving/ inspirational
  ``From Web Hacks to Web Standards: A Designer's Journey: a CSS redesign in five easy pages'' by Jeffrey Zeldman http://www.alistapart.com/stories/journey/
  ``To Hell With Bad Browsers'' by Jeffrey Zeldman http://www.alistapart.com/stories/tohell/ an explaination of why writing to standards (so your writing looks good on future browsers) is better than writing to the quirks of past browswers.
- Eric A. Meyer http://www.meyerweb.com/eric/css/edge/ ``a cry for creativity... a rejection of what's practical in favor of what's possible.''
get started with cascading style sheets http://www.cnet.com/Content/Builder/Authoring/CSS/
http://www.hypermedic.com/ has a tutorial on "style sheets" (CSS), and information on XML and Typography.
"Position Is Everything: A compendium of CSS positioning bugs. Purpose of this site: To explain some obtuse CSS bugs in modern browsers, provide demo examples of interesting CSS behaviors, and show how to 'make it work' without using tables for layout purposes." http://www.PositionIsEverything.net/ includes "Three Column Stretch: A way to achieve 3 full-length liquid columns with header and footer, all with different backgrounds." "Please feel free to take, use, and if need be, abuse all code found on this site."

About CGI

Using CGI programs that have already been set up

"forms" are sections of HTML files that let users send information back to the server.

A "CGI" script accepts that information and does something with it.

There are already many public machines with CGI scripts you can use, just by putting the appropriate form in your HTML files. ("remotely hosted CGI")

http://Freedback.com/ has free "feedback email" CGI scripts available for anyone to use.
http://www.hosted-forum.com/ has a free "community discussion forum" for anyone to create their own forum.
feedback forms test_sendmail.html
guestbooks
Backlinks #backlinks
Check Your Browser's Headers http://grc.com/su/earthlink.htm tells you lots of information about a web browser (what your local computer is sending out)
What's that site running ? http://www.netcraft.com/cgi-bin/Survey/whats?host=www.aptfone.com;port=80 tells you lots of info about a web server (host site).
Find out exactly what your browser is sending to the server http://hoohoo.ncsa.uiuc.edu/cgi/examples.html
Why is this connection so slow ? http://www.forth.org/slow.html has a CGI script that does a traceroute (times all the hosts between you and the web server). Includes source code (in Forth).

Please check with your own ISP and see what other local CGI scripts you have available to you. Nearly all of them have a CGI script you can use for feedback forms, usually called something like "mailform" or "mailto". ("mailto.pl" is a Perl script written and freely distributed by Doug Stevenson http://www-bprc.mps.ohio-state.edu/mailto/mailto_info.html ). On my system, documentation is at http://rdrop.com/tools/www/mailform/

Writing and setting up your own CGI scripts

Perhaps these CGI scripts don't do exactly what you want. Then you might be able to write your own CGI script to handle the information returned by forms. This section talks about writing server-side programming. Most ISPs do not allow you to write server-side programs -- so this section will be utterly useless to most people.

see

"CGI Made Really Easy or, Writing CGI scripts to process Web forms" by James Marshall http://www.jmarshall.com/easy/cgi/ the code properly "allow ";" as well as "&" for parameter separators"

I far prefer <form method=GET action="..."> over the <form method=POST action="..."> as a form submission method http://www.w3.org/TR/REC-html40/interact/forms.html#submit-format , because it's so much easier to debug. Also, GET allows you to easily do clever things like make some links that have common selections pre-filled.

( "The use of POST rather than GET is a ... obstacle ... If you're starting from scratch and want to use SOAP, make sure your SOAP toolkit supports the HTTP GET mode of access." -- The Beauty of REST by Jon Udell March 17, 2004 http://www.xml.com/pub/a/2004/03/17/udell.html )

To understand how CGI works, it helps to know a little about the standard HTTP protocol. Normally, to fetch a standard HTML page (for example, http://www2.okstate.edu/ ) your local web browser opens a connection with the remote server on port 80. This indistinguishable (at the server end) from you typing

  telnet www2.okstate.edu 80

from your command prompt. Then the browser requests the specific page (in this case, "/") exactly like you typing

  GET / HTTP/1.0

(note that you *must* type 2 return characters, "\n\n" after the GET command) and then your web browser waits for the response. By now, if you've been following along, the remote server has replied with something like

HTTP/1.0 200 OK
MIME-Version: 1.0
Server: WebSTAR/2.1 ID/32004
Message-ID: <b1190587.43075@www2.okstate.edu>
Date: Sun, 01 Mar 1998 20:05:53 GMT
Last-Modified: Wed, 11 Dec 1996 22:10:34 GMT
Content-type: text/html
Content-length: 3375

<P><B><FONT SIZE="+1">

[rest of file snipped to save space] .

Note the blank line ( 2 return characters, "\n\n") between the end of the "headers" the server sent you, and the start of the actual text in the file (the first character in this file is "<").

Then the remote server breaks the connection. (With HTTP/1.1, the connection remains open, so you can request and receive multiple files over the same connection).

When you write your own CGI script, it's often a good idea to run it from the command prompt to make sure it is printing the right things to STDOUT -- minimally, it needs to duplicate the "Content-type" line and the 2 returns after it; my code looks something like gather.cpp.

If your CGI returns a image file, the sequence looks more like this:

You, the browser, do

  telnet thomppj.student.okstate.edu 80

and you get the response

  Trying...
  Connected to thomppj.student.okstate.edu.
  Escape character is '^]'.

Then you type

  GET /~thomppj/images/plant.gif HTTP/1.0

(note that you *must* type 2 return characters, "\n\n" after the GET command), and the web server responds

HTTP/1.1 200 OK
Date: Sat, 29 Nov 1997 05:07:23 GMT
Server: Apache/1.3b2 Debian/GNU
Last-Modified: Wed, 26 Nov 1997 20:22:30 GMT
ETag: "5768-6a12-347c8506"
Content-Length: 27154
Accept-Ranges: bytes
Connection: close
Content-Type: image/gif

GIF89a.....}

[rest of file snipped to save space] .

Note the blank line ( 2 return characters, "\n\n") between the end of the "headers" the server sent you, and the start of the actual text in the file (the first character in the file is "G").

More about CGI:

"A C++ Class for writing POST-style CGI-bin scripts" http://www.cs.umd.edu/~bederson/cgi.html This freely available "class" makes writing CGI-bin programs much easier.
GNU Cgicc http://www.fsf.org/software/cgicc/cgicc.html is an ANSI C++ compliant class library that greatly simplifies the creation of CGI applications for the World Wide Web.
Intro to CGI scripting http://www.cc.ukans.edu/~acs/docs/other/forms-intro.shtml
The Common Gateway Interface http://hoohoo.ncsa.uiuc.edu/cgi/

DAV: I think the "proper" method of decoding is: 1. Break apart into items (each attribute ... [FIXME] How the characters will be escaped when received by the CGI program: http://www.w3.org/TR/REC-html40/interact/forms.html#h-17.13.4

Here are some CGI programs that you might want on your server:

search tools so users can do text searches on all the documents on your site.
- Personal Library Software (PLS) http://www.pls.com/ (freely downloadable executables and documentation)
Matt's Script Archive http://scriptarchive.com/ has Free Perl CGI Scripts and Free C++ CGI Programs

ampersands, semicolons, and CGI scripts

A common error in CGI scrips and links that refer to them:

RFC 1866 (``Hypertext Markup Language - 2.0'') by Tim Berners-Lee himself ftp://ftp.isi.edu/in-notes/rfc1866.txt section "8.2.1. The form-urlencoded Media Type" ``encourages'' CGI authors to support `;' in addition to `&' way back in 1995.
"CGI implementors support the use of ";" in place of "&" to save authors the trouble of escaping "&" characters in this manner." http://www.w3.org/TR/REC-html40/appendix/notes.html#h-B.2.2
"CGI implementors are encouraged to support the use of `;' in place of `&' " http://www.w3.org/MarkUp/html-spec/html-spec_foot.html#FOOT26

Date: Sat, 21 Feb 1998 18:32:22 -0500 (EST)
From: Gerald Oskoboiny <gerald@w3.org>
To: www-html@w3.org
Subject: Re: Validation difficulties
Sender: www-html-request@w3.org

On Sat, 21 Feb 1998, Rob wrote:

> I've got a dilemma with the following
>
>   <A HREF="page.cgi?arg1=val1&arg2=val2">link</A>
>
> NSGMLS sys (rightly I assume) that &arg2 is not a valid entity and
> returns an error.

Yes, this has been a known problem for some time. I've been
meaning to write it up in detail sometime, but haven't yet.
In the meantime, see:

    http://www.cs.duke.edu/~dsb/kgv-faq/errors.html#bad-entity
and
    http://www.w3.org/MarkUp/html-spec/html-spec_foot.html#FOOT26

> So how can one get around this problem? Or should I just ignore it.

You can replace the '&'s in the href with '&amp;', or try using
';' as separators instead of '&' (a well-written CGI script will
allow you to use ';' instead of '&' between parameters; if this
CGI script doesn't, send e-mail to the authors, or if you are
the author, change it yourself.)

...
Hope this helps,

Gerald
--
Gerald Oskoboiny              <gerald@w3.org>  +1 617 253 2920
System Administrator, W3C     http://www.w3.org/People/Gerald/
World Wide Web Consortium, MIT Laboratory for Computer Science
545 Technology Square,  Room NE43-353  Cambridge MA  02139 USA

same name

Think you are unique ? Think again -- do a web seach on your name. Click on "people finder" http://webcrawler.com/ [FIXME:] and type in your name -- I think you'll be surprised at how many of you there are.

other people finder tools search_tools.html#people

I think it is a good idea to provide links to other people with the same name -- I am really impressed with these pages:

Eric Smith http://www.catsdogs.com/wwwesmith.html
John Bailey http://www.frontiernet.net/~jmb184/ The "John Bailey Crossroads"
The International Eric Jacobsen Page http://home.earthlink.net/~ejacobsensprint/whichej.htm
http://neal.gafter.com/

Here is my attempt to follow my own advice: david_cary.html .

misc

WWW Frequently Asked Questions: web browsers http://www.boutell.com/openfaq/browsers/ CGI Programming http://www.boutell.com/openfaq/cgi/ /* was http://www.boutell.com/faq */
Netscape's position on its HTML extensions http://home.mcom.com/assist/net_sites/html_extensions.html
Standard for Robot Exclusion
The META tag: Controlling how your page is indexed
www.sandia.gov/sci_compute/html_ref.html .

Once you have some content up, you might think about ways to let someone comment on your work. Of course you *will* put your email address at the bottom of every page, so people can directly respond to you with ideas for improvement and answers to the questions you raise. You can also create a feedback form which is easier for some people to use.

Even cooler than the feedback form is a "guest book", a page that automatically takes comments and puts them on your web site. This is done via "CGI", but if you have one of those annoying sysadmins who don't have a guestbook CGI already set up and who refuse to allow you to set one up, you can still use

Dreambook, a free guestbook server http://www.dreambook.com/ People type comments which are added to the list.
CritSuite: Critical Discussion Tools for the Web http://crit.org/ a more sophisticated commenting mechanism.
Other ways to create CritSuite annotations http://discuss.foresight.org/~pcm/other_writers.html /* was http://crit.org/~pcm/other_writers.html */

www subdomain

The "www" in a URL:

"Why is www. deprecated? Succinctly, use of the www subdomain is redundant and time consuming to communicate. The internet, media, and society are all better off without it." http://no-www.org/
"www. is NOT deprecated" http://www.hm2k.com/articles/yes-www "You can use both as long as you redirect your traffic from one to the other (eg: example.com redirects to www.example.com)."
"www. is not deprecated" http://www.yes-www.org/www-is-not-deprecated/ "No-www" http://en.wikipedia.org/wiki/No-www
The original wiki briefly mentions this issue
"Why "www."?" by Tim BL waffles on which is better. http://www.w3.org/Provider/Style/www.html

[FIXME: todo: consider adding this "no-www" icon to http://david.carybros.com/ ... i.e., index.html]

unsorted

GNN press makes shareware WYSIWYG HTML editor for Mac, PC, Unix platforms. HTML put "I have, I want" into my Web page; perhaps even into my .signature. Date: Tue, 11 Jun 1996 00:00:08 -0400 (EDT) From: transhuman at umich.edu Subject: >H Digest ... From: "Stephen de Vries" <PHEN at wwg3.uovs.ac.za> Subject: >H 3D Text. Transhuman Mailing List .... Stephen de Vries < Information state > I have : Memory techniques, beginner Delphi, C++, evolution, memes. I want : Practical Chinese, evolution, artificial life.

consider making links to "I wish I had time to create a page on this topic -- would you like to help ?" pages.

"No dead pages"

The Instant Home Page site http://banjo.Cise.nsf.gov/ihp/ihp.html lets the casual user type in the info needed to create a home page ... and the completed form is then saved to your computer.

Tulsa Computer Society: Internet SIG. http://tcs.org/internet.htm Lots of web site tutorials and development links !

Free demo version of Dreamweaver http://www.macromedia.com/software/dreamweaver/productinfo/roundtrip/ OSU College of Arts & Sciences' Web Team. http://wrigley.okstate.edu/

Html Writers Guild http://www.hwg.org/ has help lists for different skill levels.

http://www.creativegood.com/help/ ???

Usable Web: Guide to Web usability resources http://usableweb.com/ human factors, user interface issues, and usable design specific to the World Wide Web.

"ONLY FREEWARE" http://www.cias.net/sawicki/ Includes a long list of HTML tools -- HTML editors, link checkers, books on Java, Java development tools, etc.

http://www.coolnerds.com/

Why server side processing is evil http://www.mired.org/home/mwm/no-ssi.html /* was http://www.phone.net/home/mwm/no-ssi.html */

useit.com: Jakob Nielsen http://www.useit.com/ has good information including "The Alertbox: Current Issues in Web Usability", "the Death of File Systems", "The Anti-Mac Interface", "how people read on the Web". In particular, http://www.useit.com/alertbox/20000416.html makes the interesting statement that

The extra choice requires extra thinking, and the time saved by using an optimal interaction technique is often smaller than the time wasted on having to think instead of just moving ahead with a single interaction technique that is always used. It takes at least one second and often two seconds to decide between two possible interaction techniques which is why it is usually better not to offer users a choice.

possibly related humor: "Care and Feeding of Web Pages" http://homes.jcu.edu.au/~imla/web.html "How to Report Software Bugs (Programmer's version)" http://homes.jcu.edu.au/~imla/drivelbug.html "Please Don't Feed the Engineers" http://homes.jcu.edu.au/~imla/eng.html /* was http://www.jcu.edu.au/~imla/web.html , http://www.jcu.edu.au/~imla/drivelbug.html , http://www.jcu.edu.au/~imla/eng.html */.

http://www.aesthetic-images.com/ebuie/ "Software Usability" "photography"

The HTML Terrorist's Handbook : Composing Evil HTML http://www.zikzak.net/~acb/hacks/htmlth.html being a Guide on the usage of HTML as an offensive weapon. [FIXME: Has this gone offline ?]

The HTML Hell Page http://earthspace.net/~esr/html-hell.html

Fred Langa's HotSpots and BrowserTune, "browser test and tuneup site". http://www.winmag.com/flanga/ mirror http://www.browsertune.com/flanga/

[html.html] Useful WWW sites regarding hypertext, hypermedia, and world wide web. http://www.humanities.mcmaster.ca/hypertext.places.htm includes: "What is hypertext and hypermedia ?" "Hyperfiction" "Bartlett's Quotations" " Strunk's Elements of Style " "Creating High Impact Documents: A guide to visually sophisticated documents from Netscape." [html.html] Guides to Writing HTML http://union.ncsa.uiuc.edu:80/HyperNews/get/www/html/guides.html Pointers to lots of guides on HTML.

A Style Guide for online hypertext http://www.w3.org/Provider/Style/Overview.html has very persuasive arguments for the "Why ?" button.

The proper way credit graphics you use. http://www.ttlhost.com/tananda/setoftheweek.html The proper way to add graphics to your web pages. http://www.widowsweb.com/widows/plea.html

A Webmasters Smorgasbord of Free Resources http://smorgasbord.freeservers.com/

tidy.c - HTML parser and pretty printer http://www.w3.org/People/Raggett/tidy/ Actually *fixes* some of the most common errors, and tells you about errors it finds that it doesn't know how to fix. (free source code from this web page, as well as binary executables for a variety of platforms). Looks pretty useful, for both

(a) cleaning up ugly markup generated by some "to HTML" translators
(b) catching and fixing common errors in hand-written HTML.

. (other source code here for related parsing tools). [FIXME: don't I have other ``pretty printer'' info I could collect in one section ?] [FIXME: do I have more pretty printer links ? idea_space.html#translation ]

WWW and HTML Documentation http://oneworld.wa.com/htmldev/devpage/dev-page.html yet another tutorial / reference page. long list of links to HTML converters and HTML editors.

HTMLGoodies.com http://www.mcp.com/publishers/que/authors/joe_burns/ and http://www.htmlgoodies.com/ ???

Why web usage statistics are (worse than) meaningless http://www.cranfield.ac.uk/docs/stats/

Har's Quick-n-Dirty JavaScript Center http://sklarnet.com/js/jsc.htm "real-world JavaScript ideas and examples with the emphasis on useful and functional instead of cute-but-ultimately-useless."

web design Tips and Tricks http://sklarnet.com/LN/webtips.htm

Jeffrey M. Glover http://jeffglover.com/ wrote "Top Ten Ways To Tell If You Have A Sucky Home Page" http://jeffglover.com/sucky.html "Don't have a hissy-fit if something from your web site is on the list. I encourage you to create your web site however you want, regardless of what some doofus says is "sucky" or "a don't"!"

I used to subscribe to W3C World Wide Web Mailing Lists http://www.w3.org/Mail/Lists.html David Cary subscribed to the "www-html" www-html@w3.org To subscribe, please email www-html-request@w3.org with subscribe as the subject.
http://search.w3.org/Public is a dedicated search engine for the w3.org mailing lists.

Vijay Mukhi's technology cornucopia http://www.vijaymukhi.com/ lots of tutorials and information and source code: Java, socket programming, ActiveX, JavaScript, Netscape Plug-ins, TClets (TCL/TK programs which can be used on the internet via Sun's Netscape plug-in) "I want to spread this program globally, so that the whole world can marvel at my genius."

tools to help optimize a graphic so it downloads very fast from your web page yet still looks reasonably good.

The Web Standards Project: Fighting for Standards in our Browsers http://webstandards.org/

ECMAScript Language Specification http://www.el-mundo.es/internet/ecmascript.html

Standard ECMA-262 ECMAScript Language Specification http://www.ecma.ch/stand/ecma-262.htm

ECMA: Standardizing Information and Communication Systems http://www.ecma.ch/

AOLpress http://www.aolpress.com/ seems like a pretty nice WYSIWYG HTML editor (I really like the "check links" feature). Macintosh and Windows versions available. Note that you do *not* need a AOL subscription to get this free software.

http://www.cnet.com/Content/Reviews/Compare/Browsers4/ss01a.html latest web browser comparisons (Microsoft Internet Explorer v. Netscape Communicator)

DAV: I often use the term "URI" ( http://www.w3.org/TR/REC-html40/types.html#type-uri ). "URI" is not a typo.

WTS - Web Tree Scanner is a program to visualize the tree of a WWW server and check the links. http://www.fsai.fh-trier.de/~schmitzj/Xclasses/programs.php?prg=wts published under the GPL. /* was http://www.fsai.fh-trier.de/~schmitzj/Xclasses/programs.html */

Open Group http://www.opengroup.org/ ???

Web Page Evaluators http://www.cgu.edu/degrade/evaluators.html

http://cgi-lib.standford.edu/cgi-lib/ "is the home page for the cgi-lib.pl library which is ... the de facto standard library for creating cgi scripts in the Perl language. It also contains good descriptive information, examples"

"The Language of the Internet" article By Eddie Rabinovitch http://www.comsoc.org/ci/public/1998/feb/internet_column.html /* was http://pubs.comsoc.org/ci1/public/1998/feb/internet_column.html */

"Improving Your Web Site: Tools, Ideas, and Gizmos" article by Eddie Rabinovitch http://pubs.comsoc.org/ci1/public/1999/aug/

HTML Standards Compliance - Why Bother ? http://wdvl.com/Authoring/HTML/Standards/
Pages Optimized for Lynx http://www.crl.com/~subir/lynx/enhanced_pages.html
Lynx Friendly http://www.cs.umanitoba.ca/~djc/personal/lynxfriend.html "Simplicity, carried to an extreme, becomes elegance." -- Jon Franklin [FIXME: add "Lynx friendly" tag to my pages ?]
Jeff Pierce shares some HTML design tips at http://www.msu.edu/~pierce16/design.htm
http://www.ieee.org/web/developers/ has some web site design tips, recommended web tools, keyword and tagging guidelines, web templates, etc.
"The Alertbox: Current Issues in Web Usability" column by Dr. Jakob Nielsen http://www.useit.com/alertbox/ has some pretty good design tips. I especially liked "Fighting Linkrot" by Jakob Nielsen http://www.useit.com/alertbox/980614.html ("linkrot contributes to dissolving the very fabric of the Web"), and "Why You Only Need to Test With 5 Users" by Jakob Nielsen http://www.useit.com/alertbox/20000319.html .
Alan Cooper http://www.cooper.com/ Lots of useful information about user interface design. [FIXME: pull all UI interface links together in one section ?]
Niall Murphy's User Interfaces for Embedded Systems http://www.iol.ie/~nmurphy/

[Is this a useful tool ?]

Date: Sun, 24 May 1998 06:49:38 -0500
To: christlib@swcp.com
From: Dave Babbitt 
Subject: Re: Christlib: Entries in market of Christlib pages
Reply-To: christlib@swcp.com


[snip]

>Under Construction: http://www.swcp.com/dsc/

[snip]

You aught to check out UserLand Frontier:

What Is Frontier? (http://www.scripting.com/frontier5/whatIsFrontier.html)

Web Tutorial (http://www.scripting.com/frontier5/tutorials/web/default.html)

HALO Renderer (http://www.techsoln.com/frontier/HALO/)

Siteliner (http://www.macrobyteresources.com/scripting/frontier/html/siteliner.html)

It would automate the management and production of that site beautifully!


Dave Babbitt

Check out http://www.babbitt.org/

http://www.wolinskyweb.com/web101/resources.htm links to "web page design" articles; "HTML editors", etc much like this page. Does that make this page irrelevant ?
"The 18 Commandments of Good Web Design" by Jeysie http://sscn.virtualave.net/jeysie/commands.html
the Bandwidth Conservation Society http://www.infohiway.com/faster/ [FIXME: ... add to my musings on functional data compression]
[FIXME: is this the same as Bandwidth Conservation Society http://paxar.bc.ca/bpc/Bandwidth_Stuff/ ? ]
TWiki http://twiki.org/ looks very cool. "What is TWiki? The TWiki web is a web based collaboration tool. ... if you know how to fill in an HTML form, you already know how to create and change documents in TWiki. ... TWiki eases one of the concerns about classic Wiki, which is that the radically egalitarian "edit this page" scheme leaves no change log. TWiki includes powerful revision support. Every change leaves a footprint, and you can follow these easily and effectively."
Why even have a website http://www.csn.ul.ie/~caolan/Personal/Design.html (also describes some interesting website content management tools)
http://freespace.virginnet.co.uk/davidb.meiklejohn/tech/ has a few words and good reference links to HTML, Cascading Style Sheets, CGI, SVG (Scalable Vector Graphics).
eFUSE.com http://www.efuse.com/ ``the friendly place to learn how to build a better web site''
http://home.earthlink.net/~thomasareed/pixelpen/ has some web publishing tips, including some advanced stuff like ``Writing CGIs in C'' ``Writing CGIs in Frontier'' http://home.earthlink.net/~thomasareed/pixelpen/6_powertips/cgi/index.html (which actually just links to other sites ...)
http://www.wordsinarow.com/ tells how to register your website by hand with many search engines and directories. Also has interviewing services (for a fee, they track down people and interview them for magazine articles or customer feedback ...) ``If you do HTML coding by hand ... NoteTab Pro, available from www.notetab.com ... a FREE "lite" version you can download and try ... We prefer NoteTab Pro to any other program for writing HTML, and we've tried a bunch of them.''
Designing Accessible Web Pages http://www.uic.edu/depts/accc/webpub/webaccess.html
Webpage design flaws: Mistakes to avoid http://amasci.com/mistake.html Top Ten Mistakes in Web Design
[CGI]
http://developerlife.com/ has lots of source code (with documentation) related to web servers, CGI, XML, etc.
http://slashdot.org/articles/01/03/20/1423223.shtml ``I thought webdesigners job was to care about style. Journalists should take care of content.'' -- Fredrik Borg
[more information your browser leaks] http://jonathanclark.com/where.php
``Paper Layout Document: Layout professional papers with HTML'' by Alex Nicolaou http://www.cgl.uwaterloo.ca/~anicolao/layout.html claims that it's possible to write professional quality publications in HTML. DAV: seems reasonable, but is putting headings inside tables really a good idea ?
If you want to put paper written books online, consider using Theological Markup Language (ThML) http://www.ccel.org/ThML/ .
Technical Writers Anonymous http://www.technicalwritersanonymous.com/ [YARMAC ?]
What search engines like to see in your web pages http://www.searchengines.com/searchEnginesRankings.html ... a descriptive title ...
Web Developer's Virtual Library: Encyclopedia of Web Design Tutorials, Articles and Discussions http://www.wdvl.com/
http://docs.literacytent.org/web_authoring/ seems to be a good intro to learning HTML and creating a web site ... tells up front that he's heavily biased in favor of writing raw HTML code in a text editor. I like Steve Linberg's emphasis on the seperation of form and content, paintings and museum, etc.
``640 x 480 Isn't Dead Just Yet'' article by Adrian Roselli http://www.evolt.org/article/640_x_480_Isn_t_Dead_Just_Yet/22/275/

the real problem with designing outside of the 640x480 box isn't really the 480 height, since most users are accustomed to scrolling down, but the width. Many people never notice the scrollbar on the bottom and those that do resent having to scroll left to right to left to right, etc, just to read your content or navigate your site.
... Keeping lines of text around 30-70 characters offers the best readability for the widest variety of users. This holds true on the web as well as in print, where hundreds of years of printed text has taught professionals that very same lesson.
Comments near the end include

I have to surf at 800x600 and really hate to scroll horizontally
...
Video display technology has come a long way ... the web cannot be held back by these stragglers. ... I hate coming across these obsolete, narrow, "childrens storybook" websites! Even with a trackball, navigating through their wasted space is annoying.
...
if you check any of my sites, you would see that they all work in pretty much every size. ... for me, part of the fun and challenge of designing for the web is writing code that scales with the page. the same for the window.
...
the idea of 'fluid' design, a design that resizes to fit any resolution.
...
use liquid layouts so the user can choose the width.
The Eggman's Guidelines for a Successful Website http://www.the-eggman.com/seminars/webtips.html mentions

``Design Your Pages For a 640 x 480 Resolution Screen ... If You Design For a Higher Resolution Screen, Visitors May Have to Scroll Right and Left to See The Entire Page. This is Very Irritating. Even visitors with higher resolution screens often browse with a smaller browser window''
...
The Internet is more than the Web -- It's E-Mail, Auto-responders, Interactive Mailing Lists, Discussion Groups, Collaborative Computing and more.
Writing for the Web: Some Quick Tips http://www.the-eggman.com/writings/webwriting.html
http://www.wdvl.com/Authoring/Design/Pages/something.html ``horizontal scrolling should be avoided if at all possible.'' also lists some style manuals
http://www.wdvl.com/WDVL/Website/Design/details.html claims that black text on white background leads to eyestrain compared to black text on pale yellow background. DAV is sceptical.
Web Developer's Virtual Library: Encyclopedia of Web Design Tutorials, Articles and Discussions http://www.wdvl.com/ is totally overwhelming to someone who just wanted to put a simple web page up.
Design Not Found http://www.37signals.com/dnf/ good user interface stuff
http://www.sheldonbrown.com/computer-net-links.html ???
http://websiteowner.info/tutorials/html/formelements.asp UI ???
http://ou800doc.caldera.com/SDK_vtcl/CTOC-vtclgN.style.html UI ???
Chris Johnson http://www.dcs.gla.ac.uk/~johnson/ UI ???
[CGI programming] The Common Gateway Interface - RFC Project Page http://cgi-spec.golux.com/ If you want to improve the standards that CGI is based on, this is the place.
Radulian Pop http://tornado.brevard.edu/poprf/research.htm seems to like this page. That makes me happy :-).
HTML Style Guides http://www.bgsu.edu/departments/tcom/style.html [FIXME: read ...]
``Characterization and Assessment of HTML Style Guides'' paper by Julie Ratner, Eric M. Grose & Chris Forsythe http://www.acm.org/sigchi/chi96/proceedings/intpost/Ratner/rj_txt.htm [FIXME: read] (this is a meta to this document)
Q: HOW DID YOU GET SO MANY HITS ON YOUR PAGE? http://amasci.com/faq.html#hits has some interesting ideas, very similar to DAV's philosophy of web page development.

3. Make your website be your filing cabinet. If you have little projects underway, put them on your website while working on them. Reject the paper-publishing traditions of polishing an article to perfection before publication. Instead, type things directly into your site in rough draft form (lable them UNDER CONSTRUCTION).
Expunge the fear of embarassment from your life, and instead practice making foolish mistakes in front of thousands of strangers. Stop using your PC to store files, instead use your website as your main storage. Let people poke through your filing cabinet. It will contain far more than a perfectly polished website does.
...
7. Always add a link to the top of all of your pages which links back to your main site.
-- Bill Beaty
[FIXME: also has many other useful ideas ... perhaps I should implement them on my web site.]
Webpage design flaws: Mistakes to avoid http://amasci.com/mistake.html
to see the ``source'' of a page *after* all the little JavaScript things have run, check out http://bugzilla.mozilla.org/show_bug.cgi?id=55583
The correctness rating scheme at http://twiki.org/cgi-bin/view/Wikilearn/PageStatus looks interesting. More at http://twiki.org/cgi-bin/view/Wikilearn/GettextResources#Rants
feedback form: "How do I submit forms by e-mail?" http://intranetjournal.com/faqs/jsfaq/how4.html
``Choosing a Web Designer'' by Walter Ian Kaye (suggested interview questions) http://www.natural-innovations.com/boo/CaWD.html
IEEE standard updates practices for building and managing web sites: standard can help raise productivity and lower costs for site operations, while making sites easier to use and more credible http://standards.ieee.org/announcements/2001rev.html

The standard, IEEE 2001(TM), "Recommended Practice for the Internet - Web Site Engineering, Web Site Management and Web Site Life Cycle," defines guidelines for intranet and extranet pages that improve productivity, reduce costs, and make sites easier to use and more credible.
The standard offers a variety of best practices and can help reduce liability associated with web site development and operation. One part of the standard recommends disclosure information to be used on all sites, such as who created the site, its legal address, and the date of its last substantive update. ... developed in collaboration with Consumers Union
[why DAV prefers GET over POST] URI URL http://www.w3.org/Addressing/

...
It is also unfortunate that, for example, headings in HTML documents are not addressable unless they are marked up as anchors explicitly.''
...
The requirement to have all resources in a hypermedia system addressable was identified long ago in Douglas Engelbart's seminal paper (see also, An Evaluation of the World Wide Web with respect to Engelbart's Requirements). The ability to make a reference to a resource with a URL enables linking, searching, and a variety of navigation and access techniques.
Some services make information available via the web, but not addressable. For example, results of database queries using POST (rather than GET) are not addressable. A items in a catalog put on the web this way can't be linked to, and cannot participate in third-party search services. This unfortunate choice by some information providers reduces automation and scalability in the web.
``Designing for Multiple Browsers Without Being Bland'' by Stephen Traub http://www1.shore.net/~straub/wprmultb.htm [FIXME: how did he get nice columns ?]
``Elements of Style for Web Design'' by Christine A. Quinn http://www.stanford.edu/~cquinn/papers/bostonpaper.html [FIXME: reread. Good ideas for web sites].
http://linuxmafia.com/~rick/essays/newlug.html has lots of good tips on maintaining web pages. ``Use referral pages.'', with an example of the exact HTML code to use.
The Robots Exclusion Protocol http://www.robotstxt.org/wc/exclusion.html describes the robots META tag that a HTML author can use to ask robots not to scan a particular page. [meta ?]
``Guidelines for Designing a Good Web Site for ESL Students'' by Charles Kelly http://iteslj.org/Articles/Kelly-Guidelines.html
``10 Netsurfing Tips: How to Save Time & Avoid Frustration Tips to Make Your Websurfing Life Easier.'' by Charles I. Kelly 1998 http://aitech.ac.jp/~ckelly/midi/help/surftips.html recommendations for people on a slow (or expensive) connection:

Disable Auto-load Images
Disable JavaScript
Always Choose the Text-only Page When Offered
Always Choose the Non-frame Page When Offered: In most cases frames take more time ... and are harder to navigate than non-frame pages.
Save Frequently-used Pages to Your Own Hard Disk
Disable Plugins

Web page authors: if someone actually takes this advice, how will your pages look ? Are they usable ?
Hints for Web Authors by Warren Steel http://www.mcsr.olemiss.edu/~mudws/webhints.html
[html.html] FreeFind http://www.freefind.com/ "Add a search engine to your web site today. It's free!"
What is good hypertext writing? About style in hypertext copy - as opposed to layout. http://kbs.cs.tu-berlin.de/~jutta/ht/hypercyber.html ??? Yacc2html Annotate yacc and lex grammars with links. ???
If you're setting up a web server, have fun making a funny 404 page. Some ideas:
- http://www.mindspring.com/~isixtyfive/404page/404.html
- funny 404 messages http://www.jordhulen.dk/bc/files/FileZilla.jpg
- http://www.artlebedev.com/mandership/93/ suggests that a 404 page should leave the original address line in place, rather than replacing it with the address of "the" 404 page, in order to make it easier for the user to correct a mis-typed URL.
Initially it seemed to make sense to have the first page people see (the index page) also be the "site map". But I'm starting to see that starting with an exhaustive list of everything -- even stuff that has gone obsolete / I'm not interested in anymore -- may not be the optimum web site experience.
http://infocentre.frontend.com/servlet/Infocentre?access=no&page=article&rows=5&id=286
"lists.evolt.org - Workers of the Web, evolt!" http://lists.evolt.org/ "mailing lists for the web development community."
http://directory.google.com/Top/Computers/Programming/Languages/JavaScript/
"The JavaScript Source is an excellent JavaScript resource with tons of "cut and paste" JavaScript examples for your Web pages. All for free!" http://javascript.internet.com/
Special Educational Needs and Disability Act 2001 ("the SENDA Act") http://www.worldwidewiki.net/wiki/SpecialEducationalNeedsandDisabilityAct | http://www.hmso.gov.uk/acts/acts2001/20010010.htm [FIXME: read]
http://w3.gorge.net/ encourages people to start putting their own pages online, but the link to tutorials has gone stale.
Bill Weinman http://bw.org/ wrote the book <creative html design.2> http://www.htmlbook.com/ [FIXME: read]
http://www.htmlguru.com/ ???
http://www.gohtm.com/ converts PDF, Excel, Word, etc. to HTML ?
"How to display your web site logo on the address bar and in the favorites list" article (by who ?) http://chami.com/tips/internet/110599I.html
http://chami.com/ [FIXME: to read]
What do Screen Readers really say? http://eleaston.com/bob/screenreader-visibility.html [FIXME: move to braille.html ?]
"Building Accessible Websites" by Joe Clark http://fawny.org/ [FIXME: toread]

There's a simple explanation here, but it's impossible to make one group understand the other. You've got your pre-Internet people and your post-Internet. Pre-Internet types have an unreconstructed 1950s conception of privacy in which only facts everyone would talk about can ever be discussed by anyone. Whereas post-Internet types put their entire lives online. There is nothing really not worth talking about, or indeed anything one should not talk about (pace Jaron Lanier http://www.extremetech.com/print_article/0,3998,a=22576,00.asp ).
-- http://fawny.org/rhcp.html
How to make web pages - A good practice guide to HTML and CSS http://htmldog.com/ (has beginner, intermediate, and advanced sections)
"Minimal HTML" by Mark L. Irons http://rdrop.com/~half/Creations/Writings/TechNotes/minimal.html To this end, Minimal HTML throws away the parts of the HTML 3.2 specification that deal with presentation, focusing instead on document structure and content. By paying more attention to content, your pages will be more useful to your visitors. Always keep this in mind: search engines index content, not design. By improving content, you increase the chance that people will find and benefit from your creations.
Google Information for Webmasters: Webmaster Guidelines http://www.google.com/webmasters/guidelines.html
"Thesis. How to cope with incorrect html" http://elsewhat.com/thesis/ (useful if you are writing something that parses HTML -- another web browser, a web-scraper, etc.)
"An ideal solution would be collecting all links at the bottom of a document. The reader has finished with the document content and has to move on." http://www.artlebedev.com/mandership/83/
URI Opacity Revisited http://atownley.org/2008/04/uri-opacity-revisited/

standard footer: why ?: so I won't forget the following.
full URI of the file: why ?: so if people print it out, they can go back to the online version to check for updates.
date file last modified: why ?: ??
date file started: why ?: because I might want to know when I started it, and if I don't write it down now (and I might as well put it *in* the file, it's metadata), I won't remember.
link pointing to index: why ?: so if people get to this page "sideways" through search engines etc., they can go "up" to my index ("no dead-end pages").
direct email address of maintainer (visible in the browser): why ?: so people who have been given a paper printout of this page can directly respond via email, without the hassle of hunting down the email address of the maintainer.
original author, current maintainer: why ?: to give credit. I know I do better work when I'm get a little recognition from it.

What is the point of the <address></address> tag ? Do I need one ?

Standard footer for all pages DAV writes:

[FIXME: consider adding a Bobby link http://bobby.cast.org/bobby/bobbyServlet?URL=http%3A%2F%2Frdrop.com%2F%7Ecary%2Fhtml%2Fhtml.html&output=Submit&gl=wcag1-aaa to the footer ?]

[FIXME: Does this look like a good thing to put in the footer of every page ?

]

This page started 1998-01-15 (but some information much older, going back to 1996 Nov 12) by David Cary and has backlinks

Pages linked to this page known by Google

errors , bug reports to

David Cary feedback.html
d.cary+72@ieee.org.

Alas, CritSuit is offline. Is there any replacement for it?

Return to index // end http://david.carybros.com/html/html.html /* was http://rdrop.com/~cary/html/html.html */