demosthenes.info

A blog by Dudley Storey on , , , , , , and anything else that strikes his fancy.

featured articles

popular favourites

UTF-8: the key to languages, characters and glyphs on web pages

While the majority of typographic control for web pages – font selection, leading, kerning, hyphenation and more – is the realm of , there are some basic character elements that are the responsibility of HTML, the basis of which is character encoding in UTF-8 formatting.

Throughout this blog I emphasize “use UTF-8 in web pages”. We specify we are using UTF-8 in several places:

UTF-8 is a character encoding scheme: a format that decides how characters are encoded in a document. (Note that this is different from, but intertwined with, font selection – the choice of typeface used). Put in very simple terms, utf-8 allows us to use any character from any language in a document, along with a wide variety of glyphs (symbols) and punctuation, including many that are not on your keyboard.

Some of these typographic symbols are subtle: a true left opening quote (“), which is different from the character generated by your keyboard ("), an em-dash (–) from an en-dash (-). They do make an important difference in your document, contributing to a well-presented page.

web developer guide

featured comment

by JoelB in Goodbye, JQuery Validation: HTML5 Form Errors With CSS3

what i'm reading

A Storm of Swords: A Song of Ice and Fire: Book Three
A Storm of Swords: A Song of Ice and Fire: Book Three

what i'm watching

Californication: The Third Season
Californication: The Third Season

what i'm playing

Mass Effect 3 Collector's Edition
Mass Effect 3 Collector's Edition

what i'm hearing

Dub FX
Dub FX

blogs

podcasts

no ads ever

This blog is free of advertising, and always will be.

creative commons licensed

The content of this blog is free to use in whatever way you wish under the Creative Commons license.