Validation aside, the fact remains that encoding certain characters is important to an HTML document so that it can render properly and safely as a web page.
Encoding & as &amp; under all circumstances, for me, is an easier rule to live by, reducing the likelihood of errors and failures.
Compare the following: which is easier? Which is easier to bugger up?
Methodology 1
- Write some content which includes ampersand characters.
- Encode them all.
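As a rough sketch of how little there is to Methodology 1, here it is in Python. The function name `encode_all` is made up for illustration, and it assumes the input is raw content that has not been encoded yet (running it twice would double-encode). In practice the standard library's `html.escape` does the same job and also handles `<` and `>`.

```python
import html


def encode_all(text: str) -> str:
    """Encode every ampersand, with no case-by-case judgement."""
    # Assumption: text is raw, not-yet-encoded content.
    return text.replace("&", "&amp;")


# Raw content, as an author might type it.
for raw in ["volt & amp", "amp&volt", "AT&T"]:
    encoded = encode_all(raw)
    # Decoding the result should always give back the original text.
    print(raw, "->", encoded, "->", html.unescape(encoded))
```

The point of the round trip is that it holds for every input, so there is nothing to decide per ampersand.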
Methodology 2
(with a grain of salt, please ;) )
- Write some content which includes ampersand characters.
- On a case-by-case basis, look at each ampersand. Determine if:
  - It is isolated, and as such unambiguously an ampersand, e.g.
    volt & amp
    > In that case don't bother encoding it.
  - It is not isolated, but you feel it is nonetheless unambiguous, as the resulting entity does not exist and will never exist since the entity list could never evolve, e.g.
    amp&volt
    > In that case don't bother encoding it.
  - It is not isolated, and ambiguous, e.g.
    volt&amp
    > Encode it.
??
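To see why the case-by-case approach is easy to get wrong, here is a small Python sketch that feeds unencoded source text through `html.unescape`, which follows the HTML5 rules for character references and so roughly approximates what a browser does with text content. The helper name `decoded_as` is hypothetical, and the third sample assumes the ambiguous case is source text like `volt&amp`.

```python
import html


def decoded_as(source: str) -> str:
    """Return the text a reader would see for this unencoded source."""
    # html.unescape is used here as a stand-in for a browser's parser.
    return html.unescape(source)


# One sample per case from Methodology 2, written as raw source text.
for source in ["volt & amp", "amp&volt", "volt&amp"]:
    print(repr(source), "->", repr(decoded_as(source)))
```

The first two samples come back unchanged, but the third loses its "amp", because `&amp` is accepted as a character reference even without the trailing semicolon. That is exactly the kind of case you have to spot by eye under Methodology 2.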