Character Encodings in HTML

Posted on 05 April 2013
Under notes

Here’s something I don’t get in HTML. Say we have this code:

    <!DOCTYPE html>
    <html lang="en">
    <head>
    <meta charset="utf-8">

This is how most HTML documents begin. We’ve specified the character encoding in the fourth line. But in which encoding does the browser parse this code itself? I presume it just reads up to this line in utf-8 (or some other encoding), hoping it won’t encounter any special characters before that line. It still sounds like a sloppy way to do it. Does anyone actually know how the browser parses this code?
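To make that guess concrete, here is a rough Python sketch of how such a two-pass read could work. It leans on the fact that the markup before the charset declaration is plain ASCII, and those bytes look the same in utf-8 and in most legacy encodings, so the declaration can be found by scanning raw bytes before any decoding happens. The HTML standard describes a similar "prescan" of the first 1024 bytes. The function name sniff_encoding, the regular expression, and the windows-1252 fallback are my own simplifications for illustration, not what any particular browser actually runs.

```python
import re

# Hypothetical, simplified pattern: finds charset=... inside a <meta> tag.
# Real browsers follow a more careful byte-level algorithm.
META_CHARSET = re.compile(
    rb"""<meta[^>]*?charset\s*=\s*["']?([A-Za-z0-9_-]+)""",
    re.IGNORECASE,
)


def sniff_encoding(raw: bytes, default: str = "windows-1252") -> str:
    """Guess the encoding of an HTML byte stream without decoding it first."""
    # A byte order mark at the very start settles the question.
    if raw.startswith(b"\xef\xbb\xbf"):
        return "utf-8"
    if raw.startswith(b"\xff\xfe"):
        return "utf-16-le"
    if raw.startswith(b"\xfe\xff"):
        return "utf-16-be"

    # Otherwise scan only the first 1024 bytes, treating them as raw bytes.
    # The tag names and attribute names being matched are pure ASCII.
    match = META_CHARSET.search(raw[:1024])
    if match:
        return match.group(1).decode("ascii").lower()

    # Nothing declared: fall back to an assumed default.
    return default


page = b'<!DOCTYPE html>\n<html lang="en">\n<head>\n<meta charset="utf-8">\n'
encoding = sniff_encoding(page)
text = page.decode(encoding)  # only now is the document decoded for real
print(encoding)
```

This sketch ignores the encoding that a server can send in the HTTP Content-Type header, and it would also be fooled by a meta tag inside a comment, so treat it as an illustration of the idea rather than a working parser.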

