Most HTML documents are relatively small. They serve as the starting point from which the other resources on the page are loaded.
But why do some websites load several megabytes of HTML code? Usually it's not that there's a lot of content on the page, but rather that other types of resources are embedded within the do