extraction
Use lxml if available and parse head only.
Although README.md hinted that lxml would be used if available, the code forced the parser choice to html5lib only.
This change also adds checks to parse only the <head> tag, which improves performance on particularly large HTML files.
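
For illustration, a minimal sketch of the two ideas in this change, not the code from the diff itself. It assumes Beautiful Soup-style parser names ("lxml", "html5lib", "html.parser"); the helper names `choose_parser` and `head_only` are hypothetical, and the fallback to "html.parser" is an assumption rather than something stated above.

```python
import importlib.util
import re

# Assumed preference order: lxml first, then html5lib.
PREFERRED_PARSERS = ("lxml", "html5lib")


def choose_parser(is_available=None):
    """Return the first preferred parser that is installed, else 'html.parser'."""
    if is_available is None:
        # find_spec returns None when a top-level module is not installed.
        is_available = lambda name: importlib.util.find_spec(name) is not None
    for name in PREFERRED_PARSERS:
        if is_available(name):
            return name
    return "html.parser"  # assumed fallback: the parser bundled with Python


# Matches the closing head tag in any letter case, e.g. </head> or </HEAD >.
_HEAD_END = re.compile(r"</head\s*>", re.IGNORECASE)


def head_only(html):
    """Cut the markup just after </head>; return it unchanged if no closing tag is found."""
    match = _HEAD_END.search(html)
    return html[:match.end()] if match else html
```

Truncating the raw markup before parsing keeps the saving independent of the parser chosen. With Beautiful Soup, for example, the result could be passed on as `BeautifulSoup(head_only(html), choose_parser())`.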