newsdiffs
politico scraper fails on "print.cfm"
print_link = soup.findAll('a', href=re.compile('http://dyn.politico.com/printstory.cfm.*'))[0].get('href')
IndexError: list index out of range
Indeed, Politico's pages no longer contain this kind of print link, so findAll returns an empty list and indexing it with [0] raises the IndexError.
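One possible way to stop the crash, as a minimal sketch rather than the project's actual fix: check whether the link exists before indexing. The helper name `find_print_link` is hypothetical, and what the scraper should do when it gets `None` back (for example, parse the regular article page) is an assumption left to the caller.

```python
import re

# Same pattern as the failing line, with the dots escaped so they match literally.
PRINT_LINK_RE = re.compile(r'http://dyn\.politico\.com/printstory\.cfm.*')


def find_print_link(soup):
    """Return the print-view URL, or None if the page has no such link.

    `soup` is expected to be the BeautifulSoup object the scraper already
    builds; `find_print_link` itself is a hypothetical helper name.
    """
    links = soup.findAll('a', href=PRINT_LINK_RE)
    if not links:
        # No print link on the page: report that instead of raising IndexError.
        return None
    return links[0].get('href')
```

The caller can then branch on the result (`if print_link is None: ...`) instead of failing with an uncaught exception.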