politico scraper fails on "print.cfm"

Open Fil opened this issue 11 years ago • 0 comments

    print_link = soup.findAll('a', href=re.compile('http://dyn.politico.com/printstory.cfm.*'))[0].get('href')
IndexError: list index out of range

indeed politico's pages don't have this kind of print_link anymore

Mar 19 '15 08:03 Fil