Shevchenko Vitaliy
Shevchenko Vitaliy
In [1]: import isodate In [2]: isodate.parse_datetime("04-05-2015T14:32:00Z-09:00") Out[2]: datetime.datetime(401, 1, 1, 14, 32, tzinfo=)
> ``` > s_iter = [''.join(map(str,y)).lstrip() for y in s_iter] > ``` > > E UnicodeEncodeError: 'ascii' codec can't encode character u'\u2014' in position 85: ordinal not in range(128)
If you run goose under python bs3 parser will not be able to use.
Fix IndexError if title is the same as site_name and add test for this case. Fix for #194.
Fix for #196 issue. If after title cleaning we still have TITLE_SPLITTER in title, use old algorithm of title cleaning (from previous version of goose).
For example http://www.ghgprotocol.org/city-accounting Extracted title is "GHG Protocol for Cities | Greenhouse Gas Protocol"