Alexandre Enkerli

Results 17 comments of Alexandre Enkerli

Wow. Got to say, this very thread is an example of how development projects work in people’s _dreams_. Good work, guys! Have yet to experiment with Markov chains. Been hearing...

Sounds like there are instructions here: https://github.com/googlecreativelab/coder/wiki (Haven’t tried Coder in any form, yet. Just noticed this issue while checking how active the project was.)

Thanks for the tip.

Got a similar issue with two sites producing empty `cleaned_text`. This, from @barrust, filled me with hope: `goose.config.known_context_patterns.append({'attr': 'class', 'value': 'post-outer'})` However, Python throws an error saying that the Configuration...

Hm... Close but not quite. In goose (Python2), adding that line to `content.py` and running `article = g.extract(url='https://investsurrey.ca/node/126')` gives me the error: `'Configuration' object has no attribute 'hasattribute'` Searched around...

Thanks for both pieces of information, @barrust ! Will try `hasattr` in Python-Goose, though it’s probably best for me to focus on Goose3. Got a strange issue there. Will post...

Also, @barrust, in my case, the Goose object doesn’t have `hasattr` either. These are my lines 50-51 in `content.py`: ```` if self.config.hasattr('known_context_patterns'): KNOWN_ARTICLE_CONTENT_TAGS.extend(self.config.known_context_patterns) ```` Yet python complains again. ```` >>>...

To help the Python2 folks, a last-ditch effort to add class selectors to Python-Goose... Lines 50-51 of my `content.py`: ```` if hasattr(self, 'known_context_patterns'): KNOWN_ARTICLE_CONTENT_TAGS.extend(self.config.known_context_patterns) ```` Ran the following script: ````...

@piccolbo Well, if the same tag applies to all sorts of sites, it’s not really manual. Allegedly, Blogger uses the same `post-outer` class everywhere, so adding this one to known...

At this point (July 2018), the following works in Python3: In[]: `url_template = "http://climate.weather.gc.ca/climate_data/bulk_data_e.html?stationID=5415&Year={year}&Month={month}&format=csv&timeframe=1&submit=%20Download+Data"` and: In[]: `url = url_template.format(month=3, year=2012)` `weather_mar2012 = pd.read_csv(url, skiprows=15, index_col='Date/Time', parse_dates=True, encoding='utf-8', header=0)` An important...