Sebastian Benthall
Sebastian Benthall
Something is quite wrong with the IETF data collection procees. ``` $ python bin/collect_mail.py -u https://www.ietf.org/mail-archive/text/dns-security/ ['2008-05.mail', '2008-06.mail', '2008-07.mail', '2008-08.mail', '2008-09.mail', '2008-10.mail', '2008-11.mail', '2008-12.mail'] ``` So far so good, but...
We have some archaic plotting code in the notebooks. Here's an example: ``` plt.figure(figsize=(12.5, 7.5)) for i, activity in enumerate(acts): colors = 'rgbkm' ta = activity.sum(1) rmta = ta.rolling(window).mean() rmtadna...
At the IETF 110 HRPC meeting, there's an expressed interest in IETF membership/attendance, with respect to: - Are new people are joining ? (Organizational health) - Are they being retained?...
We have a standard way of configuring a path for email archives: it's the `mail_path` value in the config file. This is used by the mail collecting scripts. It is...
``` $ python bin/collect_mail.py -u https://mm.icann.org/pipermail/cc-humanrights/ Traceback (most recent call last): File "/usr/lib/python3.9/urllib/request.py", line 1346, in do_open h.request(req.get_method(), req.selector, req.data, headers, File "/usr/lib/python3.9/http/client.py", line 1253, in request self._send_request(method, url, body,...
Improve docstring coverage on BigBang See : https://github.com/gidden/docstring-coverage Prereq for #340
There have been a lot of additional data sources tossed around: - [ ] ITU - [x] GitHub Issues - [ ] Decidim - [x] Discourse - [ ] 1M2M...
The email domain study has given us a comprehensive view of organizational participation in IETF working groups but has suffered from a lot of messiness in the data. Some steps...
There's a small function which strips down a full URL of a mailman archive to get its last part, which is used as the mailing list 'name' https://github.com/datactive/bigbang/blob/main/bigbang/mailman.py#L185-L198 This gets...
- Get list of active working groups from datatracker - use that to populate list of working groups to crawl - as an option for the `collect_mail` script from @npdoty