Christovis

christovis.github.io

Results 13 comments of


                                            Christovis

generalize get_list_name ?

Yep, agree, we should find a way to generalise this method and maybe place it in `utils.py` ? For Listserv mailing lists I have a function `ListservList.get_name_from_url(mlist_url)` [here](https://github.com/datactive/bigbang/blob/e6a7ba464fed84bfb0968fcbcc8079ed2d495727/bigbang/listserv.py#L812) that get's...

consolidate entity resolution scripts into single module

I was wondering whether [this](https://gist.github.com/ammarshah/f5c2624d767f91a7cbdc4e54db8dd0bf) might be helpful to filter out personal contributions, and [this](https://github.com/cncf/gitdm) to find affiliations (as suggested during the IAB-AID workshop.

Participation activity by hour/day/timezone

Contributions per timezone is showcased for the 3GPP in this example: [timezones.ipynb](https://github.com/datactive/bigbang/blob/main/examples/3GPP/timezones.ipynb) and can easily expanded to hour of day. You can use it as a template for the IETF.

ingest from GitHub projects

This is part of #417

email domain analysis: list top/bottom working groups by PCA dimension

Could you explain this a bit more. I don't know what is meant by top/bottom and tags? Is what every you describe here partially contained in the _Multi-dimensional scaling_ of...

combine load_data and open_list_archives ?

**> How is the LISTSERV data stored locally after it has been collected, @Christovis ?** After a mailing archive or list has been scraped and single messages are a `mboxMessage`...

combine load_data and open_list_archives ?

**> How does the LISTSERV functionality load the data from csv or mbox?** It is assumed that the user can by themselves identify the file format in which the data...

combine load_data and open_list_archives ?

At the moment I believe we could have an: 1. `MessageParser(ABC)` that can contain functions like `from_url(), create_email_message(), get_datetime(), to_dict(), to_pandas_dataframe(), to_mbox()` such that they don't need to be duplicated....

combine load_data and open_list_archives ?

@sbenthall given the new code structure that is emerging (at least for 3GPP, W3C, IEEE) is this issue still relevant or does it need to be rephrased?

Dashboards

For nice visualisation of the archive of mailing list communication network we could use [kumu](https://kumu.io/).

1
2
›