VidaminT

Results 8 comments of VidaminT

1. Progress: Got the pipeline up and running: pulling Wikipedia bios, grabbing Wikidata info (gender + occupation), and saving everything into a clean dataset. Tested it on a small batch...

1. Progress: - Scaled it way up — we’re now at ~1.1M bios all enriched and cleaned. - Gender, occupation, and country normalization are in place, and I added buckets...

1. Progress: - Spent a lot of time debugging the API calls. I found and fixed two major bugs that were stopping the script from getting the QIDs and timestamps...

1. Progress: - Timestamp collection, enriching, normalizing and initial dashboard is done. Need some feedback on dashboard for improvements if needed - Implemented a working monthly incremental refresh pipeline using...

Progress: Snapshot of reports/representation_gaps.md ## 2. Gender Representation ![Gender Representation Over Time](C:/Users/drrahman/Downloads/Gender%20Representation%20Over%20Time%20(Filterable%20by%20Continent).png) A modest improvement since 2015 is visible. Between 2015 and 2025, the male share declined from ≈ 72%...

1. Progress: Included statistical analysis and the intersectional analysis. The statistical one digs into things like interrupted time series and changepoint detection to see where we're actually seeing significant shifts...

1. Progress: Presented today (11/17). 2. Blockers: None 3. Availability: ~10 4. Next step: Create wiki page of the project overview (detailed article page; with link to the powerpoint), update...

1. Progress: Wiki page - https://github.com/hackforla/data-science/wiki/Wikipedia-Representation-Gaps; needs review 2. Blockers: None 3. Availability: 3-5