Zane Selvans
I mostly work in Python, and would say I have an intermediate level of familiarity with `pytest` and testing in general. However, I work on data processing & analysis, not...
- [x] pgloader --version
  ```
  pgloader version "3.6.2"
  compiled with SBCL 2.0.6.debian
  ```
- [ ] did you test a fresh compile from the source tree? I did not, tbh....
In some data, notably the FERC Form 1, we find that information which should really be in a single row is spread across multiple rows. So far we've handled this...
This update isn't as simple as I thought. See #1780
Right now we have hard-coded an assumption that the EIA-860m data will always include data from the one year following the last year of data available from the EIA-860. However, there...
Our current `heat_rate_by_unit()` calculation only works for generation units with a `unit_id_pudl`, which excludes many combined cycle generators and much of the older data.

* [x] Assign `unit_id_pudl` values to...
In the course of [setting up continuous integration](https://github.com/catalyst-cooperative/rmi-ferc1-eia/issues/151) in the `rmi-ferc1-eia` repository, we discovered that the current plant part list generation process uses a huge amount of memory -- something...
Historically we've used `pudl.helpers.simplify_strings()` for string normalization in many contexts, but there's also `pudl.helpers.cleanstrings_snake()` (pulled over from the RMI / PPE repo), and in the FERC 1 / XBRL refactor...
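For anyone unfamiliar with what "string normalization" means in this context, here is a minimal sketch of the general idea using only the standard library. It is not the implementation of `pudl.helpers.simplify_strings()` or `pudl.helpers.cleanstrings_snake()`; the function name `normalize_string` and the exact rules (ASCII folding, lowercasing, collapsing punctuation and whitespace into underscores) are assumptions chosen purely for illustration.

```python
import re
import unicodedata


def normalize_string(value: str) -> str:
    """Hypothetical snake_case normalizer; NOT the pudl implementation."""
    # Decompose accented characters, then drop anything outside ASCII.
    ascii_only = (
        unicodedata.normalize("NFKD", value)
        .encode("ascii", "ignore")
        .decode("ascii")
    )
    # Lowercase, then turn every run of non-alphanumerics into one underscore.
    snake = re.sub(r"[^a-z0-9]+", "_", ascii_only.lower())
    # Trim underscores left over at either end.
    return snake.strip("_")
```

Having a single function like this, rather than several overlapping helpers, would mean that free-form values differing only in case, spacing, or punctuation end up as the same normalized token.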
The `pudl.helpers.cleanstrings` function is kind of a hot mess, as it dates from the early days. I was tempted to rewrite it as part of #1739 but am restraining myself....