Zane Selvans

Results 177 issues of Zane Selvans

I mostly work in Python, and would say I have an intermediate level of familiarity with `pytest` and testing in general. However, I work on data processing & analysis, not...

- [x] pgloader --version ``` pgloader version "3.6.2" compiled with SBCL 2.0.6.debian ``` - [ ] did you test a fresh compile from the source tree? I did not, tbh....

In some data, notably the FERC Form 1, we find that information which should really be in a single row is spread across multiple rows. So far we've handled this...

ferc1
data-cleaning
rmi
xbrl

This update isn't as simple as I thought. See #1780

eia860
new data
settings

Right now we have hard-coded an assumption that the EIA-860m data will always include data from the 1 year following the data which is available from the EIA-860. However there...

eia860
new-data
settings

Our current `heat_rate_by_unit()` calculation only works for generation units with a `unit_id_pudl` which excludes many combined cycle generators, and much of the older data. * [x] Assign `unit_id_pudl` values to...

eia923
analysis
data-repair
Epic
rmi

In the course of [setting up continuous integration](https://github.com/catalyst-cooperative/rmi-ferc1-eia/issues/151) in the `rmi-ferc1-eia` repository, we discovered that the current plant part list generation process uses a huge amount of memory -- something...

output
testing
performance
ppe
rmi

Historically we've used `pudl.helpers.simplify_strings()` for string normalization in many contexts, but there's also `pudl.helpers.cleanstrings_snake()` (pulled over from the RMI / PPE repo), and in the FERC 1 / XBRL refactor...

ferc1
data-cleaning
ppe

The `pudl.helpers.cleanstrings` function is kind of a hot mess, as it dates from the early days. I was tempted to rewrite it as part of #1739 but am restraining myself....

ferc1
data-cleaning