pandas-genomics
Pandas ExtensionDtypes for dealing with genomics data
Currently, [`calculate_edge_alphas`](https://github.com/HallLab/pandas-genomics/blob/master/pandas_genomics/accessors/utils/edge_encoding.py#L12) accepts only a single outcome. Change it to accept a list of str instead.
[Multiprocessing package](https://docs.python.org/3/library/multiprocessing.html)
See here: https://pandas.pydata.org/pandas-docs/version/1.0.0/development/extending.html#compatibility-with-apache-arrow
- [ ] HWE exact test, possibly in one method
- [ ] SNP Call Rate
- [ ] Sample Call Rate
- [ ] Sample Heterozygosity
- [ ]...
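As a rough illustration of three of the QC metrics listed above (SNP call rate, sample call rate, sample heterozygosity), here is a minimal NumPy sketch. It does not use the `pandas-genomics` API. It assumes a hypothetical samples × SNPs matrix of alternate-allele counts (0, 1, 2) with `np.nan` for missing calls, and the function names are made up for this example.

```python
import numpy as np


def snp_call_rate(genotypes: np.ndarray) -> np.ndarray:
    """Fraction of samples with a non-missing call, per SNP (column)."""
    return 1.0 - np.isnan(genotypes).mean(axis=0)


def sample_call_rate(genotypes: np.ndarray) -> np.ndarray:
    """Fraction of SNPs with a non-missing call, per sample (row)."""
    return 1.0 - np.isnan(genotypes).mean(axis=1)


def sample_heterozygosity(genotypes: np.ndarray) -> np.ndarray:
    """Fraction of called genotypes that are heterozygous, per sample."""
    n_called = (~np.isnan(genotypes)).sum(axis=1)
    n_het = (genotypes == 1).sum(axis=1)  # NaN never equals 1
    # Samples with no called genotypes stay NaN instead of dividing by zero
    out = np.full(genotypes.shape[0], np.nan)
    np.divide(n_het, n_called, out=out, where=n_called > 0)
    return out


# Assumed encoding: rows are samples, columns are SNPs, NaN is a missing call
genotypes = np.array(
    [
        [0, 1, 2, np.nan],
        [1, 1, np.nan, np.nan],
        [2, 0, 1, 1],
    ],
    dtype=float,
)

snp_rates = snp_call_rate(genotypes)
sample_rates = sample_call_rate(genotypes)
het_rates = sample_heterozygosity(genotypes)
```

Heterozygosity is computed over called genotypes only, so missing calls do not deflate it. Whether the library should use that denominator is a design choice left open here.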
Return a boolean array for this kind of comparison:

```
gt_array == "A/A"
```

Should require a minor update to the `GenotypeArray.__eq__` method
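A minimal sketch of the requested behavior, using a hypothetical toy class rather than the real `GenotypeArray` (whose internal storage is not shown here). The class name `ToyGenotypeArray`, the string-based storage, and the exact-string matching rule are all assumptions made for illustration.

```python
import numpy as np


class ToyGenotypeArray:
    """Hypothetical stand-in for GenotypeArray; stores genotypes as 'A/G'-style strings."""

    def __init__(self, genotypes):
        self._data = np.array(list(genotypes), dtype=object)

    def __len__(self):
        return len(self._data)

    def __eq__(self, other):
        if isinstance(other, str):
            # Compare every stored genotype against the single string
            return np.array([g == other for g in self._data], dtype=bool)
        if isinstance(other, ToyGenotypeArray):
            if len(other) != len(self):
                raise ValueError("Lengths must match for elementwise comparison")
            return np.array(
                [a == b for a, b in zip(self._data, other._data)], dtype=bool
            )
        # Let Python fall back to its default handling for unsupported types
        return NotImplemented


gt_array = ToyGenotypeArray(["A/A", "A/G", "G/G", "A/A"])
mask = gt_array == "A/A"
```

This sketch matches the string exactly, so `"G/A"` would not equal `"A/G"`. A real implementation would need to decide whether allele order matters.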
https://zzz.bwh.harvard.edu/plink/tutorial.shtml
This would be useful for round-trip IO of plink data, and would allow for specialized methods that require information about family structure or reported sex

- [ ] Extension Array...
This is not a `pandas-genomics` specific issue, but I came across this link via https://pandas-genomics.readthedocs.io/en/latest/. It appears someone forgot to renew hall-lab.org and this domain is now used to advertise...