pandarallel
pandarallel copied to clipboard
A simple and efficient tool to parallelize Pandas operations on all available CPUs
Fix #191 Didn't find a way of doing this that doesn't convert the generator to a list, unfortunately. To check that it works in colab (adapted from @agiveon's example): ```python...
## General - **Operating System**: OSX - **Python version**: 3.7.13 - **Pandas version**: 1.3.5 - **Pandarallel version**: 1.6.2 ## Acknowledgement - Issue happens on Colab only. When I use VScode,...
Ideally, it should run on: - Linux - Windows - Mac
Function names: - `test_dataframe_groupby_expanding_apply` - `test_dataframe_groupby_apply_complex`
Without pandarallel - ```python3 unspsc_title_similarity_df['titles_similarity_score'] = unspsc_title_similarity_df[['label_title_1', 'label_title_2']].apply(lambda x: get_spacy_similarity(*x), axis = 1) ``` ```shell 100%|██████████████████████████████████████████| 1770/1770 [00:30
## General - **Operating System**: Arch Linux (Linux kernel version 5.16.16) - **Python version**: 3.10.4 - **Pandas version**: 1.4.1 - **Pandarallel version**: 1.6.1 ## Acknowledgement - My issue is **NOT**...
I was trying to parallelise my code with `pandarallel` package in the following way: import pandas as pd from sklearn.cluster import SpectralClustering from pandarallel import pandarallel import numpy as np...
I tried using this package to query API end points based on the pandas dataframe. I am getting the following error using the package - ``` Traceback (most recent call...
**Please write here what feature `pandarallel` is missing:** You support the same functionality if you write an apply on a dataframe instaed of a series. e.g. df.groupby(args)[[colName]].parallel_apply(func1) Would it be...
I start parallel with a thread in the main function_ Apply didn't report an error, but I used a thread to call parallel in Django service first_ When applying the...