eland
eland copied to clipboard
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
I'm working on a AWS EC2 server. I followed the [tutorial](https://www.elastic.co/guide/en/elasticsearch/reference/8.3/docker.html#_disable_swapping) and started a multi-node cluster with docker-compose. Then followed [this NER example](https://www.elastic.co/blog/how-to-deploy-nlp-named-entity-recognition-ner-example) trying to deploy a model to the...
Guiding the user through the configuration options and detect misconfigurations early.
#479 shows that new versions of PyTorch will sometimes be incompatible with older versions. In this case a Torchscript model traced in version 1.12 of PyTorch cannot be evaluated in...
Add support for: - `df[:]` - `df[::]` - `df[:100]` - `df[100:]` - `df[100:200]` - `df[100:][1:]` Slicing functionality that should raise a ValueError: - `df[::2]` (step) - Probably others?
I have 100 million documents stored in Elasticsearch. Each document has a field named `vector` with the type `dense_vector`. I want to calculate the mean vector of this bucket. Is...
Closes #449 We are dumping the entire ES index to pandas data frame and then converting it into csv. This might not hold for large datasets and cause unresponsiveness. So,...
when using Elasticsearch client to add field alias. i.e: ``` es.indices.put_mapping(index="my-index", properties={ 'new-field-alias': { 'type': 'alias', 'path': 'old-field-name' } }) ``` This creates a [field alias](https://www.elastic.co/blog/introducing-field-aliases-in-elasticsearch) to field "old-field-name" called...
Could Eland help facilitate conversion between popular Python data visualization and Vega (Kibana)?
Below is a question I've asked elsewhere (don't have an answer yet). Could Eland be a candidate for making this easier? Eland specifically focuses on Pandas, but I don't see...
May be a no-op, but we need to run our test suite with pandas 1.4.x to make sure.
This small change will improve the performance of `FieldMappingCache` Existing drawbacks: - `field_mapping_cache` is initialized on every yield of `itertuples` and `iterrows` - Whenever eland dataframe is used to `repr()`...