eland icon indicating copy to clipboard operation
eland copied to clipboard

Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch

Results 137 eland issues
Sort by recently updated
recently updated
newest added

I'm working on a AWS EC2 server. I followed the [tutorial](https://www.elastic.co/guide/en/elasticsearch/reference/8.3/docker.html#_disable_swapping) and started a multi-node cluster with docker-compose. Then followed [this NER example](https://www.elastic.co/blog/how-to-deploy-nlp-named-entity-recognition-ner-example) trying to deploy a model to the...

Guiding the user through the configuration options and detect misconfigurations early.

topic:NLP

#479 shows that new versions of PyTorch will sometimes be incompatible with older versions. In this case a Torchscript model traced in version 1.12 of PyTorch cannot be evaluated in...

topic:ml

Add support for: - `df[:]` - `df[::]` - `df[:100]` - `df[100:]` - `df[100:200]` - `df[100:][1:]` Slicing functionality that should raise a ValueError: - `df[::2]` (step) - Probably others?

enhancement
help wanted
topic:series
topic:dataframe

I have 100 million documents stored in Elasticsearch. Each document has a field named `vector` with the type `dense_vector`. I want to calculate the mean vector of this bucket. Is...

Closes #449 We are dumping the entire ES index to pandas data frame and then converting it into csv. This might not hold for large datasets and cause unresponsiveness. So,...

when using Elasticsearch client to add field alias. i.e: ``` es.indices.put_mapping(index="my-index", properties={ 'new-field-alias': { 'type': 'alias', 'path': 'old-field-name' } }) ``` This creates a [field alias](https://www.elastic.co/blog/introducing-field-aliases-in-elasticsearch) to field "old-field-name" called...

enhancement
topic:dataframe

Below is a question I've asked elsewhere (don't have an answer yet). Could Eland be a candidate for making this easier? Eland specifically focuses on Pandas, but I don't see...

enhancement
topic:dataframe

May be a no-op, but we need to run our test suite with pandas 1.4.x to make sure.

This small change will improve the performance of `FieldMappingCache` Existing drawbacks: - `field_mapping_cache` is initialized on every yield of `itertuples` and `iterrows` - Whenever eland dataframe is used to `repr()`...