haystack issues

Load CLIP Models into a Retriever for image retrieval

Once the simplification of `haystack/modeling/model/language_modeling.py` are merged, we can proceed attempting to load a `Data2VecVision` into a Retriever, be it `EmbeddingRetriever` or a custom retriever class. This issue implies also...

ZanSara

type:feature

topic:retriever

Fix benchmark script and update the readme page with latest results

1

**Describe the bug** In the past we were running benchmarks quite regularly for haystack and published results [here](https://haystack.deepset.ai/benchmarks/latest). As the benchmarking script was not working anymore at some point, we...

masci

type:bug

topic:speed

action:benchmark

topic:accuracy

journey:advanced

Change default `save_dir` for `FARMReader.train`

1

**Is your feature request related to a problem? Please describe.** [This line](https://github.com/deepset-ai/haystack/blob/632cd1c141a8b485c6ef8695685d2d8eef3ca50f/haystack/nodes/reader/farm.py#L209) sets the parameter `save_dir` to a directory two layers above the current directory. This can be confusing and...

MichelBartels

type:feature

good first issue

topic:save/load

topic:reader

journey:intermediate

Add support for Object storage

3

Currently, Haystack supports storing data into ElasticSearch, InMemory, and RDBMS. It would be nice to add support of Object storage like S3, which is very cheap and have less hassle...

lalitpagaria

type:feature

Contributions wanted!

topic:document_store

journey:advanced

Population based training for fine tuning?

1

### Discussed in https://github.com/deepset-ai/haystack/discussions/2000 Originally posted by **jacoby149** January 12, 2022 Hi, I would like to do population based training to train my farm reader. I know haystack uses ray...

jacoby149

type:feature

Contributions wanted!

topic:accuracy

topic:models

journey:intermediate

Filter ElasticSearch results by min_score

4

**Problem:** I want to retrieve all relevant (similar) documents from the `ElasticsearchDocumentStore` based on the `_score` using the `EmbeddingRetriever` (I am not using the Reader). Prior to the search, I...

t-charura

type:feature

Contributions wanted!

Embedded Milvus

2

Seems like Milvus will introduce an Embedded version: - https://github.com/deepset-ai/haystack/issues/2081#issuecomment-1066044687 - https://wiki.lfaidata.foundation/display/MIL/MEP+26+--+Embedded+Milvus - https://github.com/milvus-io/milvus/issues/15711 It's probably worth keeping an eye on it to see if it can simplify drastically our...

ZanSara

type:feature

Contributions wanted!

topic:document_store

journey:first steps

Add a helper function to get datasets from HF and write them to a DocumentStore

As suggested by @vblagoje on the Haystack slack, it would be nice to have a helper function, similar to `open_search_index_to_documentstore` or `convert_files_to_docs` that would allow users to provide a HF...

TuanaCelik

type:feature

Contributions wanted!

topic:preprocessing

topic:document_store

Save the output of `pipe.eval()` as a preformatted Excel

**Problem** Currently the saved node output from `pipe.eval()` is in csv. This makes it quite annoying, as it is never configured right to have a look into it right away....

ZanSara

type:feature

good first issue

Contributions wanted!

topic:eval

journey:advanced

no_answer_score scaling is redundant

1

Currently, the no_answer scores use an `expit(score/8)` function to be scaled to the interval 0 to 1. However, the score should be between 0 and 1 even before that because...

julian-risch

good first issue

Contributions wanted!

good second issue

topic:reader

topic:predictions

haystack
haystack copied to clipboard

Metadata

Load CLIP Models into a Retriever for image retrieval

Fix benchmark script and update the readme page with latest results

Change default `save_dir` for `FARMReader.train`

Add support for Object storage

Population based training for fine tuning?

Filter ElasticSearch results by min_score

Embedded Milvus

Add a helper function to get datasets from HF and write them to a DocumentStore

Save the output of `pipe.eval()` as a preformatted Excel

no_answer_score scaling is redundant

← Metadata

Owner

Metadata

haystack haystack copied to clipboard

Metadata

← Metadata

Owner

Metadata

haystack
haystack copied to clipboard