Flowise icon indicating copy to clipboard operation
Flowise copied to clipboard

[FEATURE] Automatic Inclusion of File Name in Metadata for CSV and PDF Imports

Open seonghobae opened this issue 1 year ago • 5 comments

Describe the feature you'd like I would like Flowise to have a feature where both CSV and PDF import nodes automatically capture and include the name of the imported file in the metadata. A clear and concise inclusion of file names in the metadata would provide significant benefits for users who need to track the origins of their data across various documents and ensure efficient data processing and management.

Additional context As it stands, there doesn't appear to be a built-in feature to automatically append the file name of imported CSV or PDF documents to their respective metadata within Flowise. This functionality would be particularly useful when handling multiple files, as it would aid in maintaining organized records and enhance traceability. If this can be implemented or if there are any plans for such an enhancement in upcoming releases, it would be excellent to know. Additionally, if there are any current methods or scripts that can serve as a temporary solution, that information would also be valuable.

Thank you for considering adding this capability. It would surely be a valuable addition to the FlowiseAI toolkit.

seonghobae avatar Apr 08 '24 21:04 seonghobae

thanks for the suggestion! For now, its abit manual as you will have to add as part of the metadata: image

HenryHengZJ avatar Apr 09 '24 10:04 HenryHengZJ

@HenryHengZJ , @seonghobae Great enhancement suggestion indeed - 2 questions related to this :

  1. instead of this "manual/Hard coded process" - is it possible to take the file name from pdffile field (it didn't work...) - something like...

image

  1. also , if I want the file name or any other additional field to be upserted to my Embedding Postgres DB as an additional embedding key - any idea how to do it ? tried to "play" with the additional configuration of Postgres node but without success

Thanks for any lead here - been struggling on this for some time

Thankd for any lead here - been strugelling on this for some time

haimco50 avatar Apr 15 '24 21:04 haimco50

@HenryHengZJ , @seonghobae Great enhancement suggestion indeed - 2 questions related to this :

  1. instead of this "manual/Hard coded process" - is it possible to take the file name from pdffile field (it didn't work...) - something like...

image

I don't know It's possible, because I'm not expert of javascript language. However If I put files into vector store with GitHub, can write repository information and file informations into metadata automatically.

  1. also , if I want the file name or any other additional field to be upserted to my Embedding Postgres DB as an additional embedding key - any idea how to do it ? tried to "play" with the additional configuration of Postgres node but without success

Thanks for any lead here - been struggling on this for some time

In recent version, upsert function has been updated.

seonghobae avatar Apr 28 '24 07:04 seonghobae

Adding me to the list of interested in this feature. Testing on a list of 61 long PDF files will be really useful having automatically added as metadata the file name. Interested to know if someone has found a way to achieve this without hardcoding, but reading the actual name from the file.

davide445 avatar Aug 01 '24 21:08 davide445

Try using document store, you can do that by setting the metadata field: image

Its abit manual but not too difficult

HenryHengZJ avatar Aug 31 '24 11:08 HenryHengZJ