[FEATURE] Automatic Inclusion of File Name in Metadata for CSV and PDF Imports
Describe the feature you'd like I would like Flowise to have a feature where both CSV and PDF import nodes automatically capture and include the name of the imported file in the metadata. A clear and concise inclusion of file names in the metadata would provide significant benefits for users who need to track the origins of their data across various documents and ensure efficient data processing and management.
Additional context As it stands, there doesn't appear to be a built-in feature to automatically append the file name of imported CSV or PDF documents to their respective metadata within Flowise. This functionality would be particularly useful when handling multiple files, as it would aid in maintaining organized records and enhance traceability. If this can be implemented or if there are any plans for such an enhancement in upcoming releases, it would be excellent to know. Additionally, if there are any current methods or scripts that can serve as a temporary solution, that information would also be valuable.
Thank you for considering adding this capability. It would surely be a valuable addition to the FlowiseAI toolkit.
thanks for the suggestion! For now, its abit manual as you will have to add as part of the metadata:
@HenryHengZJ , @seonghobae Great enhancement suggestion indeed - 2 questions related to this :
- instead of this "manual/Hard coded process" - is it possible to take the file name from pdffile field (it didn't work...) - something like...
- also , if I want the file name or any other additional field to be upserted to my Embedding Postgres DB as an additional embedding key - any idea how to do it ? tried to "play" with the additional configuration of Postgres node but without success
Thanks for any lead here - been struggling on this for some time
Thankd for any lead here - been strugelling on this for some time
@HenryHengZJ , @seonghobae Great enhancement suggestion indeed - 2 questions related to this :
- instead of this "manual/Hard coded process" - is it possible to take the file name from pdffile field (it didn't work...) - something like...
I don't know It's possible, because I'm not expert of javascript language. However If I put files into vector store with GitHub, can write repository information and file informations into metadata automatically.
- also , if I want the file name or any other additional field to be upserted to my Embedding Postgres DB as an additional embedding key - any idea how to do it ? tried to "play" with the additional configuration of Postgres node but without success
Thanks for any lead here - been struggling on this for some time
In recent version, upsert function has been updated.
Adding me to the list of interested in this feature. Testing on a list of 61 long PDF files will be really useful having automatically added as metadata the file name. Interested to know if someone has found a way to achieve this without hardcoding, but reading the actual name from the file.
Try using document store, you can do that by setting the metadata field:
Its abit manual but not too difficult
