Integrate WaterCrawl as a Data Source for Knowledge Base
Self Checks
- [x] I have searched for existing issues, including closed ones.
- [x] I confirm that I am using English to submit this report (I have read and agree to the Language Policy).
- [x] [FOR CHINESE USERS] Please be sure to submit the issue in English, otherwise it will be closed. Thank you! :)
- [x] Please do not modify this template :) and fill in all the required fields.
1. Is this request related to a challenge you're experiencing? Tell me about your story.
Yes, this request is related to a challenge I’m experiencing.
Currently, Dify allows users to sync web data into its Knowledge Base using Firecrawl. However, my platform, WaterCrawl, offers a different approach to structured web crawling, enabling users to extract and organize web content more efficiently.
I have already deployed the WaterCrawl Dify plugin to your marketplace (WaterCrawl Plugin). User feedback shows that many of our users work in Dify and would like to use WaterCrawl as a Knowledge Base source as well. However, the current plugin integration does not support this, which limits how effectively they can use our system.
Adding WaterCrawl as an alternative data source in Dify would give users more control, flexibility, and accuracy when importing web data. It would also reduce the reliance on a small number of web data providers, giving users more options based on their specific needs.
I am also willing to implement this feature myself and ensure it meets Dify's code standards.
2. Additional context or comments
WaterCrawl is designed to provide structured web crawling and intelligent data extraction, making it an excellent fit for AI-powered applications.
3. Can you help us with this feature?
- [x] I am interested in contributing to this feature.