colrev
feat: Prospero SearchSource
Description
Enhance the CoLRev environment by implementing search functionality for the PROSPERO database, which specializes in systematic review protocols. This feature involves scraping search results, including detailed protocol information, and supporting load heuristics for recognizing and processing exported results.
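A load heuristic of the kind mentioned above could be sketched as follows. This is only an illustration: the function name prospero_heuristic, the marker string, and the confidence values are assumptions, and the actual format of PROSPERO exports has to be checked before relying on any marker.

```python
# Hypothetical marker: assumes exported files mention the PROSPERO host.
PROSPERO_MARKER = "crd.york.ac.uk/prospero"


def prospero_heuristic(data: str) -> dict:
    """Return a confidence score that `data` is a PROSPERO export."""
    result = {"confidence": 0.0}
    if PROSPERO_MARKER in data.lower():
        # Full confidence is an arbitrary choice for this sketch.
        result["confidence"] = 1.0
    return result
```

A file whose content contains the marker would then be assigned to the PROSPERO source on load, while all other files keep a confidence of 0.0.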
Implementation Notes
- Use the BeautifulSoup library to develop a scraper that navigates the PROSPERO search interface, fetching search results along with detailed data from sub-pages, such as the date of registration.
- Documentation should guide users on how to perform searches and load search results.
- Implement unit tests simulating the retrieval process with a simple example.
- The packages docs page explains the steps of developing CoLRev packages.
- The search-feed offers functionality for storing records.
- CEP003 describes principles for SearchSources.
- CEP002 describes the standard data schema for records.
- The colrev.crossref package implements similar unit tests.
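The scraping and testing notes above can be illustrated with a minimal sketch that parses a result list and a detail page with BeautifulSoup. The HTML samples, CSS selectors, field names, and function names are all hypothetical; the real PROSPERO markup will differ and has to be inspected first. Parsing static HTML strings also shows how a unit test can simulate retrieval without network access.

```python
from __future__ import annotations

from bs4 import BeautifulSoup

# Hypothetical markup standing in for a PROSPERO result list.
SAMPLE_RESULTS_HTML = """
<table>
  <tr class="record">
    <td class="id">EXAMPLE-1</td>
    <td class="title"><a href="/detail/1">Example protocol A</a></td>
  </tr>
  <tr class="record">
    <td class="id">EXAMPLE-2</td>
    <td class="title"><a href="/detail/2">Example protocol B</a></td>
  </tr>
</table>
"""

# Hypothetical markup standing in for a protocol sub-page.
SAMPLE_DETAIL_HTML = """
<div><span class="registration-date">2024-01-15</span></div>
"""


def parse_results(html: str) -> list[dict]:
    """Extract ID, title, and detail link from each result row."""
    soup = BeautifulSoup(html, "html.parser")
    records = []
    for row in soup.select("tr.record"):
        link = row.select_one("td.title a")
        records.append(
            {
                "ID": row.select_one("td.id").get_text(strip=True),
                "title": link.get_text(strip=True),
                "url": link["href"],
            }
        )
    return records


def parse_registration_date(detail_html: str) -> str | None:
    """Extract the registration date from a detail page, if present."""
    soup = BeautifulSoup(detail_html, "html.parser")
    node = soup.select_one("span.registration-date")
    return node.get_text(strip=True) if node else None


if __name__ == "__main__":
    for record in parse_results(SAMPLE_RESULTS_HTML):
        print(record)
    print(parse_registration_date(SAMPLE_DETAIL_HTML))
```

Keeping the parsing functions separate from the HTTP requests means the unit tests only need saved HTML fixtures.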
User Story
- The user initializes a CoLRev project using colrev init.
- He/she conducts a search operation with colrev search -a colrev.prospero, specifying search parameters relevant to their topic.
- The package retrieves records based on the parameters and saves them in the data/search directory of the project.
- When users run colrev load, the records from the search directory are added to the data/records.bib file, which integrates all search results in the project.
- When users run colrev search again, records are retrieved from the API, updating the existing records.
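The update behavior in the last step can be sketched with plain dictionaries. This is not the CoLRev search-feed API; the function update_feed and the record layout are assumptions used only to show how a repeated search might add new records and refresh changed ones.

```python
def update_feed(existing: dict, retrieved: list) -> dict:
    """Merge retrieved records into `existing`, keyed by record ID.

    Returns counts of added and changed records.
    """
    counts = {"added": 0, "changed": 0}
    for record in retrieved:
        key = record["ID"]
        if key not in existing:
            # New record: store a copy so later edits do not alias the input.
            existing[key] = dict(record)
            counts["added"] += 1
        elif any(existing[key].get(k) != v for k, v in record.items()):
            # Known record with differing fields: overwrite those fields only.
            existing[key].update(record)
            counts["changed"] += 1
    return counts


if __name__ == "__main__":
    feed = {"P1": {"ID": "P1", "title": "Old title"}}
    new = [{"ID": "P1", "title": "New title"}, {"ID": "P2", "title": "Other"}]
    print(update_feed(feed, new))
    print(update_feed(feed, new))
```

Running the same search twice leaves the feed unchanged on the second pass, which is the property a repeated colrev search should preserve.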
Useful Links
Expected Effort
2 months, 3-4 people.