Omri Mendels
Thanks for the awesome package! We use it in [Microsoft Presidio](https://github.com/microsoft/presidio). In addition to a download button, is it possible to add a link to source? For some users, notebooks...
## Background

Presidio currently leverages spaCy for NER. It is possible to [switch to a stanza model](https://microsoft.github.io/presidio/tutorial/05_languages/), or to create additional NER recognizers using 3rd party packages such as Flair...
As agreed with the folks who developed [Presidio CLI](https://github.com/insightsengineering/presidio-cli) and did an amazing job on it, we'd like to integrate it into Presidio. Tagging @dinakar29 and @knightdave. Let's use this...
**Describe the bug**
A clear and concise description of what the bug is.

**To Reproduce**
Steps to reproduce the behavior:

```python
from presidio_analyzer import AnalyzerEngine, BatchAnalyzerEngine

analyzer_engine = AnalyzerEngine()
batch_analyzer_engine...
```
Presidio leverages ML models which might detect an instance in one sentence but not in another. By automatically adding all instances of a previously identified entity, we can increase detection...
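A minimal sketch of the idea, assuming detections are available as character spans. The `Span` type and `propagate_entities` helper are hypothetical names used for illustration and are not part of the Presidio API. Every other whole-word occurrence of an already detected surface string is added with the same entity type:

```python
import re
from typing import List, NamedTuple


class Span(NamedTuple):
    """Hypothetical container for one detection (not a Presidio class)."""
    start: int
    end: int
    entity_type: str


def propagate_entities(text: str, detected: List[Span]) -> List[Span]:
    """Return the detected spans plus every other whole-word occurrence
    of the same surface text, tagged with the same entity type."""
    found = set(detected)
    for span in detected:
        surface = text[span.start:span.end]
        if not surface.strip():
            continue  # skip empty or whitespace-only detections
        # Match the exact string, but not inside a longer word.
        pattern = r"(?<!\w)" + re.escape(surface) + r"(?!\w)"
        for match in re.finditer(pattern, text):
            found.add(Span(match.start(), match.end(), span.entity_type))
    return sorted(found)
```

Exact string matching is the simplest choice here. It can raise false positives when the same string is not PII in another context, so a real implementation would likely attach a lower confidence score to propagated spans.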
Coreference resolution would potentially allow Presidio to link named entities to a specific person. For example, in the sentence "Paul used to live in Argentina, he now lives in Uruguay",...
Presidio uses the [phonenumbers](https://github.com/daviddrysdale/python-phonenumbers) Python package for better handling and coverage of phone numbers. [See code here](https://github.com/microsoft/presidio/blob/main/presidio-analyzer/presidio_analyzer/predefined_recognizers/phone_recognizer.py). It currently uses five regions ("US", "UK", "DE", "FE", "IL") by default, but...
In the existing [sample for batch analysis](https://github.com/microsoft/presidio/blob/main/docs/samples/python/batch_processing.ipynb), the logic goes over all the columns and looks for PII. We'd like to extend this with logic which also evaluates how likely...
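One possible reading of this extension, sketched under assumptions: if each cell of a column has already been analyzed and reduced to a list of detected entity types, a column-level score could be the fraction of cells in which each type appears. The `column_entity_ratios` helper and its input shape are hypothetical and are not taken from the sample notebook:

```python
from collections import Counter
from typing import Dict, List


def column_entity_ratios(cell_entities: List[List[str]]) -> Dict[str, float]:
    """For one column, return the fraction of cells in which each
    entity type was detected at least once."""
    if not cell_entities:
        return {}
    counts: Counter = Counter()
    for entities in cell_entities:
        # Count each entity type once per cell, however often it occurs.
        counts.update(set(entities))
    total = len(cell_entities)
    return {entity: count / total for entity, count in counts.items()}
```

A column where most cells contain the same entity type is a stronger candidate for being treated as that type as a whole, while a low ratio may point to sporadic detections or false positives.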
## Change Description

Extended `install_nlp_models.py` to include transformers models based on config

Added a new Dockerfile which is tailored for transformers models

> Note: as part of this PR I had...
## Change Description

Describe your changes

## Issue reference

This PR fixes issue #XX

## Checklist

- [ ] I have reviewed the [contribution guidelines](https://github.com/microsoft/presidio/blob/main/CONTRIBUTING.md)
- [ ] I have...