Omri Mendels

Results 45 issues of Omri Mendels

Thanks for the awesome package! we use it in [Microsoft Presidio](https://github.com/microsoft/presidio). In addition to a download button, is it possible to add a link to source? For some users, notebooks...

## Background Presidio currently leverages spaCy for NER. It is possible to [switch to a stanza model](https://microsoft.github.io/presidio/tutorial/05_languages/), or to create additional NER recognizers using 3rd party packages such as Flair...

enhancement
Advanced
analyzer

As agreed with the folks who developed [Presidio CLI](https://github.com/insightsengineering/presidio-cli) and did an amazing job on it, we'd like to integrate it into Presidio. Tagging @dinakar29 and @knightdave. Let's use this...

**Describe the bug** A clear and concise description of what the bug is. **To Reproduce** Steps to reproduce the behavior: ```python from presidio_analyzer import AnalyzerEngine, BatchAnalyzerEngine analyzer_engine = AnalyzerEngine() batch_analyzer_engine...

bug
analyzer
structured-data

Presidio leverages ML models which might detect an instance in one sentence but not in another. By automatically adding all instances of a previously identified entity, we can increase detection...

enhancement
good first issue
analyzer

Coreference resolution would potentially allow Presidio to link named entities to a specific person. For example, in the sentence "Paul used to live in Argentina, he now lives in Uruguay",...

good first issue

Presidio uses the [phonenumbers](https://github.com/daviddrysdale/python-phonenumbers) python package for better handling and coverage of phone numbers. [See code here](https://github.com/microsoft/presidio/blob/main/presidio-analyzer/presidio_analyzer/predefined_recognizers/phone_recognizer.py). It currently uses five regions ("US", "UK", "DE", "FE", "IL") as default, but...

good first issue
documentation
analyzer

In the existing [sample for batch analysis](https://github.com/microsoft/presidio/blob/main/docs/samples/python/batch_processing.ipynb), the logic goes over all the columns and looks for PII. We'd like to extend this with logic which also evaluates how likely...

good first issue
analyzer

## Change Description Extended `install_nlp_models.py` to include transformers models based on config Added a new Dockerfile which is tailored for transformers models >Note: as part of this PR I had...

## Change Description Describe your changes ## Issue reference This PR fixes issue #XX ## Checklist - [ ] I have reviewed the [contribution guidelines](https://github.com/microsoft/presidio/blob/main/CONTRIBUTING.md) - [ ] I have...