Basile Dura

Results 11 issues of Basile Dura

## Description This PR proposes a first _trainable_ pipeline :tada: The pipeline here is a trainable entity qualifier, that can automatically set a custom `ent._.qualifiers` extension. Said extension can be...

## Description Add pipeline components to handle rule-based pseudonymisation. 1. Handle static rules, such as detection of phone numbers, mail addresses, SSN, etc 2. Handle contextual information, using `nlp.pipe(zip(docs, contexts),...

enhancement

## Description We might benefit from using a parse tree. Parsing is a most transverse task, and could provide an easy way to create baselines for many pipeline components (qualifiers,...

discussion

## Feature type New qualifier pipeline, overtaking and improving the `eds.family` component. ## Description As of today, EDS-NLP only equiped to detect family members. We should add detection of other...

enhancement

## Feature type Matcher pipeline to handle the single label/multiple subconcepts use-case. ## Description As discussed in #58, we would certainly benefit from having EDS-NLP handle the nitty-gritty detail of...

enhancement
discussion

## Feature type Modification of the pollution matcher to be more center-specific (eg AP-HP, Bordeaux, etc). ## Description Light modification of the `eds.pollution` pipeline to separate patterns between the main...

enhancement

## Description The way we've handled spaCy extensions in EDS-NLP has been erratic at best, with each pipeline declaring its own set of new extensions, cluttering [spaCy's `Underscore` object](https://spacy.io/usage/processing-pipelines/#custom-components-attributes). For...

discussion

_Minor_ typo: the `to` is repeated.

A most minor documentation fix.

**Is your feature request related to a problem? Please describe.** I have a use-case where I need to compute the eigendecomposition of symmetric tridiagonal matrices. **Describe the solution you'd like**...

enhancement