openllmetry icon indicating copy to clipboard operation
openllmetry copied to clipboard

🚀 Feature: Log prompts while preserving user PII with Presidio

Open clemlesne opened this issue 1 year ago • 3 comments

Which component is this feature for?

All instrumentations backend

🔖 Feature description

Remove PII from prompts before sending it, with Presidio.

🎤 Why is this feature needed?

Today, either we log prompts to monitoring backend or nothing. Monitor those details are important for prompt quality and A/B testing. But, this includes sending PII to the monitoring backend, which is in most companies not conceived to handle this type of data (GDPR if you hear me).

clemlesne avatar Apr 07 '24 12:04 clemlesne

Thanks @clemlesne that's a great idea! We can maybe add it at the SDK level on the span processor. I wonder if there's any latency or performance impact.

nirga avatar Apr 07 '24 19:04 nirga

Presidio works very badly on languages other than english. just a heads up. we ended up using another PaaS MSFT service (Cogservices). But perhaps it would make sense to have a plugin interface process PII before loggin it.

romeroej-inch avatar Apr 18 '24 18:04 romeroej-inch

Or potentially this might be something that should be done on the ingestion side? I worry about creating a significant overhead on production devices.

nirga avatar Apr 18 '24 18:04 nirga