BrendanChambersBourgeois comments

Repositories
Issues
Comments

Results 5 comments of


                                            BrendanChambersBourgeois

Updated it to just do defang. ``` └─$ python evals/cli/oaieval.py gpt-3.5-turbo defang_domain_check.dev.v0 ... [2023-05-08 09:36:30,998] [oaieval.py:147] Final report: [2023-05-08 09:36:30,998] [oaieval.py:149] accuracy: 0.703125 ```

Add Defang_Check eval

Added checks for is defang or fanged. View evals in JSON ### Eval ```jsonl {"input": [{"role": "system", "content": "Respond with only a 1 or 0 to signify if the user's...

Add Defang_Check eval

More information on defanging: (ChatGPT-4 output ^_^) The main issue with GPT-3 and GPT-4 is that it often does not fully defang things. e.g. subdomain.example[.]com is not defanged as subdomain.example...

Add Defang_Check eval

Also please let me know if being more generic is better for the model... I'm not sure if this would be better for defang_domain_check.dev.v0. View evals in JSON **more generic**...

BrendanChambersBourgeois

Changed tonality

Add Defang_Check eval

Add Defang_Check eval

Add Defang_Check eval

Add Defang_Check eval