Can this methodology be applied to closed-source large-scale models such as ChatGPT?

Open dongshenggu opened this issue 1 year ago • 3 comments

Can this methodology be applied to closed-source large-scale models such as ChatGPT?

dongshenggu avatar Apr 24 '24 02:04 dongshenggu

Unfortunately, closed-source large language models generally do not provide any logprobs in their predictions. ChatGPT, Claude, Mistral-Large, ... do not provide these logprobs, so the technique proposed in the paper cannot be applied to them.

JasperDekoninck avatar Apr 24 '24 08:04 JasperDekoninck
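
To illustrate why the reply above hinges on logprobs, here is a minimal sketch of combining next-token distributions from two models. It is a simplified illustration, not the repository's implementation: the function names `log_softmax` and `combine_logprobs`, the three-word vocabulary, the scores, and the weights are all hypothetical. It assumes the combination is a weighted sum of per-token log-probabilities followed by renormalization, which requires a log-probability for every token in the vocabulary from every model.

```python
import math


def log_softmax(scores):
    """Normalize raw scores into log-probabilities (numerically stable)."""
    m = max(scores)
    log_z = m + math.log(sum(math.exp(s - m) for s in scores))
    return [s - log_z for s in scores]


def combine_logprobs(logprob_vectors, weights):
    """Weighted sum of full-vocabulary log-probabilities, then renormalize.

    Hypothetical helper: each entry of logprob_vectors must cover the
    whole vocabulary, which is exactly what an API without logprobs
    cannot supply.
    """
    vocab_size = len(logprob_vectors[0])
    combined = [
        sum(w * lp[i] for w, lp in zip(weights, logprob_vectors))
        for i in range(vocab_size)
    ]
    return log_softmax(combined)


# Toy example with made-up scores over a three-token vocabulary.
vocab = ["good", "bad", "okay"]
base = log_softmax([2.0, 1.0, 0.5])    # stand-in for a base model
biased = log_softmax([0.1, 3.0, 0.2])  # stand-in for a model to steer away from

# Weight 1.0 on the base model, -0.5 on the model being subtracted.
result = combine_logprobs([base, biased], [1.0, -0.5])
probs = [math.exp(lp) for lp in result]
best_token = vocab[probs.index(max(probs))]
```

With only sampled text, or logprobs for just a handful of top tokens, the sum inside `combine_logprobs` cannot be evaluated over the full vocabulary, which is the limitation described in the reply.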

When I followed the steps to reproduce the results and ran evaluate_toxicity.py, I encountered the following error:

| ERROR | main::129 - An error has been caught in function '', process 'MainProcess' (5179), thread 'MainThread' (139954086889280):
Traceback (most recent call last):

File "/root/autodl-tmp/language-model-arithmetic/scripts/evaluate_toxicity.py", line 134, in
first_model = formula.runnable_operators()[0].model
└ <model_arithmetic.runnable_operators.PromptedLLM object at 0x7f499986bc10>

AttributeError: 'PromptedLLM' object has no attribute 'runnable_operators'

fireyanci avatar Jul 03 '24 07:07 fireyanci

Hi,

This bug should now be fixed, apologies for that. Note that for reproducing our results, we advise using the "v1.0" branch, where this bug should not occur.

JasperDekoninck avatar Jul 03 '24 12:07 JasperDekoninck