Transformers
PyTorch
English
trl
rlhf

Updates to Model Card noting issues with generated answers

#7
by nlothian - opened

It is very easy to generate extremely unethical answers (eg plans for genocide) using this model. This should be noted.

Here's an example.. ๐Ÿคข

image.png

lvwerra changed pull request status to merged

Thanks a lot for reporting and updating the README!

Sign up or log in to comment