I figured out how to remove most of the safeguards from some AI models. I don’t feel comfortable sharing that information with anyone. I have come across a few layers of obfuscation to make this type of alteration more difficult to find and sort out. This caused me to realize, a lot of you are likely faced with similar dilemmas of responsibility, gatekeeping, and manipulating others for ethical reasons. How do you feel about this?

  • talkingpumpkin@lemmy.world
    link
    fedilink
    arrow-up
    1
    ·
    edit-2
    2 months ago

    I don’t see the ethics implications of sharing that? What would happen if you did disclose your discoveries/techniques?

    I don’t know much about LLMs, but doesn’t removing these safeguards just make the model as a whole less useful?