DeepSeek releases new image model family

spaduf@slrpnk.net · 4 个月前

DeepSeek releases new image model family

lnxtx (xe/xem/xyr)@feddit.nl · 4 个月前

What happened in 1989?

Jesus@lemmy.world · 4 个月前

Phoenixz@lemmy.ca · 4 个月前

Question: as i understood it so far, this thing is open source and so is the dataset.

With that, why would it still obey Chinese censorship?

thedarkfly@feddit.nl · 4 个月前

Even though it’s magnitudes lower than comparable models, Deepseek still cost millions to train. Unless someone’s willing to invest this just to retrain it from scratch, you’re left with the alignment of its trainers.

Phoenixz@lemmy.ca · 4 个月前

Good point.

Is the training set malleable, though? Could you give it some additional rules to basically sidestep this?

thedarkfly@feddit.nl · 4 个月前

Yeah, I guess you could realign it without retraining the whole thing! Dunno what would be the cost though, sometimes this is done with a cohort of human trainers 😅

Phoenixz@lemmy.ca · 4 个月前

I feel like we’re talking about a guard dog now…

Jackinopolis@sh.itjust.works · 4 个月前

It’s baked into the training. It’s not a simple thing to take it out. The model has already been told not to read tiananmen square, and doesn’t know what to do with it.

surewhynotlem@lemmy.world · 4 个月前

Now I’ll never finish that history assignment…

DeepSeek releases new image model family

DeepSeek releases new image model family

Viral AI company DeepSeek releases new image model family | TechCrunch