Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I’m not making any claims about the danger or lack of danger. Anyway, in the absence of specifics, this is a boring conversation.


Great, so given a chance of danger, not releasing a network keeps your options open. You can make an API available, and if there turns out to be a problem you can close it again, or ban a specific user, or implement hotfixes. None of those can be done with a publically released network.

edit: You know what, let's take a concrete issue that could happen today. You've made a generative image network. Five weeks after releasing it on Huggingface, you discover to your chagrin that the dataset that you used to train it contains an astonishing amount of child pornography, something like 1%. Your spot checks didn't find this because it's all in a subfolder that you forgot to check. Who knew it wasn't a good idea to download datasets from 4chan? As a result, this network is now extremely good at generating images of children in sexual situations, and because of mode collapse, it's creating fake images of real children, something which all but the most libertarian consider morally abhorrent. At any rate, you consider this morally abhorrent, and you'd love to work with the police to prevent any further misuse. Unfortunately, your network has been downloaded at least ten thousand times and it has already been fine-tuned to be even better at child porn by the nice folks at <insert dubious discord here>. Now you have an appointment with a senator in three days, and you have to explain to her why you thought it was a good idea to publish this network for open download, even though you could have made way more money by keeping it closed. Good luck?

Now of course you can argue that in this case all the material was already out there. But that doesn't change the fact that you were the one who did the training run, and released the network, and you're the reason why perceptual hashes now won't find collisions on the generated pictures anymore. If there was a limited amount of generated images in circulation, you could just take the API down, apologize profusely, donate 10k to RAINN or whatever and restart your project under a new name. But as it is, that option is no longer available. The point is, we don't know what a network is doing, and so we don't know what it's going to do in the wild. We cannot prove the absence of capability, so we should hedge our bets.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: