My read on it was that the terms of use of content services can prevent that con...

CaptainFever · on June 21, 2024

Related case: https://en.m.wikipedia.org/wiki/HiQ_Labs_v._LinkedIn

wrs · on June 21, 2024

Hm, I thought the overall goal was that you would train LLMs on that data, but the owners of the data would be compensated when output was generated that was influenced by it.

Somehow we have to be able to train LLMs on high-quality information, without having the resulting generative capability destroy the economic support for creating that information in the first place.