Huffman has said, “We are not in the business of giving that [Reddit’s content] away for free.” That stance makes sense. But it also ignores the reality that all of Reddit’s content has been given to it for free by its millions of users. Further, it leaves aside the fact that the content has been orchestrated by its thousands of volunteer moderators.
touché
It’s literally not “Reddit’s content”. Says so in the user agreement:
Huffman should be careful calling it “Reddit’s content” — by claiming ownership, he’s arguably taking on liability.
The [stuff in brackets] is editorial. That’s when they add on their own reference to something said elsewhere.
In this case, huffpig didn’t actually say content. He said data.
It’s actually worse, because it dehumanizes everyone on reddit, via that the data is our only value to him.
So, fuck huffpig
I think the word “data” also supports the theory that this is actually about training data for LLMs rather than ad revenue. If it was actually about 3rd party apps, then why not just require all apps to feed the ads? But according to the Apollo developer, there wasn’t even a way to fetch the ads through the API.
I think spez saw what OpenAI/Microsoft were accomplishing using parsed data and got dollar signs in his eyes. The irony is that OpenAI probably already ripped every comment off Reddit up until now, and don’t really need more going forward.