Meta, Accepted that he has downloaded a large data set with millions of pirate books last month via torrent. he had.
Meta downloaded a data set known as Libgen and containing millions of pirate books over torrent. Later, the evidence showed that Meta has downloaded at least 81.7 Terabyt data through Anna’s Archive. It was also reported that the company has downloaded 80.6 Terabytes of data from Libgen. It was an interesting development that brought this issue to the legal level. As a defense argument, the company said that they did not seed the 82 terabytes data they downloaded through Torrent, that is, they did not share it again to download others. Within the scope of the lawsuit filed, the company claimed that there was no evidence that the plaintiffs had chosen their books, and at this point, contradictory. Because Michael Clark, one of the company’s project management names, stated that the configuration settings they use were “privatized to be made in the least possible amount of possible amounts”. In other words, the company made a little Seed during the download process.
You may be interested in
In a statement supported by more than 8,500 people last year by signing the signature, the works written in artificial intelligence systems, which are strengthened from large language models such as commodity signed Llama, were criticized without permission and without payment. “These technologies mimic our language, stories, style and ideas. Millions of copyright books, articles, essays and poems are almost a food for artificial intelligence systems, they are seen as eternal dishes without bills ” The authors who say to the publishers of the companies that develop these systems He stated that they did not license and said that they were damaged.
The authors demanded that artificial intelligence and big language model developed these steps to take these steps:
1. If you want to use our properly protected materials in your productive artificial intelligence programs, get the bride leave first.
2. Pay compensation to authors for the past and ongoing uses of our work in your productive artificial intelligence programs.
3. Content provided by artificial intelligence systems, whether or not they violate the existing laws, if our works are used in the results of artificial intelligence, compensate the authors fairly.
This issue did not come to the agenda for the first time with the notification. Developed by OpenAI Chatgpt, “GPT”Is trained with a language model called and this language model is obtained from many places. It is not known exactly where these positions are, but the data is among the data according to the cases filed recently.torrentEven the information obtained through. Famous comedian and writer behind these cases Sarah Silvermanas well as writers Christopher Golden And Richard Kadrey took place. Three names via both chatgpt OpenAIBoth “Lama” through the big language model MetaHe sued the ya over the copyright violation.
The basis of the case opened in the OpenAI was to summarize the books of the authors when Chatgpt was commanded. The authors say that this violates copyrights. In a separate case against Meta, it is stated that the authors’ books are accessible in the data sets used in the training of the LLAM language model. Within the scope of both cases, the authors are for the artificial intelligence models of the companies that are protected with copyrights. “that they do not allow it to be used as a training material.“ He said that three names requested legal compensation and refund within the scope of the process.