Wikipedia has a solution for the deluge of AI training bots hogging its servers


You're not the only one who turns to Wikipedia for quick facts. Lately, a deluge of AI bots training on Wikipedia articles has put enormous strain on the organization's servers.

To curb the influx of "non-human traffic" scraping the site for training data, Wikipedia is taking a proactive approach: serving up its data directly to AI developers.

On Wednesday, the Wikimedia Foundation announced a partnership with Google-owned Kaggle to release a beta dataset "featuring structured Wikipedia content in English and French." The company said the dataset, uploaded on April 15, "simplifies access to clean, pre-parsed arti...

Source: https://mashable.com/article/wikipedia-ai-training-dataset-offsets-bots-overloading-servers
