Source: https://www.washingtonpost.com/technology/interactive/2025/openai-training-data-sora/
From the post:
>OpenAI’s video generation tool, Sora, can create high-definition clips of just about anything you could ask for — a breakthrough in artificial intelligence expected to transform the entertainment industry.
But whose data OpenAI used to create its groundbreaking system is a mystery.
With ChatGPT, OpenAI helped popularize the now-standard industry practice of building more capable AI tools by scraping vast quantities of text from the web without consent.
With Sora, launched in December, OpenAI staff said they built a pioneering video generator by taking a similar approach. They developed ways to feed the system more online video — in more varied formats — including vertical videos and longer, higher-resolution clips.
Source: https://www.washingtonpost.com/technology/interactive/2025/openai-training-data-sora/
From the post:
>>OpenAI’s video generation tool, Sora, can create high-definition clips of just about anything you could ask for — a breakthrough in artificial intelligence expected to transform the entertainment industry.
But whose data OpenAI used to create its groundbreaking system is a mystery.
With ChatGPT, OpenAI helped popularize the now-standard industry practice of building more capable AI tools by scraping vast quantities of text from the web without consent.
With Sora, launched in December, OpenAI staff said they built a pioneering video generator by taking a similar approach. They developed ways to feed the system more online video — in more varied formats — including vertical videos and longer, higher-resolution clips.
(post is archived)