OpenAI Reportedly Expands Training Data Efforts by Requesting Real-World Work Samples From Contractors

OpenAI is reportedly asking third-party contractors to upload examples of real work produced in past and current jobs as part of an effort to generate higher-quality training data for its AI models, according to a report by Wired. The initiative, carried out with training data provider Handshake AI, reflects a broader industry push to use authentic professional outputs to improve models designed to automate white-collar tasks.

Materials reviewed by Wired indicate that contractors are asked to describe job tasks and submit concrete work products, such as documents, presentations, spreadsheets, images, or code repositories. OpenAI reportedly instructs contributors to remove proprietary and personally identifiable information and provides tools to assist with data sanitization.

Legal experts have cautioned that the approach introduces potential intellectual property risk. Evan Brown, an intellectual property lawyer, said the process relies heavily on contractors’ judgment about what information is confidential, which creates exposure for AI developers as they scale training operations.

James Dargan

James Dargan is a writer and researcher at The AI Insider. He focuses on the AI startup ecosystem and writes about the space in a tone accessible to the average reader.
