Survey of Industry Leaders Shows Synthetic Data is Essential to Building More Capable AI Models

Executives believe that synthetic data is key to more efficiently and cost-effectively creating labeled training data.

Synthesis AI, a pioneer in synthetic data technologies, today released a new report in conjunction with Vanson Bourne, a global technology market research firm, highlighting how 89% of technology executives view synthetic data as a key emerging technology to creating more capable models, cutting the cost of data labeling, improving access to data, and reducing the time it takes to build AI models.

Marketing Technology News:MarTech Interview with David Grossman, CMO at Backstage

Industry leaders believe that, on average, 59% of their industry will utilize synthetic data in five years, either independently or in combination with ‘real-world’ data. This suggests that synthetic data will play an important role in the development of next-generation AI models.

The survey report, Adapt or Be Left Behind: 89 Percent of Tech Execs See Synthetic Data As a Key to Transforming Their Industry, is based on a survey of 100 senior technology executives on their perceptions of synthetic data, potential benefits and barriers of implementation, and what industry leaders think it will take to continue driving the adoption of synthetic data.

Synthetic data refers to computer-generated images and simulations used to train computer vision models. Synthetic data is emerging to be an essential element in building accurate and capable AI models, as it provides developers with vast amounts of perfectly labeled data on-demand.

“AI is driven by the amount, quality, and speed of training data. Synthetic training data is already making waves in several industries including autonomous vehicles and robotics. There is a critical need for more education on the underlying technology and benefits to drive broader industry adoption,” said Yashar Behzadi, CEO and founder of Synthesis AI. “Building core synthetic data capability will be the key to whether or not some companies adapt or fall behind in the future. Synthetic data has the potential to deliver perfectly labeled data on-demand, potentially cutting millions of dollars and months of work related to the current process of collecting, preparing, and manually labeling training data.”

Marketing Technology News:Vonage Supports Expanding Small Business Market through the Pandemic with Future-Proof Cloud…

Andy Thurai, Vice President and Principal Analyst at Constellation Research, said, “Today’s AI models are limited by real-world data for a couple of reasons – collecting real-world data is very expensive, and most companies don’t have the time and resources to collect the volume of data that is required to train models that the tech giants do. The survey results indicate synthetic data is a new market where there is a knowledge gap that needs to be addressed. A blend of the real world and synthetic data will provide the best combination that is impossible to match just by raw data collection. If a model can handle all possible scenarios based on assumptions, then it is ready for real-world scenarios.”

Synthetic data adoption is increasing, but a key to further adoption is enhanced understanding of this emerging technology across the board, all the way from the C-suite to machine learning engineers. Only half (51%) of the respondents were knowledgeable, state-of-the-art synthetic data approaches indicating a critical gap.

Respondents who were aware of recent advances in synthetic data expressed confidence in the technology’s ability to address key issues with current “real-world” data approaches. This indicates that if the knowledge gap is reduced, many more will likely see and understand synthetic data’s benefits.

Prominent barriers to synthetic data adoption are organizational knowledge and a slow buy-in from colleagues.

Other barriers to adoption included:

  • Concerns that models built with synthetic data are not as good as ‘real-world’ data (46%);
  • Difficulty in creating high-quality synthetic data for complex systems (45%);
  • The costs of integration and implementation (42%).

Recent advances in synthetic data are addressing the key identified barriers and the technology is predicted to be a significant enabler of the next generation of AI models.

Marketing Technology News:WPP Leaders Recognised for Driving Change in Gender Diversity

Picture of PRNewswire

PRNewswire

PR Newswire, a Cision company, is the premier global provider of multimedia platforms and distribution that marketers, corporate communicators, sustainability officers, public affairs and investor relations officers leverage to engage key audiences. Having pioneered the commercial news distribution industry over 60 years ago, PR Newswire today provides end-to- end solutions to produce, optimize and target content -- and then distribute and measure results. Combining the world's largest multi-channel, multi-cultural content distribution and optimization network with comprehensive workflow tools and platforms, PR Newswire powers the stories of organizations around the world. PR Newswire serves tens of thousands of clients from offices in the Americas, Europe, Middle East, Africa and Asia-Pacific regions.

You Might Also Like