SuperAnnotate helps companies manage their AI datasets

High-quality data may be the key to high-quality AI. With studies finding that dataset curation, rather than size, is what really affects an AI model’s performance, it’s unsurprising that there’s a growing emphasis on dataset management practices. According to some surveys, AI researchers today spend much of their time on data prep and organization tasks.

Brothers Vahan Petrosyan and Tigran Petrosyan felt the pain of having to manage lots of data while training algorithms in college. Vahan went so far as to create a data management tool during his PhD research on image segmentation.

A few years later, Vahan realized that developers — and even corporations — would happily pay for similar tooling. So the brothers founded a company, SuperAnnotate, to build it.

“During the explosion of innovation in 2023 surrounding models and multimodal AI, the need for high-quality datasets became more stringent, with each organization having multiple use cases requiring specialized data,” Vahan said in a statement. “We saw an opportunity to build an easy-to-use, low-code platform, like a Swiss Army Knife for modern AI training data.”

SuperAnnotate, whose clients include Databricks and Canva, helps users create and keep track of large AI training datasets. The startup initially focused on labeling software, but now provides tools for fine-tuning, iterating, and evaluating datasets.

With SuperAnnotate’s platform, users can connect data from local sources and the cloud to create data projects on which they can collaborate with teammates. From a dashboard, users can compare the performance of models by the data that was used to train them, and then deploy those models to various environments once they’re ready.

SuperAnnotate also provides companies access to a marketplace of crowd-sourced workers for data annotation tasks. Annotations are usually pieces of text that label the meaning or parts of the data that models train on. They serve as guideposts for models, “teaching” them to distinguish things, places and ideas.

To be frank, there are several Reddit threads about SuperAnnotate’s treatment of the data annotators it uses, and they aren’t flattering. Annotators complain about communication issues, unclear expectations, and low pay.

For its part, SuperAnnotate claims it pays fair market rates and that its demands on annotators aren’t outside the norm for the industry. We’ve asked the company to provide more detailed information about its practices and will update this piece if we hear back.

Edit: A few hours after this story was published, SuperAnnotate sent this statement via email: “About eight months ago, during a period of rapid scaling, we encountered challenges in maintaining clear communication with some annotators working on our projects. As is sometimes the case during rapid growth, a few process gaps emerged. We took this feedback seriously and have since made improvements to both how annotators interact with the platform and communication processes.”

There are several competitors in the AI data management space, including startups like Scale AI, Weka, and Dataloop. San Francisco-based SuperAnnotate has managed to hold its own, however, recently raising $36 million in a Series B round led by Socium Ventures, with participation from Nvidia, Databricks Ventures, and Play Time Ventures.

The fresh capital, which brings SuperAnnotate’s total raised to just over $53 million, will be used to expand its current team of around 100, fund product R&D, and grow SuperAnnotate’s customer base of roughly 100 companies.

“We aim to build a platform capable of fully adapting to enterprises’ evolving needs and offering extensive customization in data fine-tuning,” Vahan said.


Topics

AI, data labeling tools, Enterprise, Fundraising, Socium Ventures, Startups, SuperAnnotate

Kyle Wiggers

Senior Reporter, Enterprise

Kyle Wiggers is a senior reporter at TechCrunch with a special interest in artificial intelligence. His writing has appeared in VentureBeat and Digital Trends, as well as a range of gadget blogs including Android Police, Android Authority, Droid-Life, and XDA-Developers. He lives in Brooklyn with his partner, a piano educator, and occasionally dabbles in piano himself, if mostly unsuccessfully.
