, , , , , ,

Datasaur lets you build a model automatically from a set of labels

Long before people were talking about ChatGPT and generative AI, companies like Datasaur were dealing with the nuts and bolts of building machine learning models, helping label things to train the model. As AI has taken off, this kind of capability has become even more important.

In order to bring model building to more companies without a data science specialist, Datasaur announced the ability to create a model directly from the label data, putting model creation in reach of a much less technical audience. It also announced a $4 million seed extension that closed last December.

Company founder Ivan Lee says the recent surge in AI interest has been great for the company, and actually plays well into the startup’s strategy. “What Datasaur has always strived to be is the best place to gather the training data that you need to feed into your models, whether they are LLMs, or traditional NER models, sentiment analysis or what have you,” Lee told TechCrunch.

“We are just the best interface for these non-technical users to come in and label that data,” he said.

The rise of LLMs is helping raise awareness in general about how AI can help in a business context, but he says that most companies are still very much in the exploratory stage, and they still need products like Datasaur to build models. Lee says one of his goals from the start has been to democratize AI, particularly around natural language processing, and the new model building feature should put AI in reach of more companies, even those without a specialized expertise.

“And this feature is one I’m particularly excited about because it allows teams without data scientists, without engineers to just markup and label this data however they see fit, and it’ll just automatically train a model for them,” Lee said.

Lee sees this as a way to move beyond the initial target market of data scientists. “Now we’re going to open it up so construction companies, law firms, marketing companies, who may not have a data engineering background, but can still build NLP models [based on their training data].”

He says he has been able to limit the amount of venture investment he has taken – the previous seed was a modest $3.9 million in 2020 – because he operates leanly. His engineering team is mostly in Indonesia, and while he expects to hire, he takes pride in operating the company in an efficient manner.

“My philosophy has always been profitability, grow in a scalable manner, never grow at all costs,” Lee said. That means he considers every hire and the impact it will have on the business.

By having a remote, cross-cultural workforce, employees can learn from each other and that brings a diversity to the company by its nature. “There is a significant difference in the workplace culture between the U.S. and how things operate in Indonesia. And so one thing is we’ve had to be intentional about capturing the best of both worlds,” he said. That could mean encouraging Indonesian colleagues to speak up or push back on what a manager is saying, which is something they are loath to do culturally. “We’ve been very proactive about encouraging that,” he said.

But he says there’s a lot U.S. employees can learn about how things operate in Asia, as well, like respect for your colleagues and this culture of putting the team first, and he has had to help the teams navigate these cultural differences.

The $4 million investment was led by Initialized Capital with participation from HNVR, Gold House Ventures and TenOneTen. The company has raised a total of $7.9 million.

https://techcrunch.com/2023/08/03/datasaur-lets-you-build-a-model-automatically-from-a-set-of-labels/


January 2025
M T W T F S S
 12345
6789101112
13141516171819
20212223242526
2728293031  

About Us

Welcome to encircle News! We are a cutting-edge technology news company that is dedicated to bringing you the latest and greatest in everything tech. From automobiles to drones, software to hardware, we’ve got you covered.

At encircle News, we believe that technology is more than just a tool, it’s a way of life. And we’re here to help you stay on top of all the latest trends and developments in this ever-evolving field. We know that technology is constantly changing, and that can be overwhelming, but we’re here to make it easy for you to keep up.

We’re a team of tech enthusiasts who are passionate about everything tech and love to share our knowledge with others. We believe that technology should be accessible to everyone, and we’re here to make sure it is. Our mission is to provide you with fun, engaging, and informative content that helps you to understand and embrace the latest technologies.

From the newest cars on the road to the latest drones taking to the skies, we’ve got you covered. We also dive deep into the world of software and hardware, bringing you the latest updates on everything from operating systems to processors.

So whether you’re a tech enthusiast, a business professional, or just someone who wants to stay up-to-date on the latest advancements in technology, encircle News is the place for you. Join us on this exciting journey and be a part of shaping the future.

Podcasts

TWiT 1013: Calamari in Crisis – Touching the Sun, Fake Spotify Artists, Banished Words This Week in Tech (Audio)

Touching the Sun, Fake Spotify Artists, Banished Words AI Needs So Much Power, It's Making Yours Worse How many billions Big Tech spent on AI data centers in 2024 NASA Spacecraft 'Touches Sun' In Defining Moment For Humankind Elon Musk Calls Out NASA's Moon Ambitions: 'We're Going Straight to Mars' Elon Musk and the right's war on Wikipedia Trump Asks Supreme Court to Pause Law Threatening TikTok Ban US Treasury says Chinese hackers stole documents in 'major incident' Judge blocks parts of California bid to protect kids from social media Finland probes Russian shadow fleet oil tanker after cable-cutting incident US appeals court blocks Biden administration effort to restore net-neutrality rules The Ghosts in the Machine (fake spotify artists) Massive VW Data Leak Exposed 800,000 EV Owners' Movements, From Homes To Brothels Banished Words | Lake Superior State University 2025 Public Domain Day 2025 Happy Birthday, Bitcoin! The top cryptocurrency is old enough to drive End of the lines? QR-style codes could replace barcodes 'within two years' Host: Leo Laporte Guests: Richard Campbell, Anthony Ha, and Stacey Higginbotham Download or subscribe to This Week in Tech at https://twit.tv/shows/this-week-in-tech Get episodes ad-free with Club TWiT at https://twit.tv/clubtwit Sponsors: ZipRecruiter.com/Twit joindeleteme.com/twit promo code TWIT canary.tools/twit – use code: TWIT zscaler.com/security
  1. TWiT 1013: Calamari in Crisis – Touching the Sun, Fake Spotify Artists, Banished Words
  2. TWiT 1012: Our Best Of 2024 – The Best Moments From TWiT's 2024
  3. TWiT 1011: The Year in Review – A Look at the Top Stories of 2024
  4. TWiT 1010: The Densest State in the US – TikTok Ban, Drones Over Jersey, GM Quits Robotaxis
  5. TWiT 1009: Andy Giveth & Bill Taketh Away – Trump's Tech Titans, Crypto Boom, TikTok's US Ban, Intel CEO Exits