1344

Of your peers have already listened to this podcast

25:30 Minutes

The most insightful time you'll spend today!

Podcast

Cloud Dataprep: The Easiest Way to Cleanse Data

Anyone who has worked with data has felt the pain of data preparation. It’s a struggle—if you are working with the wrong tools—to massage data or cleanse data of anomalies, outliers, and just plain old dirty data.

Eric Anderson, a Product Manager at Google working on Cloud Dataprep and a Harvard Business School alumni, understands how time-consuming it is for data scientists to clean up and ready large data sets for big data and machine learning initiatives. He has first-hand experience of how, in the real world, one of the bumps that slows down big data or machine learning projects is bad data.

He shares how Google Dataprep helps summarize, transform, visualize and clean up data. It focuses on that initial step, for any kind of data work, which is to get your data in position, in the right structure, joined with proper data sets so that it can be analysed.

If, for example, you have address data that is in a single string and you want to parse out the states into a new column, Dataprep could you help do that easily. It’ll even look at your data and alert you that you have union territories—not states—and it’ll ask you what you want to do.

Find out more about Google Dataprep. Listen in now.

More Relevant Stories for Your Company

How-to

How Can an IT Manager Get into the Machine Learning Game?

It’s a rare technology forecast that doesn’t include machine learning and AI. But for IT managers these technologies can seem like a world away. This can be worrying for many IT managers who feel like they are being left out of the future. It needn’t be this way. One of

Case Study

Google Data Studio Makes Reporting a Breeze For Genesys

As a customer experience platform, Genesys helps clients nurture great relationships with customers, and creates seamless user journeys across all channels and devices. Its technologies are used by more than 10,000 companies in over 100 countries, making Genesys the top provider of its kind for 25 years in a row.

Case Study

AgroStar: Small farms in India getting big help from the cloud

AgroStar has launched a cloud-based mobile app that is helping to boost crop yields and encourage best practices for small farmers in India. Launched as an on-premises ecommerce platform selling farm tools in 2008, the firm turned to Google Cloud Platform (GCP) to expand its offering. It now uses cloud-based analytics and is

How-to

How Modern is Your Data Warehouse? Find Out With This Test

As more and more businesses turn to advanced data analytics to help them make smarter decisions, run real-time analytics, and improve business operations, an increasing number are modernizing their data warehouses to make it all possible. For many businesses, knowing how to modernize means understanding where their data warehouse sits

SHOW MORE STORIES