CloudFactory

Data Prep: What Data Scientists Wish You Knew

Webinar + Discussion

Optimize ML Data Prep with These Tips

If you’re considering a machine learning project, you probably know that you need data, and lots of it. And while many companies are swimming in volumes of data, that data is almost never ready for AI and ML projects. It must be prepared, which can include cleansing, annotation, and more.

CloudFactory’s VP of Client Success Paul Christianson and Infinia ML data scientist Ben Schneller discuss what data scientists wish you knew about preparing your data for AI projects.

Their conversation covered topics such as:

  • How much time your AI project should allocate to data prep and annotation
  • The importance of having a “data readiness” strategy
  • Operationalizing data prep and annotation to produce high quality training data at scale

WATCH THE WEBINARTell us about yourself

Paul Christianson

Paul ChristiansonPRESENTER

Paul Christianson helps CloudFactory clients dominate their markets by gaining a competitive edge in how they capture data to create amazing user experiences. Prior to CloudFactory, Paul worked on large-scale client software implementations at IBM. Paul is a graduate of the University of North Carolina at Chapel Hill.

Ben Schneller

Ben SchnellerPresenter

Ben Schneller is a data scientist at Infinia ML, which builds machine learning-powered applications that help businesses analyze their documents, manage their talent, and audit their AI systems. Ben holds a BS in Bioengineering and a MEng in Bioinformatics, both from the University of Illinois at Chicago.

About

CloudFactory

For over a decade, CloudFactory has powered quality data at scale. Its managed workforce processes pipelines of big data with high accuracy on virtually any platform, with the expertise and communication of a trained internal team. As a global leader in impact sourcing, CloudFactory creates economic and leadership opportunities for talented people in developing nations.

About

Infinia ML

Through proprietary technology and a world-class team, Infinia ML builds machine learning-powered applications that help businesses analyze their documents, manage their talent, and audit their AI systems. The company’s Chief Scientist, Lawrence Carin, Ph.D., is one of the world's most published machine learning experts. Together, the team has 31 patents, 11 books, 9 Ph.D.s and more than 600 published papers. Learn more at InfiniaML.com.