Data are becoming the new raw material of business
The Economist

Data Science in 30 Minutes: Why Big Data Needs Thick Data with Tricia Wang

This FREE webinar will be on June 26th at 5:30 PM ET. Register below now; space is limited!

Join The Data Incubator and Sudden Compass co-founder Tricia Wang on June 26th at 5:30 PM ET, LIVE online, for the next installment of our free online webinar series, Data Science in 30 Minutes: Why Big Data Needs Thick Data.

Why do so many companies make bad decisions, even with access to unprecedented amounts of data? Tricia has the answer: companies are implementing “big data” without what she calls the secret missing ingredient: “thick data,” the precious, unquantifiable insights from actual people that they need to make the right business decisions and thrive in the unknown. She’ll share stories and lessons from how her company, Sudden Compass, advises and teaches organizations to unlock insights from big data and turn their big data projects from optimizing the bottom line to driving growth.

Data Science in 30 Minutes: Building Data Science Capabilities That Scale

This FREE webinar took place on May 17th, 2018. Sign up below for the full video! CSO William Merchan joined The Data Incubator for the May installment of our free online webinar series, Data Science in 30 Minutes: Building Data Science Capabilities That Scale.

Data scientists and machine learning engineers saw the highest job growth of any role last year, yet few companies have successfully turned their aggressive hiring into profitable, scalable data science capabilities. In this session, CSO William Merchan shares lessons learned from building a platform that supports collaborative data science for a variety of clients, from startups to Fortune 500 companies. Learn about the technology gaps, roadblocks to innovation and efficiency, and talent retention challenges that have proven detrimental to data science success in an enterprise environment, and how to mitigate them.

Data Science in 30 Minutes: Alan Schwarz, Former NYTimes Journalist, on Numbers-Based Journalism

Alan Schwarz, former N.Y. Times journalist, joined The Data Incubator for the February 2018 installment of our free online webinar series, Data Science in 30 Minutes: Numbers-Based Journalism.

Sign up below to get access to the video of this webinar for free!

Alan Schwarz, former N.Y. Times investigative reporter and Pulitzer finalist, discussed numbers-based journalism that shook industries from the National Football League to Big Pharma. Alan used data analysis to expose the NFL’s cover-up of concussions as well as issues in child psychiatry.

Data Science in 30 Minutes: Kirk Borne – A Fortuitous Career in Data Science

Booz Allen Hamilton’s Kirk Borne joined The Data Incubator in August for our FREE monthly webinar series, Data Science in 30 Minutes!

Kirk Borne took us on a journey through his career in science and technology, explaining how the industry, and he himself, have evolved over the last four decades. From skipping lunches in high school to a systematic Twitter obsession, Kirk shed light on his road to success in the data science industry.

Data Science in 30 Minutes: Scikit-Learn with Core-Contributor Andreas Mueller

scikit-learn’s Andreas Mueller joined The Data Incubator in December 2017 for our FREE monthly webinar series, Data Science in 30 Minutes!

We talked about everything new in 0.19, which was released in July of this year, and the plans for 0.20, which will be released early next year. Highlights are multiple-metric grid search, faster t-SNE, and better handling of categorical and mixed data.

Data Science in 30 Minutes: A Conversation with Gregory Piatetsky-Shapiro, President of KDnuggets

KDnuggets’ Gregory Piatetsky-Shapiro, Ph.D., joined The Data Incubator in January for the first 2018 installment of our free online webinar series, Data Science in 30 Minutes! Gregory discussed his career, from data mining to data science, and examined current trends in the field.

From Data Mining to Knowledge Discovery to Data Science: Gregory Piatetsky-Shapiro talked about his pioneering career in data science, including founding KDnuggets and co-founding the KDD Conferences and ACM SIGKDD, and examined current trends in the field, including data science automation, citizen data scientists, and the implications of AI.

Data Science in 30 Minutes: Infrastructure for Usable Machine Learning with Spark Creator and Stanford Professor, Matei Zaharia

Databricks co-founder Matei Zaharia, Ph.D., joined The Data Incubator for the April 2018 installment of our FREE monthly webinar series, Data Science in 30 Minutes: Infrastructure for Usable Machine Learning.

Despite incredible recent advances in machine learning, building machine learning applications remains prohibitively time-consuming and expensive for all but the best-trained, best-funded engineering teams. This expense usually comes not from a need for new and improved statistical models but instead from a lack of systems and tools for supporting end-to-end machine learning application development, from data preparation and labeling to productionization and monitoring. In the Stanford DAWN project, we are developing a set of tools to make these processes easier, from weak supervision approaches to dramatically reduce the need for labeled data, to query-specific model specialization to reduce serving cost, and end-to-end ML systems that encapsulate a complete task and greatly simplify the interface to the user.

Sign up to receive the video of this episode of Data Science in 30 Minutes: Infrastructure for Usable Machine Learning with Matei Zaharia

About the speakers:

Matei Zaharia, PhD, is an assistant professor of computer science at Stanford and Chief Technologist and co-founder of Databricks. His research interests broadly span data-intensive systems, including distributed computing and systems for machine learning. In his past research, Matei developed widely used open source software including the Apache Spark computing engine, Apache Mesos cluster manager, and Alluxio storage system. His research was recognized through the 2014 ACM Doctoral Dissertation Award and VMware Systems Research Award.

Michael Li founded The Data Incubator, a New York-based training program that turns talented PhDs from academia into workplace-ready data scientists and quants. The program is free to Fellows; employers engage with the Incubator as hiring partners.

Previously, he worked as a data scientist (Foursquare), Wall Street quant (D.E. Shaw, J.P. Morgan), and a rocket scientist (NASA). He completed his PhD at Princeton as a Hertz fellow and read Part III Maths at Cambridge as a Marshall Scholar. At Foursquare, Michael discovered that his favorite part of the job was teaching and mentoring smart people about data science. He decided to build a startup to focus on what he really loves.

Michael lives in New York, where he enjoys the opera, rock climbing, and attending geeky data science events.