Data are becoming the new raw material of business
The Economist

Data Science in 30 Minutes: Data Privacy and Big Data Ethics with @data_nerd, Carla Gentry

This FREE webinar will take place LIVE online on March 21st at 5:30PM ET. Register below now, space is limited!

Join The Data Incubator and Carla Gentry, data science expert and influencer, for the next installment of our free online webinar series, Data Science in 30 Minutes: Data Privacy and Big Data Ethics.

Ethics and transparency have to go hand in hand with data scientist and business – remember before you make promises you can’t keep, machine learning, AL, NLP, etc… all require good data, communication within the team creating or designing, system compatibility, solid logical programming and MATH… It’s not just a cool buzzword and something to add to your resume or website to be deemed relevant. Bias, whether implied or intentional, affects lives, knowledge of data is important now more than ever.

 

Continue reading


Data Science in 30 Minutes: Infonomics, The New Economics of Information with Gartner’s Doug Laney

This FREE webinar will take place LIVE online on February 20th at 5:30PM ET. Register below now, space is limited!

Join The Data Incubator and Doug Laney, Senior Analyst and Advisor with Gartner‘s Chief Data Officer research group for the next installment of our free online webinar series, Data Science in 30 Minutes: Infonomics, The New Economics of Information.

Doug will share an overview of his research on information value and highlights from his new book, “Infonomics: How to Monetize, Manage, and Measure Information for Competitive Advantage.” We will explore the origins of this concept, along with why and how organizations should treat information as an actual corporate asset. We will also discuss the specifics of how data and analytics leaders such as chief data officers (CDOs), chief data scientists, enterprise architects, CIOs, and even CFOs can understand and take advantage of information’s unique economic properties to help transform their organizations. Doug will share his methods for applying asset management best-practices to information, and how to monetize information, including real-world examples of how companies and government agencies have monetized their (and others’!) information. And we will conclude with Gartner’s information valuation models and how some organizations have identified and generated millions of dollars of value by applying them.
Continue reading


Data Science in 30 Minutes: Examining Machine Learning Trends with Cloudera Research Engineer, Shioulin Sam

This FREE webinar will take place LIVE online on January 23rd at 5:30PM ET. Register below now, space is limited!

Join The Data Incubator and Shioulin Sam, Research Scientist at Cloudera Fast Forward Labs for the next installment of our free online webinar series, Data Science in 30 Minutes: Examining Machine Learning Trends

We will explore the latest and greatest in machine learning, including (but not limited to) semantic recommendations and multi-task learning. In regard to semantic recommendations, we will discuss how multi-modal embeddings – an emerging technique from deep learning – enable us to build a better system that actually understands content. We will also look at how multi-task learning – an approach in which models are trained to learn related tasks in parallel – is central to the notion of Software 2.0, and helps computers learn more the way we do. We will showcase both capabilities with a live demo of our prototypes.
Continue reading


Data Science in 30 Minutes: Uber’s Chief Scientist Explores Frontiers of Machine Learning and AI

This FREE webinar will take place LIVE online on December 19th at 5:30PM ET. Register below now, space is limited!

Join The Data Incubator and Zoubin Ghahramani, Chief Scientist for Uber, for the December 2018 installment of our free monthly webinar series, Data Science in 30 minutes: Uber’s Chief Scientist Explores Frontiers of Machine Learning and AI.

Zoubin will review fundamental concepts and recent advances in artificial intelligence. He will then highlight some areas of research at the frontiers, touching on topics such as deep learning, probabilistic programming, Bayesian optimisation, and AI for data science. Finally, he will describe how these areas fit into Uber’s mission.
Continue reading


Data Science in 30 Minutes: Holden Karau – A Quick Introduction to PySpark


IBM‘s Holden Karau joined  The Data Incubator in June 2017 and for our free online webinar series, Data Science in 30 minutes – Sign up below for the full video!

Holden Karau presented a super fast introduction to PySpark – how to use Python and Spark together when you exceed the limitations of a single machine. Apache Spark is a fast and general engine for distributed computing & big data processing with APIs in Scala, Java, Python, and R. This tutorial will briefly introduce PySpark (the Python API for Spark) with some hands-on-exercises combined with a quick introduction to Spark’s core concepts. We will cover the obligatory wordcount example which comes in with every big-data tutorial, as well as discuss Spark’s unique methods for handling node failure and other relevant internals.

Continue reading


Data Science in 30 Minutes: Deep Learning to Detect Fake News with Uber ATG Head of Data Science, Mike Tamir

This FREE webinar will take place LIVE online on August 21st at 5:30PM ET. Register below now, space is limited!


Join The Data Incubator and Mike Tamir, Head of Data Science for Uber Advanced Technologies Group, for the August 2018 installment of our free monthly webinar series, Data Science in 30 minutes: Deep Learning to Detect Fake News.

Mike will discuss how he created FakerFact.org, an Artificial Intelligence tool that enables readers to detect when an article is focused on credible information sharing vs. when the focus is on manipulation. We will explore real world use case applications for automated “Fake News” evaluation using contemporary deep learning article vectorization and tagging. We begin with the use case and an evaluation of the appropriate context applications for various deep learning applications in fake news evaluation. We will discuss several methodologies for article vectorization with classification pipelines, ranging from traditional to advanced neural network deep architecture techniques. We close with a discussion on troubleshooting and performance optimization when consolidating and evaluating these various techniques on active data sets.
Continue reading


Data Science in 30 Minutes: The Accidental Data Scientist with Katrina Riehl, Director of Data Science for HomeAway.com

This FREE webinar will take place LIVE online on July 24th at 5:30PM ET. Register below now, space is limited!


Join The Data Incubator and Katrina Riehl, Director of Data Science for HomeAway.com, for the July 2018 installment of our free monthly webinar series, Data Science in 30 minutes: The Accidental Data Scientist.

Katrina will detail the journey her career has taken from researcher and software developer to Data Scientist. She will explain how her technology roles and skills have evolved as this new discipline emerged over the last decade. First, starting out as a young Python and Artificial Intelligence enthusiast and eventually after many years, finally embracing Data Science as a discipline, and leading a strong and diverse Data Science team.
Continue reading


Data Science in 30 Minutes: Why Big Data Needs Thick Data with Tricia Wang


This FREE webinar took place on June 26th, 2018. Sign up below for the free video!

Tricia Wang, co-founder of SuddenCompass joined The Data Incubator for the June 2018 episode of our free online webinar series, Data Science in 30 minutes: Why Big Data Needs Thick Data.

Why do so many companies make bad decisions, even with access to unprecedented amounts of data? Tricia has the answer: companies are implementing “big data” without what she calls the secret, missing ingredient, “thick data” – precious, unquantifiable insights from actual people – to make the right business decisions and thrive in the unknown. Tricia shared stories and lessons from how her company, Sudden Compass, advises and teaches organizations to unlock insights from big data and turn their big data projects from optimizing the bottom-line to driving growth.
Continue reading


Data Science in 30 Minutes: Building Data Science Capabilities That Scale


This FREE webinar took place on May 17th, 2018. Sign up below for the full video!

DataScience.com CSO, William Merchan joined The Data Incubator for the May installment of our free online webinar series, Data Science in 30 minutes: Building Data Science Capabilities That Scale.

Data scientists and machine learning engineers saw the highest job growth of any role last year, yet few companies have successfully turned their aggressive hiring into profitable, scalable data science capabilities. In this session, DataScience.com CSO, William Merchan, shares lessons learned from building a platform that supports collaborative data science for a variety of clients, from startups to Fortune 500 companies. Learn about the technology gaps, roadblocks to innovation and efficiency, and talent retention challenges that have proven to be detrimental to data science success in an enterprise environment — and how to mitigate them.
Continue reading


Data Science in 30 Minutes: Alan Schwarz, Former NYTimes Journalist, on Numbers-Based Journalism

Alan Schwarz, former NY Times journalist joined The Data Incubator for the February 2018 installment of our free online webinar series, Data Science in 30 minutes: Numbers-Based Journalism.

Sign up below to get access to the video of this webinar for free!

Alan Schwarz, former N.Y. Times investigative reporter and Pulitzer finalist, discussed numbers-based journalism that shook industries from the National Football League to Big Pharma. Alan used data analysis to expose the NFL’s cover-up of concussions as well as issues in child psychiatry.
Continue reading