Data are becoming the new raw material of business
The Economist


Turning Bold Questions into a Data Science Career at Amazon: Alumni Spotlight on David Wallace

At The Data Incubator we run a free eight-week data science fellowship to help our Fellows land industry jobs. We love Fellows with diverse academic backgrounds that go beyond what companies traditionally think of when hiring data scientists. David was a Fellow in our Winter 2016 cohort who landed a job with one of our hiring partners, Amazon.

Tell us about your background. How did it set you up to be a great data scientist? 

Before joining The Data Incubator, I completed my Ph.D. in chemistry at Johns Hopkins University, where I focused on the design and synthesis of new magnetic materials. My research gave me the opportunity to work alongside scientists in many different disciplines and exposed me to a vast array of experimental techniques and theoretical constructs. From a data science perspective, this meant that I was constantly encountering new types of data and searching for scientifically rigorous models to explain them. As the volume and complexity of these datasets increased, graphical data analysis tools like Excel and Origin weren’t making the cut for me, and I gradually made the transition to performing data transformation and analysis entirely in Python. That was a big technical leap that took a lot of time and caused plenty of frustration, but I think it ultimately made me a better researcher.

From a research perspective, working in a vibrant academic setting also meant learning how to ask bold questions, even at the risk of sounding stupid in front of a large group of mentors and peers, something I’ve done more than I care to admit. For me, finding the right question to ask is just as important as having the technical expertise to find an answer, and that’s one of the things that makes Data Science so exciting.

What do you think you got out of The Data Incubator?

TDI provided the opportunity to work with an incredibly intelligent and motivated group of people on difficult problems that were directly relevant to Data Science. In the DC office we were constantly troubleshooting problems together, trying new ideas, and helping each other improve as Data Scientists. This collaborative atmosphere, coupled with a very strong curriculum and knowledgeable mentors, really helped me take my programming and machine learning capabilities to the next level. Aside from the technical aspects, TDI was just a fun experience, and I made some great friends with whom I’ll stay in touch throughout my career. Shout-outs to Team Werewolf and Team Chupacabra!

Could you tell us about your Data Incubator project?

When I moved across the country to start grad school five years ago, I had no idea where to live and no idea how to search for a place besides reading blogs and searching Craigslist. It was a frustrating and scary problem, to say the least. To solve it, I created a web application that leverages nine different geospatial datasets, containing information such as crime rates and grocery store locations, to help a user target their search for a new home in Baltimore City. The primary functionality uses a statistical method called Gaussian kernel density estimation to compute recommended hotspots for each user and then display those hotspots on a map. I built the application using Python, Flask, CartoDB and Bootstrap, all of which are topics covered in TDI’s curriculum. Check it out at: http://stomping-grounds.herokuapp.com
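
For readers new to the technique, here is a minimal sketch of two-dimensional Gaussian kernel density estimation in plain Python. It is not the code behind David's application: the coordinates, the bandwidth, the grid, and the function names `gaussian_kde_2d` and `hottest_cell` are all made up for illustration, and it handles a single dataset where the real project combined nine.

```python
import math


def gaussian_kde_2d(points, bandwidth):
    """Return a function that estimates the density at (x, y).

    Each input point contributes an isotropic Gaussian "bump" with
    standard deviation `bandwidth`; the estimate is their average.
    """
    n = len(points)
    # Normalization for an average of 2-D isotropic Gaussians.
    norm = 1.0 / (n * 2.0 * math.pi * bandwidth ** 2)

    def density(x, y):
        total = 0.0
        for px, py in points:
            squared_distance = (x - px) ** 2 + (y - py) ** 2
            total += math.exp(-squared_distance / (2.0 * bandwidth ** 2))
        return norm * total

    return density


def hottest_cell(points, bandwidth, grid):
    """Return the grid location with the highest estimated density."""
    density = gaussian_kde_2d(points, bandwidth)
    return max(grid, key=lambda cell: density(*cell))


# Hypothetical grocery store locations in arbitrary map units.
stores = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (2.0, 2.0)]

# A coarse 5 x 5 grid of candidate locations covering the same area.
grid = [(x * 0.5, y * 0.5) for x in range(5) for y in range(5)]

best = hottest_cell(stores, bandwidth=0.3, grid=grid)
print("Recommended hotspot:", best)
```

The bandwidth controls how far each point's influence spreads: a small value produces tight, separate hotspots, while a large one blurs them together. One plausible way to extend the sketch to several datasets would be a weighted sum of per-dataset densities, with negative weights for things a user wants to avoid, though that is an assumption rather than a description of the application.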

What advice would you give to someone who is applying for The Data Incubator, particularly someone with your background?

Start learning Python! Codecademy and Google have great tutorials for new programmers. Once you’ve got the basics down, start applying it to your research, bit by bit. Stack Exchange will be your best friend. Get used to diving into new packages and deciphering their documentation, because you’re going to be doing it a lot. Most importantly, don’t get discouraged by how much there is to learn.

What’s your favorite thing you learned while at The Data Incubator? This can be a technology, concept, or whatever you want!

I had never been exposed to graph analysis before, so when we used NetworkX to build a social network in the first week at TDI, I was completely blown away. I’ve ended up using NetworkX in interviews and personal projects several times since that first week because it provides a really intuitive and efficient way to deal with complex networks.
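
As a rough illustration of the kind of analysis David describes, the sketch below builds a tiny social network with NetworkX and asks two simple questions of it. The names and friendships are invented, and this is not the exercise from TDI's curriculum.

```python
import networkx as nx

# Hypothetical friendships; each pair is an undirected edge.
friendships = [
    ("ana", "ben"),
    ("ana", "cho"),
    ("ben", "cho"),
    ("cho", "dev"),
    ("dev", "eli"),
]

G = nx.Graph()
G.add_edges_from(friendships)

# Degree centrality: the fraction of other people each person is connected to.
centrality = nx.degree_centrality(G)
most_connected = max(centrality, key=centrality.get)

# Shortest chain of friendships linking two people.
path = nx.shortest_path(G, "ana", "eli")

print("Most connected:", most_connected)
print("Path from ana to eli:", " -> ".join(path))
```

Representing people as nodes and friendships as edges is what makes questions like "who is best connected?" or "how are these two people linked?" one-line calls.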

Where are you going to be working? And tell us a little about your new job!

I’m joining Amazon as a Data Scientist on their Consumer HR team. I’ll be working alongside psychologists, engineers and HR professionals to ask and answer tough questions about Amazon’s dynamic and growing workforce. This is a great opportunity for me to use the technical skills I’ve developed at JHU and TDI while learning about a completely new field. I’m psyched to get started!
