Tableau: Can you describe a little bit about what you do at the Health Media Collaboratory?
Glen Szczypka, Deputy Director, Health Media Collaboratory: Our mission is data for public good. Over the last ten years, with the advent of the Internet, the advent of social media, we now have screens in front of us at all times.
We get inundated with these data messages. And those messages can lead us to making really bad health choices. So we want to study that data, harness that data, and use it to make people healthier.
Tableau: If I’m new to analyzing social data, what should I keep in mind? Any tips for working with social data in Tableau?
Glen: The first thing you need to realize is social data is dirty data. And just because you use a key word, you can't assume that your question that you're asking is going to be contained in a tweet.
You need to know that the tweet you're looking at is the behavior that you're trying to study. So you really need to clean your social media data before you even put it into Tableau.
With the front end of a tweet, there are about four sources of information that you can get. But at the back end of a tweet, there could be 20 to 25 different types of metadata.
And Tableau is great with tweets. You get longitudinal, latitudinal data. And Tableau works great with that. You can map out where the tweets are into these great cluster circles. It works very well with the metadata variables on the backside of a tweet.
Tableau: What sort of data are you looking at?
Glen: We collect from a variety of social media platforms, Tumblr, Twitter, Facebook, You Tube, and WordPress. Our next platform is Foursquare. And Foursquare is all about geolocation, so we're really excited about working with that. It's a rapidly-changing environment for social data. New platforms are available. Anytime they become available, we try to to collect data there.