5.5 Million Rows of Baby Name Data
Now that Tableau Public supports datasets up to 10 million rows, a few of us on the Tableau Public team thought we'd try our hands at some fantastically large datasets. To kick it off, I used the full Social Security Administration baby names dataset. This dataset contains every name used to apply for a social security card between 1910 and 2013. (One caveat: to protect privacy, a name needed to have more than 5 occurrences in a given state and year to count.) This is a monster dataset at over 5.5 million rows of data.

That's a lot of babies...and a lot of diapers.
I created this Tableau Story, first highlighting some things I found interesting and then letting you input your own name and get a history of it. I compared the popularity of all the Team Public names (not a lot of love for Dashiell it seems!), comparing the different versions of Mary, and my favorite, comparing all the one name pop divas. Who knew Rihanna was so influential to recent mothers?
Checking out my own name, it appears that another one name musician had an influence on moms and dads in the late 90's.

Who will sa-a-ave your soul... if you keep asking me if I was named after the singer?
Clicking on the treemaps will filter the map so you can see what names are more popular where. In the Mary, Marie, Maria point, Maria is much more prominent in California and Texas, places with more of a Hispanic population. Our own German Tableau Publican, Florian, doesn't have a very common name in the US, but it is more common in the upper midwest, where there is more of a German population.

With over 5.5 million rows of data, I've just barely scratched the surface of the kinds of stories you can tell with this dataset. Feel free to download the workbook and see what you can discover! If you find anything fun, be sure to tweet it to me @jeweloree.
Photo credit Donnie Ray Jones via Flickr
相关故事
订阅我们的博客
Tableau 的人员每天都在搜寻关于数据、分析和可视化的精彩新闻。 我们的使命是帮助人们查看并理解数据,其中重要的一环就是通过我们的博客分享这些新闻。 从关于如何更高效使用 Tableau 的技巧,到了解人们日常如何处理数据挑战, Tableau 汇集了众多的数据爱好者。