Minnesota Pollution Control Agency Clears View of Data
The Minnesota Pollution Control Agency (MPCA) protects Minnesota's natural environment through monitoring, clean-up, regulation enforcement, policy development and education. The MPCA strives to meet the highest levels of transparency and accountability; to that end the agency has established a data analysis unit to respond to ad hoc information requests. The team chose Tableau to help them answer questions faster and more completely, and to further adoption of a data-driven culture within the MPCA.
Highly Interesting and Highly Public
Minnesota has robust legislation regarding open records, so all state agencies are held to high standards regarding transparency and governmental accountability.
The MPCA Data Analysis team was established to respond to research questions and data requests from both internal clients, such as other staff members and agency leadership, and from external clients including policymakers and concerned citizens.
"By law, we have high standards for data accessibility; it is a priority to provide the information unless we are required by law to keep it private," says Leslie Goldsmith, data analysis supervisor at Minnesota Pollution Control Agency (MPCA).
She points out that MPCA data is both highly interesting and highly public, making it a focus for citizens and politicians alike.
"If you can't provide an answer, people may become concerned that you're hiding something. Or they worry that you are not competent. That creates unnecessary noise in the discussion," she explains.
When requests come from politicians, the team must ensure that it is being clear and transparent about the data on which its analysis is based.
"If you've provided an answer for a particular hearing, for example, you want to be able to reproduce that answer," Goldsmith explains.
"Typically when a concerned citizen asks a question, they just want an answer. But our policymakers not only want an answer, they want to see the data so they can have their staff verify it."
Data in a Box
The Data Analysis team is also focused on helping the agency further its efforts to be a data-driven organization.
"We would run into situations where you'd get a different answer out of a question each time, based on who was answering the question," says Goldsmith. Part of the team's mission was to help establish consistent and truthful answers.
"We help people differentiate between data that was truly missing and data that exists, but they just don't know how to get at it," says Goldsmith. "Or people will bring us their box of data and say, 'Can you make something interesting out of this?'"
That "box of data" typically comprises two different types of data: field data (in some cases, literally so) about environmental conditions and stressors, and organizational data such as performance metrics, counts of works in progress, and data about milestone events.
The data is also stored across a number of data sources, including Oracle, Microsoft SQL Server, Esri Database, Access, Excel, and PostgreSQL. "Data is stored everywhere in the agency. It's not as central as one might hope," says Goldsmith.
Some questions come up routinely, such as the number and monetary value of actions the MPCA has taken against industries.
"We seem to get that question every month," she says. "And each time the requester asks for it a slightly different way than the way it was asked the month before. If you have to constantly pull the data through an Access query or you need to hit the database, it's redundant with previous work and it takes time."
"In Excel, You Had to Get out the Big Hammer"
After accessing the data, the team still needs to put it into a usable format.
"Many people don't like spreadsheets; they get lost in all the numbers. We wanted to take all of these different kinds of data and make some decent pictures from them. And if you tried to do that in Excel, you had to get out the big hammer and your format blew up and seven people wanted it 18 different ways!" says Goldsmith with a laugh. "That makes it really hard to be productive."
She estimates that the process of making presentable graphs using data from Excel and Access took about two hours for each report. And those two hours would be spent each time the report was refreshed: some daily, others weekly or monthly.
In 2007, Goldsmith went looking for a better tool and ran across a mention of Tableau.
As a longtime proponent of visual analysis, Goldsmith remembered reading about Tableau back when it was still a research project at Stanford.
Intrigued, Goldsmith arranged for a few trials of Tableau Desktop 4.
"I used it in the trial and was doing something useful with it 20 minutes out of the box," says Goldsmith. "We have never looked back."
The team has been using Tableau for nearly five years, starting with Tableau Desktop 4 and quickly moving to Tableau Server 4 and then upgrading with each new release. The MPCA team is now using Tableau Server 8.
The MPCA has 18 analysts authoring visualizations in Tableau Desktop and publishing them agency-wide through Tableau Server. All 900 agency employees can access and interact with visualizations published to Server. The agency uses Active Directory to control access to dashboards and visualizations.
"We Tend to Say 'Yes' with Tableau"
Goldsmith believes that Tableau helps the MPCA meet its goals for governmental transparency and accountability.
"We have 2,600 workbooks created over the last five years, and many of them have been answers to those one-off questions," says Goldsmith. "When someone contacts us and asks us to do something, we try to say 'yes.' And we tend to say 'yes' with Tableau."
For example, her team built an automatically updated workbook around enforcement efforts. This allows the agency to respond rapidly and consistently to questions from the public, the press or politicians.
"This workbook shows how many enforcement actions we've taken, the value of those actions, and the environmental benefit of those enforcement actions. So it makes it really easy to respond to questions," she says.
She also appreciates that Tableau enables her to share not just the analysis, but also the data behind it.
"The packaged workbook is a great way to bundle together the static picture (the table and the data behind the visualization) at a particular point in time, so you do have reproducibility and good data lineage," she says.
Tableau's ability to produce speedy analysis plus underlying data also helps to allay fears and suspicions that occur when answers are incomplete or take too long to produce.
"Dive, Play, Explore, Really Fast"
The team is using Tableau as an important tool to advance a data-driven culture within the agency.
"A lot of these questions that we can answer with Tableau, otherwise they would be considered impenetrable and unanswerable. 'Gee, if only we could answer that but c'est la vie; we can't,'" she says.
Goldsmith considers Tableau's flexibility in connecting to various data sources to be particularly helpful.
"When people brought us that box of data, Tableau is a great tool for that kind of flexibility," says Goldsmith. "You can dive into the data and play with it and explore it really fast."
The team also uses Tableau to help educate agency groups about the importance of quality data.
"When you talk to people and you show them a table or spreadsheet, it doesn't always click. But if you visualize it, it's so much more impactful," says Goldsmith. "When managers see a timeline starting at 1392 because someone made a data entry error, they get it."
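The kind of data entry error Goldsmith describes can also be caught with a simple range check before the data is charted. The Python sketch below is only an illustration and not something the MPCA team is described as using: the function name `implausible_years`, the cutoff of 1900, and every sample value other than 1392 are assumptions made for the example.

```python
from datetime import date


def implausible_years(years, earliest=1900, latest=None):
    """Return the years that fall outside the plausible range.

    `earliest` is an assumed cutoff; `latest` defaults to the current year.
    """
    if latest is None:
        latest = date.today().year
    return [year for year in years if not earliest <= year <= latest]


# Hypothetical sample: 1392 is the erroneous value mentioned in the article,
# the other years are made up for illustration.
sample_years = [1989, 1992, 1392, 2004]
print(implausible_years(sample_years))  # [1392]
```

A timeline plotted from the same values would stretch back six centuries, which is the visual cue the managers in the anecdote react to.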
Goldsmith is confident that using Tableau has improved her team's productivity. She estimates that it would take two hours simply to pull the data and determine the answer; formatting the response would take another hour or two.
"We save about a half-hour to an hour for each question," she says. "We don't have to spend that time, that tedious horribleness that comes with other tools."
"A couple of hours in Tableau, and the response is prepped, ready for consumption. And it's connected to the data so I never have to do it again," she says. "We can produce stuff that can be used over and over. It's particularly helpful in high-demand areas."
She continues, "In terms of ROI, I estimate that it takes about 40 hours of saved labor to start getting actual, measurable return on the cost of a Tableau Desktop license. Anybody who uses Tableau much, they are probably going to be in pure return after a couple of months."
"Without Tableau, we would be working a whole lot harder to accomplish a whole lot less."
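As a rough check on the break-even estimate quoted above, the two figures Goldsmith gives (about 40 hours of saved labor per Tableau Desktop license, and a half-hour to an hour saved per question) can be combined. Pairing the two numbers this way is an assumption of this sketch, not a calculation from the article, and the function name is hypothetical.

```python
# Figures quoted in the article.
BREAK_EVEN_HOURS = 40               # saved labor needed to recoup one license
SAVED_PER_QUESTION = (0.5, 1.0)     # hours saved per question (low, high)


def questions_to_break_even(break_even_hours, hours_saved_per_question):
    """Number of answered questions needed to reach the break-even point."""
    return break_even_hours / hours_saved_per_question


low_saving, high_saving = SAVED_PER_QUESTION
most = questions_to_break_even(BREAK_EVEN_HOURS, low_saving)
fewest = questions_to_break_even(BREAK_EVEN_HOURS, high_saving)
print(f"Break-even after roughly {fewest:.0f} to {most:.0f} questions")
# prints "Break-even after roughly 40 to 80 questions"
```

Under these assumptions, an analyst answering 20 to 40 questions a month would reach break-even in about two months, which is in line with the "couple of months" in the quote.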