WAMU 88.5 : The Diane Rehm Show

The New World of Massive Data Mining

Every time you go on the Internet, make a phone call, send an email, pass a traffic camera or pay a bill, you create data, electronic information. In all, 2.5 quintillion bytes of data are created each day. This massive pile of information from all sources is called “Big Data.” It gets stored somewhere, and everyday the pile gets bigger. Government and industry are finding new ways to analyze it. Last week the administration announced an initiative to aid the development of Big Data computing. A panel of experts join guest host Tom Gjelten to discuss the opportunities -- for business, science, medicine, education, and security … but also the privacy concerns.

Program Highlights

The term big data refers to the massive amounts of digital information companies and governments collect about us and our surroundings every day, pictures, records, temperatures, conversations. Our guests discuss how government and private industry are using big data and the main concerns surrounding its collection and utility.

What Is "Big Data?"

Villasenor said that big data is "really big." The amount of data that's estimated to have been created or replicated would fill 11 billion iPod classics, each holding about 160 gigabytes. "Remember that the world population is only 7 billion so that's a truly incomprehensible amount of data," Villasenor said.

Practical Uses

Every organization, whether it's government or private sector, uses information in different ways, said Leiter. In the world of terrorism, data that was collected clandestinely could be cross-checked with information that was available publicly to try to identify people who were doing suspicious things. In the private sector, organizations like banks use data routinely to identify cyber fraud and organized crime activity. "There's almost no application, either in government or the private sector, that can't benefit from some of this big data," Leiter said.

Privacy An "Enormous" Concern

Privacy is an enormous concern, but big data isn't necessarily always directly correlated with privacy, Villasenor said. For instance, the total amount of data needed to represent all the websites an average person visits in one year is not that big - about one or two megabytes. But a lot of people would consider that information very private, Villasenor said. "That said, of course, the more data that's out there, then the more opportunity there is that it could potentially be used in ways that were detrimental to privacy," he said.

You can read the full transcript here.

NPR

Book News: Stephen King's New Bogeyman? Digital Publishing

Also: the legacy of Kierkegaard; the creator of Lyle Crocodile has died; Aussie airliner Qantas commissions flight-length books.
NPR

Washington State Butcher Spikes Pig Feed With Weed

Despite its name, the "pot pig" experiment isn't an attempt to develop a new meaty treat for stoners. Instead, a Seattle butcher is feeding marijuana seeds, stems and root bulbs to swine as a cheeky money-saving measure.
NPR

Fox News Reporter James Rosen Caught Up In Federal Probe

There is word of another controversial leak investigation by the Department of Justice. The target is Fox News reporter James Rosen, who was monitored by the department after breaking a story about North Korea's nuclear weapons program in 2009.
NPR

Tumblr Users Urge New Owner Yahoo To Keep The Site Weird

When news of Yahoo's purchase of Tumblr first hit, Tumblr users took their reactions online. The posts were quirky and sharp with plenty of worry about the future.

Leave a Comment

Help keep the conversation civil. Please refer to our Terms of Use and Code of Conduct before posting your comments.