The New World of Massive Data Mining | WAMU 88.5 - American University Radio

WAMU 88.5 : The Diane Rehm Show

The New World of Massive Data Mining

Every time you go on the Internet, make a phone call, send an email, pass a traffic camera or pay a bill, you create data, electronic information. In all, 2.5 quintillion bytes of data are created each day. This massive pile of information from all sources is called “Big Data.” It gets stored somewhere, and everyday the pile gets bigger. Government and industry are finding new ways to analyze it. Last week the administration announced an initiative to aid the development of Big Data computing. A panel of experts join guest host Tom Gjelten to discuss the opportunities -- for business, science, medicine, education, and security … but also the privacy concerns.

Program Highlights

The term big data refers to the massive amounts of digital information companies and governments collect about us and our surroundings every day, pictures, records, temperatures, conversations. Our guests discuss how government and private industry are using big data and the main concerns surrounding its collection and utility.

What Is "Big Data?"

Villasenor said that big data is "really big." The amount of data that's estimated to have been created or replicated would fill 11 billion iPod classics, each holding about 160 gigabytes. "Remember that the world population is only 7 billion so that's a truly incomprehensible amount of data," Villasenor said.

Practical Uses

Every organization, whether it's government or private sector, uses information in different ways, said Leiter. In the world of terrorism, data that was collected clandestinely could be cross-checked with information that was available publicly to try to identify people who were doing suspicious things. In the private sector, organizations like banks use data routinely to identify cyber fraud and organized crime activity. "There's almost no application, either in government or the private sector, that can't benefit from some of this big data," Leiter said.

Privacy An "Enormous" Concern

Privacy is an enormous concern, but big data isn't necessarily always directly correlated with privacy, Villasenor said. For instance, the total amount of data needed to represent all the websites an average person visits in one year is not that big - about one or two megabytes. But a lot of people would consider that information very private, Villasenor said. "That said, of course, the more data that's out there, then the more opportunity there is that it could potentially be used in ways that were detrimental to privacy," he said.

You can read the full transcript here.

NPR

In Tom Hanks' iPad App, Typewriters Make Triumphant Return (Ding!)

For iPad users who are nostalgic for the clickety-clack of keystrokes and "ding!" of the carriage return, Hanks has created Hanx Writer, an app that simulates using a typewriter.
NPR

New U.S. Rules Protect Giant Bluefin Tuna

To reduce the number of giant bluefin tuna killed by fishing fleets, the U.S. is putting out new rules about commercial fishing in the Gulf of Mexico and parts of the western Atlantic.
WAMU 88.5

Jury Deliberating 14 Counts In McDonnell Trial

As of Tuesday afternoon, there is no word yet from the seven men and five women who are deliberating 14 separate counts against the McDonnells.
NPR

The Troubling Implications Of The Celebrity Photo Leak

To learn more about the recent celebrity photo hack, Melissa Block speaks with Matthew Green of Johns Hopkins University. They discuss how the photos might have been obtained.

Leave a Comment

Help keep the conversation civil. Please refer to our Terms of Use and Code of Conduct before posting your comments.