ENTRIES TAGGED "Facebook"
Obstacles for big data, big data intelligence, and a privacy plugin puts Google and Facebook settings in the spotlight.
Here are a few stories from the data space that caught my attention this week.
Big obstacles for big data
For the latest issue of Foreign Policy, Uri Friedman put together a summarized history of big data to show “[h]ow we arrived at a term to describe the potential and peril of today’s data deluge.” A couple months ago, MIT’s Alex “Sandy” Pentland took a look at some of that big data potential for Harvard Business Review; this week, he looked at some of the perilous aspects. Pentland writes that to be realistic about big data, it’s important to look not only at its promise, but also its obstacles. He identifies the problem of finding meaningful correlations as one of big data’s biggest obstacles:
“When your volume of data is massive, virtually any problem you tackle will generate a wealth of ‘statistically significant’ answers. Correlations abound with Big Data, but inevitably most of these are not useful connections. For instance, your Big Data set may tell you that on Mondays, people who drive to work rather than take public transportation are more likely to get the flu. Sounds interesting, and traditional research methods show that it’s factually true. Jackpot!
“But why is it true? Is it causal? Is it just an accident? You don’t know. This means, strangely, that the scientific method as we normally use it no longer works, because there are so many possible relationships to consider that many are bound to be ‘statistically significant’. As a consequence, the standard laboratory-based question-and-answering process — the method that we have used to build systems for centuries — begins to fall apart.”
Pentland says that big data is going to push us out of our comfort zone, requiring us to conduct experiments in the real world — outside our familiar laboratories — and change the way we test the causality of connections. He also addresses issues of understanding those correlations enough to put them to use, knowing who owns the data and learning to forge new types of collaborations to use it, and how putting individuals in charge of their own data helps address big data privacy concerns. This piece, together with Pentland’s earlier big data potential post, are this week’s recommended reads.
How Facebook stacks up against other tech IPOs.
This week's visualization comes from The New York Times and compares the last 30 years of tech IPOs (hint: watch for the big blue dot).
Pete Warden walks through the steps behind his latest Facebook visualization.
Creating a visualization requires more than just data and imagery. Pete Warden outlines the process and actions that drove his new Facebook visualization project.
Facebook says we're closer than we thought, Gnip targets finance, and eBay grabs Hunch.
Facebook research questions the "six degrees of separation" rule, Gnip gets into the real-time financial data business, and eBay looks to put Hunch's recommendation engine to use.
Facebook surfaces past status updates, particles defy physics, and data goes Hollywood in "Moneyball."
This week's data news includes news on Facebook's Timeline, observations of neutrinos moving faster than the speed of light, and Hollywood's take on data analysis.
Fears over Facebook's face recognition, hacked passwords get analyzed, and a video game taps historical data.
Stories this week examine some of the fears — real and imagined — around our social data, our privacy and security. But in case we forget the great things social data can build, we also look at the archival data behind the video game “L.A. Noire.”