Boston Digs Big Data

Patrick speaks, and Rus (CSAIL Director), Hockfield (MIT President), Rattner (Intel CTO), and Madden (bigdata@CSAIL Director) listenning

As first reported by Xconomy’s Greg Huang, three Boston-based big data initiatives were announced today at MIT: 1. Intel’s newest Intel Science and Technology Center (ISTC) for Big Data will be hosted by MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL).  Continue reading

Posted in Big Data Analytics, Boston, Intel, MIT, Research | 2 Comments

Machines vs. Models, Noise vs. Signal

An excerpt from Nassim Taleb’s forthcoming book, Antifragile, was posted yesterday on the Farnam Street blog. In “Noise and Signal,” Taleb says that “In business and economic decision-making, data causes severe side effects —data is now plentiful thanks to connectivity; and the share of spuriousness in the data increases as one gets more immersed into it. A not well discussed property of data: it is toxic in large quantities—even in moderate quantities…. the best way… to mitigate interventionism is to ration the supply of information, as naturalistically as possible. This is hard to accept in the age of the internet. It has been very hard for me to explain that the more data you get, the less you know what’s going on, and the more iatrogenics you will cause.”   Continue reading

Posted in AI, Asking questions, Models, Machine Learning, Big Data Analytics | Leave a comment

3 Big Data Startups: Paradigm4, DataSift, Mortar Data

Paradigm4

What does it do?  Founded in 2010, Paradigm4 is a massively scalable advanced analytics platform with an innovative non-relational database under the hood. It aims to  provide faster analytics, across larger data sets, with seamlessly integrated data management.

Management team Mike Stonebraker, CTO; Marilyn Matz, CEO; Paul Brown, Chief Architect     Continue reading

Posted in DataSift, Mortar Data, Paradigm4, startups | Leave a comment

Big Data Quotes of the Week

“It’s great to see [data science] maturing, and this new focus will lead to data applications which are not just more powerful, but more reliable and more impactful as well. Data Science has come of Statistical age.”–David Smith, Revolution Analytics Continue reading

Posted in Big Data Analytics, GigaSpaces, Quotes, Zebit | 1 Comment

Facebook’s IPO and the Laws of Big Data

Without using any predictive analytics tools, I confidently predict that Facebook’s IPO will give rise to more vocal demands for people to “get a cut” of its—and other social media companies’—profits. People deserve, so the argument goes, a share of any profits derived from mining the social data pool which they have so willingly helped create. Occupy Facebook, anyone?

But before you set up a tent in Menlo Park, consider this proposition: The value of personal data is zero. Personal data is not worth much if it’s kept personal and a sample of one is good for answering a very limited set of questions. Personal data gains value when it is shared, when it is combined with and compared to other data.  Continue reading

Posted in Big Data Analytics, Big Data Laws, Data Discovery, Data Science | Leave a comment

Profiles in Data Science: Jake Porway

Current positions: Founder and Executive Director, DataKind; Data Scientist, The New York Times

Bio: Porway spends his days as the data scientist in the New York Times R&D lab and his nights working on DataKind (formerly Data without Borders) which seeks to match non-profits in need of data analysis with freelance and pro bono data scientists  He founded DataKind in the hopes of creating a world in which every social organization has access to data capacity to better serve humanity.  Jake holds a B.S. in Computer Science from Columbia University and his M.S. and Ph.D. in Statistics from UCLA.

Quotable quote: “Data is the thrumming, electrical beat that is starting to drive everything.”

Papers/presentations/posts/interviews: Presentation at Pop!Tech; National Geographic–Emerging ExplorerData Scientists: The New Rock Stars of the Tech World

Posted in Data Scientists, DataKind | Leave a comment

Big Data Analytics in the New IBM CEO Study

The new Global IBM CEO Study found that almost one-quarter of CEOs say their organizations operate below par in terms of driving value from data.

A consumer products CEO from North America summarized the feeling of frustration brought about by the explosion in the volume data: “We have lots of data, but only 10 percent of it is useful information. And even within that 10 percent, we are not using it effectively. Impactful analytics is not in our genes.”   Continue reading

Posted in Big Data Analytics, Business Impact, CEOs | Leave a comment