<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>AllThingsD &#187; big data</title>
	<atom:link href="http://allthingsd.com/tag/big-data/feed/" rel="self" type="application/rss+xml" />
	<link>http://allthingsd.com</link>
	<description></description>
	<lastBuildDate>Thu, 23 May 2013 14:40:22 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
<atom:link rel="hub" href="http://pubsubhubbub.appspot.com"/><image>
		  <url>http://allthingsd.com/theme/images/logo-rss.jpg</url>
		  <title>All Things Digital</title>
		  <link>http://allthingsd.com/</link>
		  <width>144</width>
		  <height>22</height>
	</image>		<item>
		<title>Customer Service Is Next Job for IBM's Watson</title>
		<link>http://allthingsd.com/20130520/customer-service-is-next-job-for-ibms-watson/</link>
		<comments>http://allthingsd.com/20130520/customer-service-is-next-job-for-ibms-watson/#comments</comments>
		<pubDate>Tue, 21 May 2013 04:46:27 +0000</pubDate>
		<dc:creator>Arik Hesseldahl</dc:creator>
				<category><![CDATA[Commerce]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Big Blue]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[customer service]]></category>
		<category><![CDATA[game show]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[Jeopardy]]></category>
		<category><![CDATA[marketing]]></category>
		<category><![CDATA[supercomputing]]></category>
		<category><![CDATA[Watson]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=323744</guid>
		<description><![CDATA[Keeping track of what consumers like and dislike is a beefy computing problem.]]></description>
				<content:encoded><![CDATA[<p><a href="http://allthingsd.com/20130520/customer-service-is-next-job-for-ibms-watson/ibmsauron2/" rel="attachment wp-att-323748"><img src="http://allthingsd.com/files/2013/05/ibmsauron2-380x226.jpg" alt="ibmsauron2" width="380" height="226" class="alignright size-medium wp-image-323748" /></a>Remember Watson? The supercomputer that in an elaborate but interesting publicity stunt <a href="http://allthingsd.com/20110216/all-humans-bow-before-the-mighty-watson-master-of-jeopardy/">beat humanity</a> at the game show &#8220;Jeopardy,&#8221; and then for a follow-up went on to <a href="http://allthingsd.com/20120322/ibm-computer-watson-is-now-a-big-shot-doctor-and-you-still-arent/">become a big-shot doctor</a> (sort of), and more recently has started to <a href="http://allthingsd.com/20130209/ibms-game-show-winning-watson-computer-goes-to-work-treating-cancer/">specialize in cancer research</a> now has yet another new job.</p>
<p>This one doesn&#8217;t sound at first quite as interesting, but from the point of view of complex computing tasks, it&#8217;s pretty cool. When you think about all the ways that companies have to try to engage with and then make their customers happy and the ways they can do that more effectively, you can probably imagine how a deeply analytical computer might be useful.</p>
<p>IBM calls it the Watson Engagement Advisor; it&#8217;s an offshoot its <a href="http://allthingsd.com/20110726/seven-questions-about-smarter-commerce-with-ibms-craig-hayman/">Smarter Commerce initiative</a>. Consider that Watson is smart enough to understand the natural ebb and flow of human language and is designed to answer questions in much the same way humans do, and then quickly sort through a set of known information to determine the best answer, and you&#8217;ll realize it&#8217;s a fit for customer service. </p>
<p>In that way, Watson can learn over time, and like a good bartender with a lot of regulars, keep track of the unique likes and dislikes of customers and get better at it over time. And that&#8217;s important as consumers come to expect to be able to interact with companies pretty much wherever they are and on whatever device they happen to be using at the time: Whether it&#8217;s a smart phone, tablet, PC or whatever, they will expect &#8212; already are expecting &#8212; consistent experiences. Consumers, especially the younger ones, will expect companies to shift with the marketplace as tastes change and evolve.</p>
<p>Watson can be the voice that customers hear when they reach out to the company asking questions. Watson has only gotten smarter since its run on &#8220;Jeopardy,&#8221; speeding up its performance by 240 percent while slimming down the size of the system required to run it by 75 percent. Already the Nielsen Company and the Royal Bank of Canada are among those kicking the tires in trials. </p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20130520/customer-service-is-next-job-for-ibms-watson/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>In Media, Big Data Is Booming but Big Results Are Lacking</title>
		<link>http://allthingsd.com/20130520/in-media-big-data-is-booming-but-big-results-are-lacking/</link>
		<comments>http://allthingsd.com/20130520/in-media-big-data-is-booming-but-big-results-are-lacking/#comments</comments>
		<pubDate>Mon, 20 May 2013 19:51:05 +0000</pubDate>
		<dc:creator>Ben Elowitz</dc:creator>
				<category><![CDATA[Voices]]></category>
		<category><![CDATA[Ben Elowitz]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Bravo]]></category>
		<category><![CDATA[chartbeat]]></category>
		<category><![CDATA[David Fincher]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Google Analytics]]></category>
		<category><![CDATA[House of Cards]]></category>
		<category><![CDATA[IT]]></category>
		<category><![CDATA[Kevin Spacey]]></category>
		<category><![CDATA[Netflix]]></category>
		<category><![CDATA[Omniture]]></category>
		<category><![CDATA[Reed Hastings]]></category>
		<category><![CDATA[Wetpaint]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=323402</guid>
		<description><![CDATA[Nobody wants to use the data.]]></description>
				<content:encoded><![CDATA[<p><img src="http://allthingsd.com/files/2013/05/dilbert3.jpg" alt="dilbert3" width="380" height="285" class="alignright size-full wp-image-323495" /></p>
<p>The New York Times named 2012 <a href="http://www.nytimes.com/2012/08/12/business/how-big-data-became-so-big-unboxed.html?_r=0">the crossover year for Big Data</a>: As a term and as a concept, Big Data broke through from the tech circle and into mainstream consciousness. (So much so that <a href="http://dilbert.com/strips/comic/2012-07-29/">even Dilbert&#8217;s boss was talking about it</a>.)</p>
<p>We&#8217;ve seen huge advances in our ability to generate, collect and store an explosion of data points: 90 percent of the world&#8217;s data has been accumulated in <a href="http://www-01.ibm.com/software/data/bigdata/">the last two years alone</a>. We&#8217;re generating 2.5 quintillion bytes of data daily, and every serious company is dutifully logging and contextualizing every impression, every click and every purchase with excruciating detail.</p>
<p>That said, shockingly little happens to the information once it has been stowed in the database. A good friend gave voice to this dirty little industry secret the other day:</p>
<p>&#8220;Nobody <em>wants</em> to use the data.&#8221;</p>
<p>He&#8217;s remarkably spot-on. Even though almost every CEO says their companies are becoming data-driven, the fact is that most high-level decisions are <em>still</em> being made from bullet points, not data points.</p>
<p>What the data revolution brought us was systems for collecting data &#8212; but collecting is the easy part. And even more importantly, it&#8217;s the safe part.</p>
<h4 class="subhed">The Real Problem: Data Phobia</h4>
<p>The trouble with data is that it asks as many questions as it answers. Your engagement is down, bounce rate is up, search traffic is up &#8212; why is that, and what can we do to make it higher, lower and higher? Data almost never hands you the answers or insights directly; it just illuminates the issue. And it illuminates a whole bunch of them at once, so it&#8217;s up to you to figure out what the priorities are.</p>
<p>If this problem is an &#8220;opportunity in disguise,&#8221; most executives seem quickly scared off by the masquerade. In truth, Big Data raises the bar for how smart you have to be as an executive.</p>
<p>The easy answer &#8212; leaving the analytics to the analytics department &#8212; relieves you of the responsibility of figuring it all out, as though it&#8217;s unknowable to anyone without a degree in data science. But it also relieves you of the answers.</p>
<p>What is the executive&#8217;s greatest fear? That exposing the trove of data without knowing what to do with it makes them look worse, not better. In media, many have hidden that fear behind the veneer of idealistic purism. I remember talking with Martin Nisenholtz several years ago when he was at the New York Times about how data is used in a newsroom; I asked what would happen if he shared performance metrics with reporters in real time (obviously this was before Chartbeat) to see what their audience cares about. He said, &#8220;They would throw me out.&#8221;</p>
<p>Our strong institutions and professional commitment to standards have ensured the journalistic values of truthfulness, accuracy, objectivity, impartiality, fairness and public accountability. None of those values are furthered by closing our eyes and ears to our own audiences. The result is a paradoxical culture that boldly states &#8220;content is king&#8221; and yet refuses to quantify its value for fear of tainting the purity of the product.</p>
<h4 class="subhed">The Opportunity: Using Big Data to Make Big Bets</h4>
<p>Until recently, we have had startlingly few case studies of the transformative power of Big Data on which to model our own big changes in media. Instead we&#8217;ve had IT initiatives that promised big insights, but ended up delivering big databases and bigger IT bills. For once, it&#8217;s not the IT department&#8217;s fault &#8212; it&#8217;s those of us who are using the data (and, more often, aren&#8217;t using it) who are to blame.</p>
<p>That&#8217;s why I turn to those who have made the big bets to see what&#8217;s different. Netflix has long been the poster child for using data to drive results, and now they&#8217;ve proven <a href="http://allthingsd.com/20130212/netflix-house-of-cards-its-most-watched-program/">in no uncertain terms</a> that when you ask your data the right questions you can find hugely valuable insights &#8212; even in the sacred domain of content creation.</p>
<p>Before Netflix pursued the option to buy &#8220;House of Cards,&#8221; it looked to its massive data stash. Execs wanted to know: Do Netflix users enjoy political thrillers? Check. Of political thriller enthusiasts, how many also watch David Fincher films? A whole bunch. Oh, and one more thing: Is this crowd fond of Kevin Spacey? As it turned out, there was a very healthy crossover in that Venn diagram.</p>
<p>Not only did this insight give Reed Hastings the confidence to bid on &#8220;House of Cards&#8221; &#8212; it gave him the level of certainty necessary to outbid heavyweights like HBO and AMC for the series.</p>
<p>What I love most about this story is that the questions were so simple, so logical. Sometimes the sheer volume of data at our fingertips overwhelms us and makes us forget that the fundamental strategic questions haven&#8217;t changed. What has changed is that now we have far better access to the answers. And when you can give your users what they want based on the signals they themselves have been sending you, that&#8217;s when Big Data starts to earn its keep.</p>
<h4 class="subhed">Five Questions You Should Be Asking Your Data</h4>
<p>Forget about Omniture and Google Analytics and all of the data minutiae you&#8217;re already tracking. Forget about little personalization features. The most valuable data doesn&#8217;t fit on the dashboard. Think bigger and move upstream: What&#8217;s the most amazing new product or service you can create? Here are five places to start digging:</p>
<ul>
<li><strong>What does my audience <em>love</em>?</strong> Cut the data every way you can to deeply understand this, with nuance &#8212; then reorient around that product. It might be parenting advice, or current memes, or breaking news. If you can find a common thematic thread in your most-consumed content, you have a great starting point for further segmentation. Lauren Zalaznick turned the Bravo network around by pinpointing the five key interests of the audience, cutting out the clutter, and giving them more and more and more of what they loved (hence the hugely popular &#8220;Top Chef&#8221; and &#8220;Real Housewives&#8221;).</li>
<li><strong>How do they want it?</strong> Netflix noticed that a significant number of users were watching marathon-style, and so they bucked TV tradition and released &#8220;House of Cards&#8221; all at once. How could you change your content packaging to better match the real habits of your users? Many have tried and failed with full-length video programming on the Web; that&#8217;s because (so far at least) most Internet audiences can&#8217;t sit still long enough to watch a 30- or 60-minute program. Adapt your delivery to what your audience wants.</li>
<li><strong>How can I best relate to them?</strong> Personality is critical &#8212; so which of your brands&#8217; public talents and personalities relate to whom? It might be a popular columnist, Don Draper, or Boo the Pomeranian. Figure out which personalities your audience connect with the most, and leverage them into the other themes and packages.</li>
<li><strong>What secret signals is my audience sending?</strong> Target <a href="http://www.nytimes.com/2012/02/19/magazine/shopping-habits.html">famously figured out</a> how to identify pregnant shoppers and even estimate their due dates months before the woman ever purchased a stroller or a pack of diapers. Find out which clues in your data indicate that a customer may be on the path to a new phase of life, and start messaging them with your relevant content even before they get there.</li>
<li><strong>Where is my sweet spot?</strong> Once you discover the key themes, packages and personalities that resonate with your audience the most (and at which relevant life stages), you can cross the data sets and identify your best untapped opportunities. Don&#8217;t just tweak your existing products and advertising &#8212; create whole new products that are designed specifically to thrive at the intersection. Just as the strong affinity overlap for Spacey/Fincher/Cards gave Netflix the confidence to make a bold bet, your own Venn diagram will spotlight your best chances to create knockout content that is destined to succeed.</li>
</ul>
<h4 class="subhed">Rethinking Management: Ask, Understand, Execute</h4>
<p>When it comes to dealing with Big Data, our skills haven&#8217;t evolved as fast as our capacity. We all have a functional specialty, whether it be content creation or distribution or sales or management &#8212; so whose job is it to ask the right questions of the data? Big insights and actions aren&#8217;t led by a data scientist; they are led by an executive who has an integrated view of customers, products, distribution and sales.</p>
<p>But asking Big Data the right questions isn&#8217;t just a new practice to add to the management to-do list. Pulling it off requires a rethinking of the manager&#8217;s role entirely. We&#8217;ve traditionally thought of management as the discipline of managing people and managing the business. Now it&#8217;s time to add &#8220;managing our understanding&#8221; to the job description.</p>
<p>The time of the executives who merely &#8220;execute&#8221; is past. The successful executives in this post-Big Data world first ask, understand, and then execute with the full support of the data behind them.</p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20130520/in-media-big-data-is-booming-but-big-results-are-lacking/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Will Bloomberg Disclose How Heavily Reporters Mined Customer Data? (It Watches Them, Too.)</title>
		<link>http://allthingsd.com/20130513/will-bloomberg-disclose-how-heavily-reporters-mined-customer-data-it-watches-them-too/</link>
		<comments>http://allthingsd.com/20130513/will-bloomberg-disclose-how-heavily-reporters-mined-customer-data-it-watches-them-too/#comments</comments>
		<pubDate>Mon, 13 May 2013 20:38:03 +0000</pubDate>
		<dc:creator>Arik Hesseldahl</dc:creator>
				<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[Media]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Bloomberg]]></category>
		<category><![CDATA[Bloomberg News]]></category>
		<category><![CDATA[Bloomberg Terminal]]></category>
		<category><![CDATA[European Central Bank]]></category>
		<category><![CDATA[Federal Reserve]]></category>
		<category><![CDATA[financial data]]></category>
		<category><![CDATA[Frederick Winslow Taylor]]></category>
		<category><![CDATA[Goldman Sachs]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[JPMorgan]]></category>
		<category><![CDATA[Matt Winkler]]></category>
		<category><![CDATA[software]]></category>
		<category><![CDATA[Wall Street]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=320877</guid>
		<description><![CDATA[Bloomberg tracks its employees as much as it does its clients. It probably knows exactly how many times a controversial function was used by its reporters.]]></description>
				<content:encoded><![CDATA[<p><a href="http://allthingsd.com/20130511/bloomberg-news-busted-for-spying-on-bankers/bloomberg_eyes/" rel="attachment wp-att-320557"><img src="http://allthingsd.com/files/2013/05/Bloomberg_eyes.jpg" alt="Bloomberg_eyes" width="380" height="280" class="alignright size-full wp-image-320557" /></a>Let me start  by saying up front that I used to work at Bloomberg News, so what I&#8217;m about to say is informed by that experience. I worked at Bloomberg for about a year after the company bought BusinessWeek magazine from McGraw-Hill and turned it into Bloomberg Businessweek.</p>
<p>In that capacity, I learned to use the Bloomberg terminal that sits on some 315,000 desks in the financial industry and that makes the company all its money. And I learned early on that practically every move terminal users make is tracked and recorded. (Also, it goes without saying but I&#8217;ll say it anyway, this website is owned by News Corp., which owns Dow Jones, which is a competitor to Bloomberg.)</p>
<p>In a <a href="http://www.bloomberg.com/news/2013-05-13/holding-ourselves-accountable.html">Sunday editorial on Bloomberg View</a>, the company&#8217;s equivalent of an Op/Ed page, Editor-in-Chief Matt Winkler wrote, &#8220;Our reporters should not have access to any data considered proprietary. I am sorry they did. The error is inexcusable.&#8221; He opens the piece by quoting from a section of his book, &#8220;The Bloomberg Way&#8221;: </p>
<blockquote class="small"><p>&#8220;The appearance of impropriety can be as damaging to a reputation as doing something improper. Because we hold others accountable for disclosure, we expect the same of ourselves. While disclosing errors of judgment may be embarrassing, the sooner the lapses are reported, the sooner there is nothing more to say.&#8221;</p></blockquote>
<p>This raises the question: How fully will Bloomberg News disclose what it admits to be an &#8220;error of judgment&#8221; that is &#8220;almost as old as Bloomberg News.&#8221; How many reporters used the Z function &#8212; a software command that displays whether or not a customer is logged in and which functions he or she has been using the most &#8212; over the many years it was available to them? Chances are that Bloomberg has the data on precisely how often it was used and by which reporters. It could with some effort call in a third party to perform a detailed audit on this, and then disclose the findings of that audit to clients and the rest of the world.</p>
<p>I&#8217;ve asked Bloomberg about this. Spokeswoman Lauren Meller didn&#8217;t have an immediate answer. If I get one I&#8217;ll post it here.</p>
<p>If you&#8217;re going to properly understand the controversy that has emerged about the company in recent days, you need to understand the basics of the terminal itself. Bloomberg is at its very heart a financial data software company. In executing a &#8220;function&#8221; on its terminals, which are seen as status symbols of the financial industry, you type a command, usually one to four letters, and hit the Go key, which replaces the Return key on the conventional keyboard. When looking up, say, the price and fundamentals of Apple shares, you type AAPL, hit a key labeled Equity to indicate the first four letters are intended to indicate a stock ticker symbol, and then hit Go.</p>
<p>During the year I worked there I never heard about the so-called &#8220;Z function&#8221; at the heart of the current controversy, but its existence isn&#8217;t surprising. If you <a href="http://allthingsd.com/20130511/bloomberg-news-busted-for-spying-on-bankers/">haven&#8217;t been paying attention,</a> here&#8217;s what it&#8217;s all about. All 2,000-odd reporters at Bloomberg News have these terminals on their desks and use them to conduct research, report, write and publish their stories, and to communicate within the organization and without. </p>
<p>The Z function, now disabled for newsroom employees, showed when clients were and were not logged in to the system. When someone hadn&#8217;t logged in in a while, an attentive reporter might see that as a tip that the person was changing jobs or had left a firm, and then the reporter would start asking questions. Stories about executives moving between firms tend to be popular among terminal clients. It also showed what other Bloomberg functions these clients had been using, but in a non-specific way that wouldn&#8217;t show what stock or bond or other matter they might be researching or which news stories they had been reading. Bloomberg reporters are also said to have had access to transcripts of customer service calls clients made seeking help with functions. </p>
<p>Bloomberg is and has always been a &#8220;big data&#8221; company. The newly fashionable idea that you can learn a great deal and thus improve a software application by analyzing the big mass of data gathered about how it is used and where users run into problems has been been at the core of Bloomberg&#8217;s operational philosophy from the beginning. </p>
<p>Employees know from the moment they join the company that the amount of time they spend at their desks is logged. Building security systems are linked to the terminal and an access badge. When you &#8220;badge in&#8221; at any Bloomberg office around the world, this occurrence is logged. If you&#8217;re a Bloomberg employee based in New York and happen to be visiting London or Tokyo, your arrival and departure times are tracked, as is the amount of time your terminal is idle, should you be out gathering news or taking a lunch break.</p>
<p>I never experienced this first hand, but I heard privately shared tales from colleagues about their annual performance reviews, and discussions would at times turn to how well they used the terminal to do their jobs. As I was first joining, a friend who had worked at Bloomberg for a while told me that during one such conversation, he was mildly scolded for using Yahoo Finance to look up some bit of financial data &#8212; terminals also have Web browsers &#8212; rather than the terminal itself. </p>
<p>I point out this conversation for a reason. In its quest to make its products better &#8212; certainly a logical goal &#8212; Bloomberg clearly tracks how often its clients use its many functions. If that&#8217;s true, then it logically follows that it tracks how its reporters do the same thing. Historical data on the use by reporters of the Z function exists and can be examined.</p>
<p>As an organization, Bloomberg News is journalism as re-imagined by <a href="http://en.wikipedia.org/wiki/Frederick_Winslow_Taylor">Fredrick Winslow Taylor</a>, the philosophical father of factory automation. Its reporters are routinely gauged on how often they log scoops that appear on the terminal&#8217;s news service. Taylor believed that by analyzing work, the &#8220;One Best Way&#8221; to get it done could be found. As the founding editor of Bloomberg News, Winkler always struck me as an avid student of &#8220;Taylorism.&#8221; Every bit of data that can be gathered is analyzed to make the news gathering process better and more efficient. Indeed, Winkler&#8217;s 360-page book seems almost Taylor-inspired.</p>
<p>Scoops and other distinctive stories are further categorized into a taxonomy using an <a href="http://gawker.com/5468834/bloomberg-news-thy-taskmaster-is-the-breaking-news-points-system">internal nomenclature</a>, and the best of those are singled out in what&#8217;s known as &#8220;Matt&#8217;s Note,&#8221; a weekly memo from Winkler. An MMWin, or a market-moving win, is a story that beats a similar story by a competitor by several minutes and that after publication causes the market to react in some way. A Follow is when a competitor writes a story that follows up on a Bloomberg scoop. And there are many others. </p>
<p>All of these are thought to be tracked and used to evaluate a reporter&#8217;s performance every year. That means there&#8217;s data on how many reporters used the Z function and about whom, and probably data as well correlating to the published stories that resulted.</p>
<p>With at least two large banks complaining, and now two government entities &#8212; the <a href="http://www.cnbc.com/id/100729418">Federal Reserve</a> in the U.S. and the <a href="http://uk.reuters.com/article/2013/05/13/uk-bloomberg-data-ecb-idUKBRE94C0JN20130513">European Central Bank</a> &#8212; asking questions about all this, you can expect concerns about the lines between Bloomberg&#8217;s business and news operations to persist.</p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20130513/will-bloomberg-disclose-how-heavily-reporters-mined-customer-data-it-watches-them-too/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Actian to Acquire Big-Data Startup ParAccel</title>
		<link>http://allthingsd.com/20130425/actian-to-acquire-big-data-startup-paraccel/</link>
		<comments>http://allthingsd.com/20130425/actian-to-acquire-big-data-startup-paraccel/#comments</comments>
		<pubDate>Thu, 25 Apr 2013 11:00:14 +0000</pubDate>
		<dc:creator>Arik Hesseldahl</dc:creator>
				<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Actian]]></category>
		<category><![CDATA[Amazon]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[Bay Partners]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[CA]]></category>
		<category><![CDATA[Garnett & Helfrich Capital]]></category>
		<category><![CDATA[M&A]]></category>
		<category><![CDATA[MDV]]></category>
		<category><![CDATA[Menlo Ventures]]></category>
		<category><![CDATA[mergers and acquisitions]]></category>
		<category><![CDATA[ParAccel]]></category>
		<category><![CDATA[Pervasive Software]]></category>
		<category><![CDATA[Tao Venture Partners]]></category>
		<category><![CDATA[Walden International]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=315121</guid>
		<description><![CDATA[Actian rolls up its third acquisition in five months.]]></description>
				<content:encoded><![CDATA[<p><a href="http://allthingsd.com/20130425/actian-to-acquire-big-data-startup-paraccel/actian-paraccel/" rel="attachment wp-att-315134"><img src="http://allthingsd.com/files/2013/04/actian-paraccel-380x251.png" alt="actian-paraccel" width="380" height="251" class="alignright size-medium wp-image-315134" /></a>Actian, a privately held player in the big data and business analytics software space that&#8217;s lately been known for making acquisitions, is about to close another. Later today, the company will announce a deal to acquire ParAccel, a well-funded startup that specializes in analytics database software.</p>
<p>Actian CEO Steven Shine told <strong>AllThingsD</strong> that the combined company will have revenue north of $150 million and 450 employees around the world. </p>
<p>It&#8217;s Actian&#8217;s third significant acquisition since 2011. In January, it paid $162 million for Pervasive Software, a publicly held software company. Late last year, it <a href="http://globenewswire.com/news-release/2012/11/21/506794/10013391/en/Versant-Agrees-to-be-Acquired-by-Actian-for-13-00-per-Share.html">paid $37 million for Versant</a>, beating out a bid from another company.</p>
<p>Financial terms were not disclosed, since both companies are private. But the deal marks an exit for ParAccel&#8217;s investors, including Amazon, MDV, Bay Partners, Walden International, Tao Venture Partners and Menlo Ventures, who had put in a combined $64 million since its founding in 2007. The most recent capital injection was a $20 million venture round led by Amazon that was <a href="http://www.finsmes.com/2012/04/paraccel-closes-20m-funding.html">announced a year ago</a>.</p>
<p>ParAccel specializes in high-end databases, and has seen a <a href="http://blogs.wsj.com/venturecapital/2011/03/10/paraccel-feeling-fine-after-acquisition-smoke-clears/">handful of its primary competitors</a>, like Vertica and Aster Data Systems, acquired by the likes of Hewlett-Packard and Teradata, respectively. Its Analytic Platform brings together an analytic database along with features to extend it and integrate it with other technologies for running big-data analytics. Its customers include Amazon, Royal Bank of Scotland, OfficeMax and MicroStrategy.</p>
<p>Amazon uses ParAccel&#8217;s technology in its <a href="http://aws.amazon.com/redshift/">RedShift cloud-based data warehousing service</a>, while MicroStrategy uses it to power a <a href="http://www.microstrategy.com/about-us/press/release/?ctry=167&#038;id=2302">business intelligence product</a>.</p>
<p>Shine said that as more companies begin to struggle with their big-data and analytics problems, their choices first seem limited to large vendors like IBM and Oracle and EMC&#8217;s Greenplum. &#8220;When you look at companies that are focused purely on data, you see the behemoths, and most of those drag hardware along with them,&#8221; he said. &#8220;And then you look down below and see a lot of Hadoop spinoffs.&#8221;</p>
<p>Companies are looking for help in getting all the various threads of gathering, managing and analyzing big troves of data and then turning it all into useful business intelligence, Shine said. And they also want to do it in the cloud, and that&#8217;s where he sees the opportunity.</p>
<p>Actian was born as a database product called Ingres inside CA Technologies, one it acquired in the 1990s. In 2004, CA decided to turn it into an open-source product. And in 2005, during a fit of streamlining, CA&#8217;s Ingres assets were spun out as a privately held company, majority owned by the private equity firm <a href="http://www.garnetthelfrich.com/">Garnett &#038; Helfrich Capital</a>, and became Ingres Corp. It changed its name to Actian in 2011.</p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20130425/actian-to-acquire-big-data-startup-paraccel/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Big Data's Usability Problem</title>
		<link>http://allthingsd.com/20130423/big-datas-usability-problem/</link>
		<comments>http://allthingsd.com/20130423/big-datas-usability-problem/#comments</comments>
		<pubDate>Tue, 23 Apr 2013 19:44:56 +0000</pubDate>
		<dc:creator>Bill Wise</dc:creator>
				<category><![CDATA[Voices]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Bill Wise]]></category>
		<category><![CDATA[Excel]]></category>
		<category><![CDATA[FBI]]></category>
		<category><![CDATA[Fox News]]></category>
		<category><![CDATA[MediaOcean]]></category>
		<category><![CDATA[Reinhart-Rogoff]]></category>
		<category><![CDATA[Senator Lindsay Graham]]></category>
		<category><![CDATA[Tamerlan Tsarnaev]]></category>
		<category><![CDATA[usability]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=314653</guid>
		<description><![CDATA[In a wide sea of data, a few lines of code can be very easy to overlook.]]></description>
				<content:encoded><![CDATA[<p><img src="http://allthingsd.com/files/2013/04/toomuch380.jpg" alt="toomuch380" width="380" height="285" class="alignright size-full wp-image-314668" /></p>
<p>Sen. Lindsay Graham <a href="http://thehill.com/video/senate/295263-graham-misspelled-name-helped-bombing-suspects-russia-trip-go-unnoticed#ixzz2RDQVRqLg">just told Fox News</a> that the reason the FBI never realized that Boston Marathon bombing suspect Tamerlan Tsarnaev went to Russia in 2011 is that &#8220;when he got on the Aeroflot plane, they misspelled his name, so it never went into the system that he actually went to Russia.&#8221; Meanwhile, the Reinhart-Rogoff paper that has been a catalyst for government austerity policies worldwide since 2010 has, in fact, accidentally left out several countries&#8217; worth of critical data in Excel. </p>
<p><a href="http://www.nextnewdeal.net/rortybomb/researchers-finally-replicated-reinhart-rogoff-and-there-are-serious-problems">As one blogger sums up scathingly</a>: &#8220;One of the core empirical points providing the intellectual foundation for the global move to austerity in the early 2010s was based on someone accidentally not updating a row formula in Excel.&#8221;</p>
<p>Taken together, these factors offer a critical lesson here about the power and limits of Big Data today. In both scenarios, data management tools (i.e., the FBI&#8217;s systems and Excel) were undone by fairly simple errors: In one situation, a misspelling; in another, a failure to code a spreadsheet properly. And in both scenarios, the results were dire &#8212; an awful tragedy, and a potentially misdirected government economic policy in the midst of a recession.</p>
<p>As someone who spends day and night thinking through data management and workflow, these two stories lead me to three observations:</p>
<ul>
<li>As a society, we&#8217;re hugely reliant on data management platforms for our most critical information.</li>
<li>Our core data platforms often aren&#8217;t set up to handle human error, from basic coding flaws to spelling mistakes.</li>
<li>The wealth of data in our data tools can mask that human error. Consider: The <a href="http://www.nber.org/papers/w15639.pdf?new_window=1">Reinhart-Rogoff study examined</a> &#8220;new data on forty-four countries spanning about two hundred years&#8221; with &#8220;over 3,700 annual observations covering a wide range of political systems, institutions, exchange rate arrangements, and historic circumstances.&#8221;</li>
</ul>
<p>In such a wide sea of data, a few lines of code can be very easy to overlook, even if they have strong ramifications for analysis.</p>
<p>There are lots of things to take away from these three points, but I&#8217;ll just focus on one: The promise of Big Data is that it can make everyday processes &#8212; from critical analyses to mundane tasks &#8212; work smarter through data intelligence. Ultimately, all that data management translates into an economy and society that lets machines handle the minutiae as humans think through the larger picture.</p>
<p>To a large extent, that vision is already here. But at the same time, more human/data interaction means a lot more room for error (and inefficiency) around increasingly critical data sets &#8212; which, as we&#8217;ve seen, can have very serious results. Which means that, if we want to make the reality of Big Data match the dream, we need to spend serious time around providing usability that guides human users in the best way to engage with the data, and automation that takes human interaction (and human error) out of the picture for a lot of the basic calculations and tasks &#8212; and for some of the complicated ones, too.</p>
<p>If Big Data can&#8217;t fit hand-in-glove with usability and workflow, a lot of the promise of big data will be empty data crunching. That&#8217;s not just a problem for getting where we want to be in the evolution of computing. It&#8217;s a situation that can lead to bad data management &#8212; which translates into bad economics and, sometimes, far worse.</p>
<p><em>Bill Wise is CEO of Mediaocean. You can follow him on twitter at <a href="http://twitter.com/billwise">@billwise</a>.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20130423/big-datas-usability-problem/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Tableau Software Plots Latest Big Data IPO</title>
		<link>http://allthingsd.com/20130403/tableau-software-plots-latest-big-data-ipo/</link>
		<comments>http://allthingsd.com/20130403/tableau-software-plots-latest-big-data-ipo/#comments</comments>
		<pubDate>Wed, 03 Apr 2013 16:44:36 +0000</pubDate>
		<dc:creator>Telis Demos</dc:creator>
				<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Voices]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[initial public offering]]></category>
		<category><![CDATA[IPO]]></category>
		<category><![CDATA[Splunk]]></category>
		<category><![CDATA[Tableau Software]]></category>
		<category><![CDATA[Telis Demos]]></category>
		<category><![CDATA[The Wall Street Journal]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=308898</guid>
		<description><![CDATA[The next “big data” and “cloud” plays are teeing up for IPOs.]]></description>
				<content:encoded><![CDATA[<p>The next “big data” and “cloud” plays are teeing up for IPOs.</p>
<p>Seattle-based Tableau Software Inc. will hope to replicate the success of Splunk Inc. which has jumped 133 percent since its April 2012 debut. Tableau is seeking to raise about $150 million in an IPO that could happen in about a month.</p>
<p><a href="http://blogs.wsj.com/deals/2013/04/03/tableau-software-plots-latest-big-data-ipo/">Read the rest of this post on the original site »</a></p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20130403/tableau-software-plots-latest-big-data-ipo/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>"Big Data" for Cancer Care</title>
		<link>http://allthingsd.com/20130327/big-data-for-cancer-care/</link>
		<comments>http://allthingsd.com/20130327/big-data-for-cancer-care/#comments</comments>
		<pubDate>Wed, 27 Mar 2013 14:00:35 +0000</pubDate>
		<dc:creator>Ron Winslow</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[Voices]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[cancer]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[health care]]></category>
		<category><![CDATA[oncology]]></category>
		<category><![CDATA[Ron Winslow]]></category>
		<category><![CDATA[The Wall Street Journal]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=307087</guid>
		<description><![CDATA[A major oncology group is launching an ambitious project to collect data on the care of hundreds of thousands of cancer patients and use it to help guide treatment of other patients across the health-care system.]]></description>
				<content:encoded><![CDATA[<p>A major oncology group is launching an ambitious project to collect data on the care of hundreds of thousands of cancer patients and use it to help guide treatment of other patients across the health-care system.</p>
<p>Cancer doctors would be able to consult the database, much like doing a Google search. They would get advice on treatment strategies that might work for their patients based on how similar patients fared in practices around the U.S.</p>
<p><a href="http://online.wsj.com/article/SB10001424127887323466204578384732911187000.html">Read the rest of this post on the original site »</a></p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20130327/big-data-for-cancer-care/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Tracking Sensors Invade the Workplace</title>
		<link>http://allthingsd.com/20130307/tracking-sensors-invade-the-workplace/</link>
		<comments>http://allthingsd.com/20130307/tracking-sensors-invade-the-workplace/#comments</comments>
		<pubDate>Thu, 07 Mar 2013 15:00:07 +0000</pubDate>
		<dc:creator>Rachel Emma Silverman</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[Voices]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[employees]]></category>
		<category><![CDATA[Rachel Emma Silverman]]></category>
		<category><![CDATA[sensors]]></category>
		<category><![CDATA[The Wall Street Journal]]></category>
		<category><![CDATA[tracking]]></category>
		<category><![CDATA[workplace]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=301277</guid>
		<description><![CDATA[As Big Data becomes a fixture of office life, companies are turning to tracking devices to gather real-time information on how teams of employees work and interact.]]></description>
				<content:encoded><![CDATA[<p>A few years ago when Bank of America Corp. wanted to study whether face time mattered among its call-center teams, the big bank asked about 90 workers to wear badges for a few weeks with tiny sensors to record their movements and the tone of their conversations.</p>
<p>The data showed that the most productive workers belonged to close-knit teams and spoke frequently with their colleagues. So, to get more employees mingling, the bank scheduled workers for group breaks, rather than solo ones.</p>
<p><a href="http://online.wsj.com/article/SB10001424127887324034804578344303429080678.html">Read the rest of this post on the original site »</a></p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20130307/tracking-sensors-invade-the-workplace/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Big Data, Soft Sell: Netflix Pitches a Hands-Off Approach to Hollywood</title>
		<link>http://allthingsd.com/20130301/big-data-soft-sell-netflix-pitches-a-hands-off-approach-to-hollywood/</link>
		<comments>http://allthingsd.com/20130301/big-data-soft-sell-netflix-pitches-a-hands-off-approach-to-hollywood/#comments</comments>
		<pubDate>Fri, 01 Mar 2013 18:48:11 +0000</pubDate>
		<dc:creator>Peter Kafka</dc:creator>
				<category><![CDATA[Dive Into Media]]></category>
		<category><![CDATA[Media]]></category>
		<category><![CDATA[Arrested Development]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[HBO]]></category>
		<category><![CDATA[House of Cards]]></category>
		<category><![CDATA[Mitch Hurwitz]]></category>
		<category><![CDATA[Netflix]]></category>
		<category><![CDATA[Showtime]]></category>
		<category><![CDATA[Ted Sarandos]]></category>
		<category><![CDATA[Time Warner]]></category>
		<category><![CDATA[Will Arnett]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=299728</guid>
		<description><![CDATA[Sure, Netflix knows a lot about you and what you like to watch. But that doesn't mean it knows how to make stuff you want to watch.]]></description>
				<content:encoded><![CDATA[<p><a href="http://allthingsd.com/files/2013/02/Hurwitz_Arnett.jpg"><img class="alignright size-medium wp-image-294735" alt="Hurwitz_Arnett" src="http://allthingsd.com/files/2013/02/Hurwitz_Arnett-380x253.jpg" width="380" height="253" /></a></p>
<p>Here&#8217;s a narrative <a href="http://gigaom.com/2012/06/14/netflix-analyzes-a-lot-of-data-about-your-viewing-habits/">lots</a> <a href="http://www.salon.com/2013/02/01/how_netflix_is_turning_viewers_into_puppets/">of</a> <a href="http://www.nytimes.com/2013/02/25/business/media/for-house-of-cards-using-big-data-to-guarantee-its-popularity.html?pagewanted=all">people</a> like right now: In the old days, the movie and TV guys had to guess about what kind of stuff you wanted to see. But Netflix doesn&#8217;t need to guess: It knows so much about your viewing habits that it knows exactly what you want, so it picks films and shows accordingly.</p>
<p>And now it&#8217;s using all of that Big Data to <em>create</em> stuff you want to see, using its analytics to shape the original videos it is funding.</p>
<p>Take that, <a href="http://en.wikipedia.org/wiki/Adventures_in_the_Screen_Trade#.22Nobody_Knows_Anything.22">William Goldman</a>!</p>
<p>But if you&#8217;re a Hollywood creative person, that narrative wouldn&#8217;t sound nearly as appealing: You&#8217;re already used to getting tons of input and edicts from suits. Now Netflix is promising to up the ante by using computers, too?</p>
<p>Which is why Netflix is <em>not</em> selling that pitch to writers, directors and actors.</p>
<p>Instead, it is promising a hands-off &#8212; or mostly hands-off &#8212; approach: <em>We&#8217;ll give you a bunch of money to go make something, and you go make it. And maybe we&#8217;ll offer some suggestions.</em> (Which is the same approach, by the way, that networks like HBO and Showtime used to coax Hollywood talent for their originals for a long time.)</p>
<p>No need to take my word for it, though. You can hear it straight from the folks who are working with Netflix on the new stuff.</p>
<p>At our <strong><a href="http://allthingsd.com/category/dive-into-media/">D: Dive Into Media</a></strong> conference last month, we had Mitch Hurwitz and Will Arnett, creator and star, respectively, of &#8220;Arrested Development,&#8221; onstage with Ted Sarandos, the Netflix executive who paid them to make a new season of the show. The whole segment is a bunch of fun, but if you want to hear about the input Sarandos did and didn&#8217;t have on &#8220;Arrested Development,&#8221; skip ahead to the 30:30 mark:</p>
<p><div class="video-wsj"><object width="640" height="360"><param name="movie" value="http://s.wsj.net/media/swf/microPlayer.swf"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><param name="flashvars" value="videoGUID=37F662AB-9BC4-422F-B7D5-91C0E2C155BB&playerid=4001&plyMediaEnabled=1&configURL=http://m.wsj.net/video-players/&autoStart=false" base="http://s.wsj.net/media/swf/"name="microflashPlayer"></param><embed src="http://s.wsj.net/media/swf/microPlayer.swf" bgcolor="#FFFFFF" flashVars="videoGUID={37F662AB-9BC4-422F-B7D5-91C0E2C155BB}&playerid=4001&plyMediaEnabled=1&configURL=http://m.wsj.net/video-players/&autoStart=false" base="http://s.wsj.net/media/swf/" name="microflashPlayer" width="640" height="360" seamlesstabbing="false" type="application/x-shockwave-flash" swLiveConnect="true" pluginspage="http://www.macromedia.com/shockwave/download/index.cgi?P1_Prod_Version=ShockwaveFlash"></embed><br />[ See post to watch video ]</div></object></p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20130301/big-data-soft-sell-netflix-pitches-a-hands-off-approach-to-hollywood/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>IBM Knows When to Acquire and When to Divest</title>
		<link>http://allthingsd.com/20130228/ibm-knows-when-to-acquire-and-when-to-divest/</link>
		<comments>http://allthingsd.com/20130228/ibm-knows-when-to-acquire-and-when-to-divest/#comments</comments>
		<pubDate>Thu, 28 Feb 2013 16:56:34 +0000</pubDate>
		<dc:creator>Arik Hesseldahl</dc:creator>
				<category><![CDATA[Commerce]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[acquisitions]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[Autonomy]]></category>
		<category><![CDATA[Big Blue]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[EDS]]></category>
		<category><![CDATA[enterprise hardware]]></category>
		<category><![CDATA[enterprise software]]></category>
		<category><![CDATA[Ginni Rometty]]></category>
		<category><![CDATA[Hewlett-Packard]]></category>
		<category><![CDATA[HP]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[investors day]]></category>
		<category><![CDATA[M&A]]></category>
		<category><![CDATA[mergers and acquisitions]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=299406</guid>
		<description><![CDATA[Also, Big Blue bets bigger than before on Big Data.]]></description>
				<content:encoded><![CDATA[<p><a href="http://allthingsd.com/20130228/ibm-knows-when-to-acquire-and-when-to-divest/ginni_rometty_ibm2/" rel="attachment wp-att-299407"><img src="http://allthingsd.com/files/2013/02/ginni_rometty_IBM2-380x253.jpg" alt="ginni_rometty_IBM2" width="380" height="253" class="alignright size-medium wp-image-299407" /></a>It always pays to know when and what to buy. It also pays to know when and what to sell.</p>
<p>That was a point that IBM CEO Ginni Rometty made today in remarks at the company&#8217;s investors day in San Jose, Calif. IBM is known for making numerous acquisitions over the years, a <a href="http://allthingsd.com/20121219/ibm-to-acquire-storediq-a-manager-of-corporate-data/">recent example being StoredIQ</a>. </p>
<p>What does IBM look for in an acquisition? Rometty boiled it down to three questions the company asks before every deal: &#8220;Does it extend a capability we have? Does it have scalable intellectual property? Can we extend it to 173 countries around the world?&#8221;</p>
<p>In a perhaps not-so-veiled shot at rival Hewlett-Packard, given its combined $16 billion in write-downs on the acquisitions of EDS and Autonomy last year, Rometty said IBM feels strongly that &#8220;companies get in trouble when they acquire something that takes them into a new space.&#8221;</p>
<p>But just as important to Big Blue&#8217;s success in recent years has been its decisions to get <em>out</em> of certain businesses. Examples include the PC business, which it sold to Lenovo in 2004, and, more recently, its retail point-of-sale business, which it <a href="http://www.ibm.com/investor/ircorner/article/rss.wss">sold to Japan&#8217;s Toshiba</a> last year.</p>
<p>&#8220;Over the last decade, we have divested $15 billion worth of revenue,&#8221; Rometty said. &#8220;If we had not, we would be a larger company, but we would also be a lesser-margin company, and we would have capabilities that our clients would be less interested in.&#8221;</p>
<p>Rometty also said that IBM expects to continue its big bets on technologies like Big Data and analytics. &#8220;Data will be the basis of competitive advantage for every company, for every industry in the coming decade.&#8221;</p>
<p>To that end, she said that IBM now expects revenue from business analytics to account for as much as $20 billion in annual revenue by fiscal 2015. The prior target was $16 billion. And if Big Blue hits that goal it would amount to a doubling of analytics revenue from 2010.</p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20130228/ibm-knows-when-to-acquire-and-when-to-divest/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Why Credit Card Companies Need Some Madison Avenue Style</title>
		<link>http://allthingsd.com/20130207/why-credit-card-companies-need-some-madison-avenue-style/</link>
		<comments>http://allthingsd.com/20130207/why-credit-card-companies-need-some-madison-avenue-style/#comments</comments>
		<pubDate>Thu, 07 Feb 2013 23:50:14 +0000</pubDate>
		<dc:creator>Alistair Goodman</dc:creator>
				<category><![CDATA[Mobile]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Voices]]></category>
		<category><![CDATA[Alistair Goodman]]></category>
		<category><![CDATA[Apple]]></category>
		<category><![CDATA[banks]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Bob Dylan]]></category>
		<category><![CDATA[Chase]]></category>
		<category><![CDATA[credit cards]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[Durbin Amendment]]></category>
		<category><![CDATA[EE]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[mobile devices]]></category>
		<category><![CDATA[O2]]></category>
		<category><![CDATA[PayPal]]></category>
		<category><![CDATA[Placecast]]></category>
		<category><![CDATA[Silicon Valley]]></category>
		<category><![CDATA[Telefonica UK]]></category>
		<category><![CDATA[Vodafone UK]]></category>
		<category><![CDATA[WEVE]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=292784</guid>
		<description><![CDATA[Facing major competitors in the mobile wallet and offers business, credit card companies need to figure out how to leverage their relationships with consumers.]]></description>
				<content:encoded><![CDATA[<p><div id="attachment_292797" class="wp-caption alignright" style="width: 390px"><a href="http://allthingsd.com/files/2013/02/madison.jpg"><img src="http://allthingsd.com/files/2013/02/madison.jpg" alt="madison" width="380" height="285" class="size-full wp-image-292797" /></a><p class="wp-caption-text"><span class="media-attribution">Image copyright <a href="http://www.shutterstock.com/gallery-5230p1.html">Gregory James Van Raalte</a></span></p></div>In 2012, companies like PayPal, Google and Apple made major announcements in the mobile wallets and offers space. Credit card companies were not nearly as noisy, but 2013 could be the year that all changes.</p>
<p>Financial institutions are embracing digital offers out of necessity because revenue from current sources is in decline. The future of the payments industry is shifting quickly to monetizing consumer relationships &#8212; particularly through mobile devices &#8212; and away from extracting new value from the payments value chain.</p>
<p>What many do not know is that credit card companies have experienced significant pain as a result of the capping of their lucrative interchange fees by <a href="http://www.nerdwallet.com/blog/banking/durbin-amendment-explained/">the Durbin Amendment</a>, which became effective in 2011. Faced with declines in what had previously been a significant form of revenue, credit card companies know that they need to open new sources of revenue. This need is becoming ever more urgent with companies like Google and Paypal joining the mix and leveraging their unique assets &#8212; novelty, retailer relationships, and extensive technology infrastructure.</p>
<p>The opportunity here is massive &#8212; total real-world (not online) retail commerce is $4 trillion a year in the U.S., and <a href="http://www.emarketer.com/About/Article.aspx?R=1009548">eMarketer just reported that mobile advertising is expected to increase by 180 percent</a> in the coming year to reach $4 billion. Credit card companies looking to take their fair share of this revenue need to figure out how to leverage their great relationships with consumers to open up new sources of revenue.</p>
<p>Here are my suggestions for strategies that will help credit card companies come out ahead in 2013:</p>
<p><strong>Stop chasing Chase. Instead, start acting like a media company.</strong><br />
To open new sources of revenue through media and technology, banks need to add new DNA that is not focused on traditional banking concerns such as regulatory compliance, security and fraud. Of course, I am not suggesting that they stop complying with regulations, or that they abandon security. But banks must become more nimble at attacking new revenue opportunities &#8212; particularly in consumer offers. At Google and other companies in Silicon Valley, teams still pull all-nighters to release alpha versions of products for consumers and partners in a matter of days to test an experience, an approach that is antithetical to the banking world. Yet if banks want to succeed in the media world, they have to figure out how at least part of their organization can play in an API-driven ecosystem that encourages collaboration and rapid product releases. First test for the C-suite at banks: Have you developed and released anything new to consumers in a single quarter?</p>
<p><strong>Play ball with the competitors.</strong><br />
Consider the value of the ecosystem in growing your mobile offers initiative. In order to succeed in delivering a great mobile rewards program, the key is having a rich supply of merchant offers upon which to layer targeting. No one company is going to assemble enough, so think instead about collaborating with all the other players and getting the network effects of a larger audience and a bigger pool of offers. An interesting model for this kind of collaboration is <a href="http://weve.com/">WEVE</a> in the UK, in which three mobile operators &#8212; <a href="http://ee.co.uk/">EE</a>, Telefonica UK (<a href="http://www.o2.com/">O2</a>) and <a href="http://www.vodafone.co.uk/">Vodafone UK</a> &#8212; have banded together to form a large consortium. The individual companies behind the consortium still compete to acquire subscribers, but they now collaborate with a common platform for monetizing those subscribers through marketing.</p>
<p><strong>Turn that marketing focus inside out.</strong><br />
Now that you have a great offers program, the biggest challenge is getting enough customers into it to move the needle. Each card provider today has marketing teams that focus solely on signing up new customers for their credit cards. Yet somehow, these are not the folks developing new digital offerings. These customer acquisition teams are digital media experts. Imagine focusing that valuable marketing experience on getting consumers to opt into new and sophisticated customer reward programs, and to download their new rewards apps. It is customer acquisition, pure and simple.</p>
<p><strong>Don&#8217;t wait for big data.</strong><br />
My last and most controversial suggestion for 2013 is to let go of the obsession with big data! Many banks are building huge databases for micro-targeting customers with niche offers. Microtargeting is cool, but believe me, media is a war of attrition and all of that big data-based targeting will yield a customer segment of just a few people. Brands do not want to target only a select handful of people with their offers; they need to reach a million people in order for their investment in creating and distributing that offer to make economic sense. Silicon Valley has managed to convince enterprises that massive amounts of data will build a program that really performs in terms of offers and sales; but ultimately, <em>microtargeting is not monetizable at scale until there are an equally large set of available offers</em>. The real way to success is to provide a great experience with lots of offers for your entire customer base. Yes, there are a handful of important variables that should be considered, including location, time of day, gender, interests and even retargeting; but other data on top is gravy and too much is at odds with reaching a large enough number of people with relevant offers. The &#8220;big picture&#8221; is not always based on big data. (Hint: You might derive as much value from customers by asking them what they want instead.)</p>
<p><strong>The Long View</strong><br />
As Bob Dylan said, &#8220;you don&#8217;t need a weatherman to know which way the wind blows.&#8221; Credit card companies can no longer hesitate while other companies race forward to make a profit on this kind of consumer spending. If credit card companies employ the strategies I propose, they will be real contenders against the likes of Paypal, Google, Apple and the mobile operators. The weather forecast is clear for mobile offers: Sunny, with billions of dollars blowing in the wind. It&#8217;s up to each financial institution to figure out how to get to market faster and catch some of it.</p>
<p><em>Alistair Goodman is the CEO of Placecast, where he leads a team of mobile, technology and marketing experts who have created the most scalable, proven, location-based marketing system currently available. Alistair has more than 20 years of experience working in marketing and product development efforts for media and technology companies. </em></p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20130207/why-credit-card-companies-need-some-madison-avenue-style/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>DataGravity Lands $30 Million From Andreessen Horowitz; Levine Joins Board</title>
		<link>http://allthingsd.com/20130129/datagravity-lands-30-million-from-andreessen-horowitz-levine-joins-board/</link>
		<comments>http://allthingsd.com/20130129/datagravity-lands-30-million-from-andreessen-horowitz-levine-joins-board/#comments</comments>
		<pubDate>Tue, 29 Jan 2013 12:56:12 +0000</pubDate>
		<dc:creator>Arik Hesseldahl</dc:creator>
				<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Andreessen Horowitz]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[Charles River Ventures]]></category>
		<category><![CDATA[DataGravity]]></category>
		<category><![CDATA[EqualLogic]]></category>
		<category><![CDATA[General Catalyst Partners]]></category>
		<category><![CDATA[John Joseph]]></category>
		<category><![CDATA[New Hampshire]]></category>
		<category><![CDATA[Paula Long]]></category>
		<category><![CDATA[Peter Levine]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=289577</guid>
		<description><![CDATA[Extracting intelligence from stored data is hard.]]></description>
				<content:encoded><![CDATA[<p><a href="http://allthingsd.com/20130129/datagravity-lands-30-million-from-andreessen-horowitz-levine-joins-board/datagravity-feature/" rel="attachment wp-att-289578"><img src="http://allthingsd.com/files/2013/01/datagravity-feature-380x285.png" alt="datagravity-feature" width="380" height="285" class="alignright size-medium wp-image-289578" /></a>For all the press it gets as the next business panacea, Big Data is hard. You can collect and analyze tons of data, but before companies can get anything useful out of it, generally speaking, it&#8217;s not uncommon for Ph.D.-level data scientists to be called in to figure out what to do with it, and a lot of heavy-duty software gets applied.</p>
<p>A company called DataGravity, based in Nashua, N.H., is aiming to reduce the complexity of getting useful information from stored data. It announced today that it has taken $30 million in a Series B round of venture capital financing. The round was led by Andreessen Horowitz, and <a href="http://allthingsd.com/20110321/peter-levine-veritas-veteran-and-data-center-guru-joins-andreesen-horowitz/">Peter Levine</a>, an AH partner, will join DataGravity&#8217;s board. Existing partners Charles River Ventures and General Catalyst Partners also particpated in the round. </p>
<p>The company is being a little cagey about what exactly it is doing, but it was started by Paula Long and John Joseph, two execs from EqualLogic, the storage company acquired by Dell in 2008. Long was a founder, and Joseph was vice president of marketing and product management. </p>
<p>Where EqualLogic was all about making the complex business of managing data storage in the enterprise easy enough for a general IT person to handle, DataGravity is aiming to do something similar in the area of extracting useful intelligence from data repositories.</p>
<p>&#8220;You&#8217;ll hear a lot about companies going after data repositories, but it&#8217;s usually done with a big professional services component to it,&#8221; Long said. &#8220;Usually a third-party company will arrive at your site with three or four people who will mine your data for you for $150 an hour. The companies we want to sell to won&#8217;t be able to afford that. Nor will they have the people with the necessary skills internally.&#8221;</p>
<p>&#8220;These are companies looking for business intelligence, but who are looking for it in a more consumable fashion,&#8221; Joseph said.</p>
<p>In a statement, Levine called DataGravity one of the &#8220;pioneering companies doing the hard work of solving intractable enterprise challenges.&#8221;</p>
<p>DataGravity expects to be more specific about its plans next year. In the meantime, it will use the new round of funding to help build the product and set a go-to-market plan for 2014. It is also actively recruiting people in the fields of data visualization and user experience.</p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20130129/datagravity-lands-30-million-from-andreessen-horowitz-levine-joins-board/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Has Big Data Reached Its Moment of Disillusionment?</title>
		<link>http://allthingsd.com/20130124/has-big-data-reached-its-moment-of-disillusionment/</link>
		<comments>http://allthingsd.com/20130124/has-big-data-reached-its-moment-of-disillusionment/#comments</comments>
		<pubDate>Fri, 25 Jan 2013 00:36:04 +0000</pubDate>
		<dc:creator>Arik Hesseldahl</dc:creator>
				<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[enterprise software]]></category>
		<category><![CDATA[featured post]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Hortonworks]]></category>
		<category><![CDATA[MapR]]></category>
		<category><![CDATA[Mortar Data]]></category>
		<category><![CDATA[Rob Bearden]]></category>
		<category><![CDATA[Trifacta]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=288297</guid>
		<description><![CDATA[Big Data is hard.]]></description>
				<content:encoded><![CDATA[<p><img src="http://allthingsd.com/files/2013/01/dissillusioned_lolcat-380x285.jpeg" alt="dissillusioned_lolcat" width="380" height="285" class="alignright size-medium wp-image-288492" /></p>
<p>Last year was a year when the phrase &#8220;Big Data&#8221; was <a href="http://allthingsd.com/20121204/eight-questions-for-rick-smolan-about-the-human-face-of-big-data/">all over the place</a>. Dig through the troves of data your business generates, the thinking goes, and some useful business intelligence falls out. That, at least, is the idea, and there are numerous companies &#8212; some startups, some big and established players &#8212; trying to build business plans on different aspects of that idea.</p>
<p>But in the course of being reduced to a simple buzzy phrase, Big Data as a concept implies some expectations, some realistic, some undoubtedly not. The research house Gartner has a phrase for this tendency as well: The Hype Cycle. </p>
<p>The Hype Cycle goes like this: A new technology that promises to fundamentally &#8220;change everything&#8221; gets talked up incessantly in the press and at industry events and often also in research reports. At some point the chatter peaks, and expectations reach a fever pitch. Soon, maybe a year or two after it all started to build and some money has been spent and everything that was supposed to have changed for the better actually <em>hasn&#8217;t</em>, the narrative focus turns negative. What seemed so brilliant and earthshaking 18 months ago, seems in restrospect to have been an ill-advised waste of time, money and attention.</p>
<p>This is what Gartner calls the &#8220;Trough of Disillusionment&#8221; phase of the Hype Cycle. </p>
<p><img src="http://allthingsd.com/files/2013/01/gartner_hype_cycle.png" alt="gartner_hype_cycle" width="380" height="285" class="alignleft size-full wp-image-288495" /></p>
<p>Gartner analyst Svetlana Sicular <a href="http://blogs.gartner.com/svetlana-sicular/big-data-is-falling-into-the-trough-of-disillusionment/">argues in a blog post</a> that Big Data may have reached that point. She has been &#8220;hearing from people in the center of the Hadoop movement,&#8221; the open-source technology central to companies like Cloudera, Hortonworks and MapR. She also presents a video of a Hadoop gathering called <a href="http://www.meetup.com/SF-Bay-Areas-Big-Data-Think-Tank/events/96526532/">Elephant Riders</a> where reps from these three companies are debating its current state. (It&#8217;s about 90 minutes and if you&#8217;re so inclined, you can <a href="http://www.youtube.com/watch?v=zCnClYLcCI8">see it here</a>.)</p>
<p>By Gartner&#8217;s standards, the trough of disillusionment may indeed have arrived, though you certainly wouldn&#8217;t be able to tell from the level of investment interest in companies like Cloudera, which late last year raised a massive <a href="http://allthingsd.com/20121206/exclusive-cloudera-closes-massive-65-million-funding-round-at-700-million-valuation/">$65 million round of funding</a>.</p>
<p>One source of that disillusionment, she writes, is that companies are struggling with a basic problem: What questions do you attempt to answer with your data in the first place? &#8220;Several days ago, a financial industry client told me that framing a right question to express a game-changing idea is extremely challenging,&#8221; Sicular wrote. &#8220;First, selecting a question from multiple candidates; second, breaking it down to many sub-questions; and, third, answering even one of them reliably. It is <em>hard</em>.&#8221;</p>
<p>Hadoop doesn&#8217;t exactly make that process any easier. Once you&#8217;ve decided to use it, getting anything useful out of it requires some pretty specialized knowledge and training, and finding the right people to do that isn&#8217;t easy. But the industry is beginning to respond to that need: Startups like <a href="http://allthingsd.com/20120402/mortar-data-hadoop-for-the-rest-of-us-gets-seed-funding/">Mortar Data</a> have sought to make Hadoop more readily accessible to mainstream programmers, while another called <a href="http://allthingsd.com/20121004/trifacta-aims-to-make-big-data-useful-lands-4-3-million-from-accel-partners/">Trifacta</a> makes the resulting data easier to manipulate.</p>
<p>And versions of Hadoop itself are getting incrementally easier to work with. Hortonworks, for example, recently released HDP 1.2, a new version of the open-source platform, but also Sandbox, a set of training tools that lets developers play around with Hadoop and get a feel for its use. </p>
<p>I talked with Hortonworks CEO Rob Bearden recently, and he said that, in 2011, companies had no idea what Hadoop could be used for, then spent 2012 experimenting with it, and now want to get some real-world value out of it in 2013. &#8220;This year, all the technology is coming together in a way that is consumable,&#8221; he said. &#8220;In the last quarter of last year we saw a lot of interesting production environments. Now the objectives are becoming clear for getting useful in 2013.&#8221;</p>
<p>The next milestone in the Hype Cycle, Sicular writes, is negative press. Eventually it&#8217;s followed by a period called the &#8220;Slope of Enlightenment,&#8221; and finally the &#8220;Plateau of Productivity.&#8221; It&#8217;s nice to know there could be a positive conclusion to all this somewhere down the road.</p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20130124/has-big-data-reached-its-moment-of-disillusionment/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Scientists Store King Speech, Shakespeare Sonnets in DNA</title>
		<link>http://allthingsd.com/20130123/scientists-store-king-speech-shakespeare-sonnets-in-dna/</link>
		<comments>http://allthingsd.com/20130123/scientists-store-king-speech-shakespeare-sonnets-in-dna/#comments</comments>
		<pubDate>Wed, 23 Jan 2013 19:10:11 +0000</pubDate>
		<dc:creator>Gautam Naik</dc:creator>
				<category><![CDATA[Media]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Voices]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[DNA]]></category>
		<category><![CDATA[Gautam Naik]]></category>
		<category><![CDATA[science]]></category>
		<category><![CDATA[storage]]></category>
		<category><![CDATA[The Wall Street Journal]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=287870</guid>
		<description><![CDATA[Scientists have stored audio and text on fragments of DNA and then retrieved them with near-perfect fidelity -- a technique that one day may provide a new way to handle the overwhelming data of the digital age.]]></description>
				<content:encoded><![CDATA[<p>Scientists have stored audio and text on fragments of DNA and then retrieved them with near-perfect fidelity &#8212; a technique that one day may provide a new way to handle the overwhelming data of the digital age.</p>
<p>The scientists encoded in DNA an audio clip of Martin Luther King Jr.&#8217;s &#8220;I Have a Dream&#8221; speech, a photograph, a copy of Crick and Watson&#8217;s famous &#8220;double helix&#8221; scientific paper from 1953 and Shakespeare&#8217;s 154 sonnets. They were then able to retrieve them with 99.99% accuracy. The experiment was reported Wednesday in the journal Nature.</p>
<p><a href="http://online.wsj.com/article/SB10001424127887324539304578259883507543150.html">Read the rest of this post on the original site »</a></p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20130123/scientists-store-king-speech-shakespeare-sonnets-in-dna/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Illusions of Expertise: Why Marketers Should Start Listening to Data</title>
		<link>http://allthingsd.com/20130115/illusions-of-expertise-why-marketers-should-start-listening-to-data/</link>
		<comments>http://allthingsd.com/20130115/illusions-of-expertise-why-marketers-should-start-listening-to-data/#comments</comments>
		<pubDate>Tue, 15 Jan 2013 20:30:04 +0000</pubDate>
		<dc:creator>Omar Tawakol</dc:creator>
				<category><![CDATA[Voices]]></category>
		<category><![CDATA[algorithm]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[BlueKai]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[marketers]]></category>
		<category><![CDATA[Nate Silver]]></category>
		<category><![CDATA[Omar Tawakol]]></category>
		<category><![CDATA[Paul Meehl]]></category>
		<category><![CDATA[PECOTA]]></category>
		<category><![CDATA[Philip Tetlock]]></category>
		<category><![CDATA[The Signal and the Noise]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=285704</guid>
		<description><![CDATA[Armed with mountains of data, most experts put in charge of predicting outcomes still can't come up with accurate answers.]]></description>
				<content:encoded><![CDATA[<p><div id="attachment_285733" class="wp-caption alignright" style="width: 390px"><a href="http://allthingsd.com/files/2013/01/data38011.jpg"><img src="http://allthingsd.com/files/2013/01/data38011.jpg" alt="data3801" width="380" height="285" class="size-full wp-image-285733" /></a><p class="wp-caption-text"><span class="media-attribution">Image copyright <a href="http://www.shutterstock.com/gallery-60375p1.html">Petr Vaclavek</a></span></p></div>With all the advances in Big Data, shouldn&#8217;t it be easy to predict the future? Already, the algorithms and computer processing power exist to analyze petabytes of data every second, so it would seem logical we could us that power to predict outcomes &#8212; election results, the stock market, economic trends or simply who will win the Super Bowl. The problem is, our technical capacity to analyze data is growing faster than our social capability to understand it.</p>
<p>Armed with mountains of data, most experts put in charge of predicting outcomes still can&#8217;t come up with accurate answers. Why? Because most people have a human bias to manipulate data to deliver the outcomes they really want to see.</p>
<p>Philip Tetlock, a psychologist at the University of California at Berkeley, published a book in 2005 titled &#8220;Expert Political Judgment &#8212; How Good Is It?&#8221; For the book, a team of researchers interviewed 284 political experts and extracted 80,000 predictions from them over 15 years, rating the probabilities that each event would occur (they gave each prediction three possible outcomes). The result was surprising: the experts&#8217; predictions were worse than those made simply by assigning equal probabilities to all three outcomes. Dart-throwing monkeys could have performed just as well. It&#8217;s easy to make fun of talking heads and political pundits, but what about experts in other fields?</p>
<p>Another psychology professor and a past President of the American Psychological Association, Paul Meehl, has conducted similar research in other fields. In his work, he studied whether a simple algorithm could predict freshmen grades at the end of a school year better than expert college counselors could. The algorithms were only given SAT scores and grades. The expert counselors, on the other hand, were given the same data as well as a 45-minute interview and a personal essay. Again the simple algorithm won. In yet another example of experts versus algorithms, Paul&#8217;s work pitted algorithms predicting the future price of a bottle of wine against connoisseurs. In this case, the wine predicting algorithm had access only to three weather variables while the wine connoisseur, on the other hand, had all of the data and the ability to taste the wine. By now it should be obvious the algorithms beat the connoisseurs when it came to predicting the wine&#8217;s price. Paul&#8217;s analysis didn&#8217;t stop there. It extended to medical experts, flight trainers, parole officers, bankers and many others. In all the cases, the algorithms won. So where did all these experts go wrong? There were several reasons &#8212; but we will focus on just one here.</p>
<p>The biggest bias experts have is they tend to value perceived causes or stories over the underlying data. Storytelling is a powerful way to inject order into chaos, but reality is often far more complex than a simplified story. Whenever data supports their stories, they promote the data, but when data contradicts their stories, they question or ignore the data. Experts become really good at &#8220;finding the data&#8221; to promote their case.</p>
<p>What does all this have to do with marketers? A marketer&#8217;s job is to tell a story. Any expert will have a tendency to manipulate data to get it to conform to his or her story. Marketers are susceptible to the same sins, except they&#8217;re even more prone to storytelling. There is also a terrible feedback loop at work here. There may be good marketers who don&#8217;t want to torture data to fit into a clean story, but these marketers couldn&#8217;t make the bold claims that end up as headlines. So in short, marketers are some of the worst &#8220;experts&#8221; at predicting specific outcomes because their job biases them to storytelling and rewards them for making predictions that end up in the news.</p>
<p>To better understand why marketers tend to ignore data, let&#8217;s look at one possible scenario.</p>
<p>Imagine a Father&#8217;s Day ad that&#8217;s unfortunately misplaced next to a news article with the headline &#8220;One in Four Women Victims of Domestic Violence.&#8221; As a marketer, you are presented with the following two formulations about the poorly placed ad and you want to find out which publisher you should blame:</p>
<p><strong>Problem Description A</strong><br />
85 percent of the impressions in the Father&#8217;s Day ad campaign are served in publisher A, and 15 percent are served in publisher B. Ad verification software says the ad was found in a bad context on publisher B, and the verification software is right 80 percent of the time.</p>
<p>When asked, most marketers choose publisher B as the most likely cause of the problem. But let&#8217;s try an alternate formulation of the problem:</p>
<p><strong>Problem Description B</strong><br />
Publisher A and publisher B served the same number of ad impressions.<br />
We learn that 85 percent of ads served in publisher A are served into inappropriate content. Ad verification software says the ad was found in a bad context on publisher B. The verification software is right 80 percent of the time.</p>
<p>Now which publisher do you think caused the problem? Most marketers choose publisher A as the culprit.</p>
<p>It turns out both formulations of the problem result in the same probabilistic answer &#8212; that is, 61 percent probability that publisher A caused the problem, according to Bayes rule in odds form. Most marketers get the first version wrong and the second version right, even though it&#8217;s the same math problem! Why do most marketers get the second version right? In the first story, how you use the fact that 85 percent of impressions are served in publisher A is not obvious. In the second story, you have a cause. Publisher A is, for all practical purposes, almost always inappropriate. Marketers prefer stories that point to an understandable cause over statistics. Marketers will hone in on a story and ignore the rest of the data, no matter how big the data set.</p>
<p>Think of the many problems marketers have that involve some form of prediction. Who is the target audience for this product or service? Why do they need, like or want the product or service? Where can we find this audience? What creative will succeed with this audience? We even have a whole industry built around the RFP (request for proposal), which itself is a form of a story. The RFP mechanism guides billions of dollars of ad spending, and ad allocation decisions are forms of predictions.</p>
<p>So can marketers learn to overcome their biases and use data to make more accurate predictions? It may well be possible.</p>
<p>In his book, &#8220;The Signal and the Noise, &#8221; Nate Silver tells a great story about how the PECOTA system he built to predict pitcher performance in baseball was beat by expert scouts. In 2006, Nate&#8217;s program PECOTA produced a list of 100 prospects from the minor leagues. He then pitted that list against the Baseball America Top 100 prospects list, which was constructed by scouts. After watching six years of actual performance, and worrying their jobs would disappear due to PECOTA, the scouts performed better than the PECOTA list by a good margin. It turns out the scouts had learned to tone down their own biases and instead &#8220;listen&#8221; to the data. They learned they could combine their expertise with the reams of public data available.</p>
<p>Marketers can learn to separate the signal from the noise. Marketers are dedicated storytellers who believe their own stories, downplay luck and impose cause on randomness. In the world of Big Data, marketers have to learn to to let the data speak for itself. Otherwise, all the big data in the world won&#8217;t help marketers understand their customers better.</p>
<p><em>Omar is the co-founder and CEO of BlueKai, the data activation system that supplies both Fortune 100 companies and leading publishers with solutions for managing and activating first- and third-party data for creating highly effective customer and marketing campaigns. Omar&#8217;s previous roles include Chief Advertising Officer for mobile search and advertising solution Medio and Chief Marketing Officer for early behavioral data leader Revenue Science.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20130115/illusions-of-expertise-why-marketers-should-start-listening-to-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Graphs as a New Way of Thinking</title>
		<link>http://allthingsd.com/20130109/graphs-as-a-new-way-of-thinking/</link>
		<comments>http://allthingsd.com/20130109/graphs-as-a-new-way-of-thinking/#comments</comments>
		<pubDate>Wed, 09 Jan 2013 19:16:31 +0000</pubDate>
		<dc:creator>Emil Eifrem</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[Numbers]]></category>
		<category><![CDATA[Voices]]></category>
		<category><![CDATA[Adobe]]></category>
		<category><![CDATA[American Express]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Creative Cloud]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[Deutsche Telekom]]></category>
		<category><![CDATA[Emil Efrem]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[graphs]]></category>
		<category><![CDATA[Neo Technologies]]></category>
		<category><![CDATA[Squidoo]]></category>
		<category><![CDATA[Telenor Group]]></category>
		<category><![CDATA[Twitter]]></category>
		<category><![CDATA[Viadeo]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=283845</guid>
		<description><![CDATA[As data gets bigger, graphs get more important.]]></description>
				<content:encoded><![CDATA[<p>Faced with the need to generate ever-greater insight and end-user value, some of the world’s most innovative companies &#8212; Google, Facebook, Twitter, Adobe and American Express among them &#8212; have turned to graph technologies to tackle the complexity at the heart of their data.</p>
<p>To understand how graphs address data complexity, we need first to understand the nature of the complexity itself. In practical terms, data gets more complex as it gets bigger, more semi-structured, and more densely connected.</p>
<p>We all know about big data. The volume of net new data being created each year is growing exponentially &#8212; a trend that is set to continue for the foreseeable future. But increased volume isn&#8217;t the only force we have to contend with today: On top of this staggering growth in the volume of data, we are also seeing an increase in both the amount of semi-structure and the degree of connectedness present in that data.</p>
<h4 class="subhed">Semi-Structure</h4>
<p>Semi-structured data is messy data: data that doesn&#8217;t fit into a uniform, one-size-fits-all, rigid relational schema. It is characterized by the presence of sparse tables and lots of null checking logic &#8212; all of it necessary to produce a solution that is fast enough and flexible enough to deal with the vagaries of real world data.</p>
<p>Increased semi-structure, then, is another force with which we have to contend, besides increased data volume. As data volumes grow, we trade insight for uniformity; the more data we gather about a group of entities, the more that data is likely to be semi-structured.</p>
<h4 class="subhed">Connectedness</h4>
<p>But insight and end-user value do not simply result from ramping up volume and variation in our data. Many of the more important questions we want to ask of our data require us to understand how things are connected. Insight depends on us understanding the relationships between entities &#8212; and often, the quality of those relationships.</p>
<p>Here are some examples, taken from different domains, of the kinds of important questions we ask of our data:</p>
<ul>
<li>Which friends and colleagues do we have in common?</li>
<li>What&#8217;s the quickest route between two stations on the metro?</li>
<li>What do you recommend I buy based on my previous purchases?</li>
<li>Which products, services and subscriptions do I have permission to access and modify? Conversely, given this particular subscription, who can modify or cancel it?</li>
<li>What&#8217;s the most efficient means of delivering a parcel from A to B?</li>
<li>Who has been fraudulently claiming benefits?</li>
<li>Who owns all the debt? Who is most at risk of poisoning the financial markets?</li>
</ul>
<p>To answer each of these questions, we need to understand how the entities in our domain are connected. In other words, these are graph problems. </p>
<p>Why are these graph problems? Because graphs are the best abstraction we have for modeling and querying connectedness. Moreover, the malleability of the graph structure makes it ideal for creating high-fidelity representations of a semi-structured domain. Traditionally relegated to the more obscure applications of computer science, graph data models are today proving to be a powerful way of modeling and interrogating a wide range of common use cases. Put simply, graphs are everywhere.</p>
<h4 class="subhed">Graph Databases</h4>
<p>Today, if you’ve got a graph data problem, you can tackle it using a graph database &#8212; an online transactional system that allows you to store, manage and query your data in the form of a graph. A graph database enables you to represent any kind of data in a highly accessible, elegant way using nodes and relationships, both of which may host properties: </p>
<ul>
<li>Nodes are containers for properties, which are key-value pairs that capture an entity’s attributes. In a graph model of a domain, nodes tend to be used to represent the things in the domain. The connections between these things are expressed using relationships.</li>
<li>A relationship has a name and a direction, which together lend semantic clarity and context to the nodes connected by the relationship. Like nodes, relationships can also contain properties: Attaching one or more properties to a relationship allows us to weight that relationship, or describe its quality, or otherwise qualify its applicability for a particular query.</li>
</ul>
<p>The key thing about such a model is that it makes relations first-class citizens of the data, rather than treating them as metadata. As real data points, they can be queried and understood in their variety, weight and quality: Important capabilities in a world of increasing connectedness.</p>
<h4 class="subhed">Graph Databases in Practice</h4>
<p>Today, the most innovative organizations are leveraging graph databases as a way to solve the challenges around their connected data. These include major names such as Google, Facebook, Twitter, Adobe and American Express. Graph databases are also being used by organizations in a range of fields including finance, education, web, ISV and telecom and data communications. </p>
<p>The following examples offer use case scenarios of graph databases in practice.</p>
<ul>
<li>Adobe Systems currently leverages a graph database to provide social capabilities to its Creative Cloud &#8212; a new array of services to media enthusiasts and professionals. A graph offers clear advantages in capturing Adobe’s rich data model fully, while still allowing for high performance queries that range from simple reads to advanced analytics. It also enables Adobe to store large amounts of connected data across three continents, all while maintaining high query performance.</li>
<li>Europe’s No. 1 professional network, Viadeo, has integrated a graph database to store all of its users and relationships. Viadeo currently has 40 million professionals in its network and requires a solution that is easy to use and capable of handling major expansion. Upon integrating a graph model, Viadeo has accelerated its system performance by more than 200 percent.</li>
<li>Telenor Group is one of the top ten wireless Telco companies in the world, and uses a graph database to manage its customer organizational structures. The ability to model and query complex data such as customer and account structures with high performance has proven to be critical to Telenor&#8217;s ongoing success.</li>
</ul>
<p><div id="attachment_283846" class="wp-caption alignright" style="width: 650px"><a href="http://allthingsd.com/files/2013/01/telenor5.png"><img src="http://allthingsd.com/files/2013/01/telenor5-640x480.png" alt="An access control graph. Telenor uses a similar data model to manage products and subscriptions." width="640" height="480" class="size-large wp-image-283846" /></a><p class="wp-caption-text">An access control graph. Telenor uses a similar data model to manage products and subscriptions.</p></div></p>
<ul>
<li>Deutsche Telekom leverages a graph database for its highly scalable social soccer fan website attracting tens of thousands of visitors during each soccer match, where it provides painless data modeling, seamless data model extendibility, and high performance and reliability.</li>
<li>Squidoo is the popular social publishing platform where users share their passions. They recently created a product called Postcards, which are single-page, beautifully designed recommendations of books, movies, music albums, quotes and other products and media types. A graph database ensures that users have an awesome experience as it provides a primary data store for the Postcards taxonomy and the recommendation engine for what people should be doing next.</li>
</ul>
<p>Such examples prove the pervasiveness of connections within data and the power of a graph model to optimally map relationships. A graph database allows you to further query and analyze such connections to provide greater insight and end-user value. In short, graphs are poised to deliver true competitive advantage by offering deeper perspective into data as well as a new framework to power today’s revolutionary applications. </p>
<h4 class="subhed">A New Way of Thinking</h4>
<p>Graphs are a new way of thinking for explicitly modeling the factors that make today’s big data so complex: Semi-structure and connectedness. As more and more organizations recognize the value of modeling data with a graph, they are turning to the use of graph databases to extend this powerful modeling capability to the storage and querying of complex, densely connected structures. The result is the opening up of new opportunities for generating critical insight and end-user value, which can make all the difference in keeping up with today’s competitive business environment. </p>
<p><em>Emil is the founder of the Neo4j open source graph database project, which is the most widely deployed graph database in the world. As a life-long compulsive programmer who started his first free software project in 1994, Emil has with horror witnessed his recent degradation into a VC-backed powerpoint engineer. As the CEO of Neo4j&#8217;s commercial sponsor Neo Technology, Emil is now mainly focused on spreading the word about the powers of graphs and preaching the demise of tabular solutions everywhere. Emil presents regularly at conferences such as JAOO, JavaOne, QCon and OSCON.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20130109/graphs-as-a-new-way-of-thinking/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>IBM to Acquire StoredIQ, a Manager of Corporate Data</title>
		<link>http://allthingsd.com/20121219/ibm-to-acquire-storediq-a-manager-of-corporate-data/</link>
		<comments>http://allthingsd.com/20121219/ibm-to-acquire-storediq-a-manager-of-corporate-data/#comments</comments>
		<pubDate>Wed, 19 Dec 2012 16:32:07 +0000</pubDate>
		<dc:creator>Arik Hesseldahl</dc:creator>
				<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[acquisitions]]></category>
		<category><![CDATA[Austin]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[M&A]]></category>
		<category><![CDATA[mergers and acquisitions]]></category>
		<category><![CDATA[storage]]></category>
		<category><![CDATA[StoredIQ]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=279238</guid>
		<description><![CDATA[Big Blue adds to its ability to derive meaning out of mountains of old corporate data.]]></description>
				<content:encoded><![CDATA[<p><a href="http://allthingsd.com/20110714/ibms-cloud-is-big-in-japan-with-two-new-data-centers/eyebeeem-feature/" rel="attachment wp-att-98049"><img src="http://allthingsd.com/files/2011/07/eyebeeem-feature-380x285.png" alt="eyebeeem-feature" width="380" height="285" class="alignright size-medium wp-image-98049" /></a>IBM said today that it will <a href="http://www.prnewswire.com/news-releases/ibm-to-acquire-storediq-184090101.html">acquire StoredIQ</a>, an 11-year-old Austin-based company focused on managing big caches of corporate data.</p>
<p>A classic problem in Big Data circles is the reflexive urge to move data before you manage it and do anything useful with it. StoredIQ&#8217;s approach calls for managing data in the place it is originally stored. It works with a lot of different kinds of data sources &#8212; desktop machines, file shares, even old tapes. The point is to analyze them all, find the important bits, and delete the bits that aren&#8217;t important.</p>
<p>IBM will add StoredIQ to its Information LifeCycle Governance Suite, which aims to help companies spend less on the data they accumulate over the years. Data has a bad way of multiplying. If you&#8217;ve ever seen an email chain with attachments get sent to lots of people &#8212; then re-sent, forwarded and so on &#8212; you can pretty easily understand how unnecessary copies of the same data make storing it all so difficult. And if it&#8217;s happening on your email servers, it&#8217;s happening with other kinds of data, as well. Copies get made, then copies of copies. In time, it becomes unrealistic to sort through it all and just clean it up.</p>
<p>There is value in that older data as it ages. There are business patterns that repeat themselves that aren&#8217;t apparent years later: Pricing data, seasonal patterns and other information that&#8217;s worth keeping on for years and years. That&#8217;s where IBM&#8217;s analytics chops come into play. The key is figuring out which bits are important and which aren&#8217;t.</p>
<p>It&#8217;s not a trivial problem. Some companies are required by law &#8212; there have been a lot of new regulations around data retention in recent years &#8212; to store everything in case of a lawsuit or an enforcement action by a regulatory agency. As data multiplies, so does the cost to store and manage it, creating a headache for both the CIO and general counsel. </p>
<p>The deal builds on prior IBM acquisitions, like PSS Systems in 2010 and <a href="http://allthingsd.com/20120425/ibm-boosts-big-data-offerings-with-vivismo-acquisition/">Vivisimo earlier this year</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20121219/ibm-to-acquire-storediq-a-manager-of-corporate-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Twitter Takes Big Data to School</title>
		<link>http://allthingsd.com/20121214/twitter-takes-big-data-to-school/</link>
		<comments>http://allthingsd.com/20121214/twitter-takes-big-data-to-school/#comments</comments>
		<pubDate>Fri, 14 Dec 2012 14:00:12 +0000</pubDate>
		<dc:creator>Mike Isaac</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Social]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[college]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[data sets]]></category>
		<category><![CDATA[Firehose]]></category>
		<category><![CDATA[Growth team]]></category>
		<category><![CDATA[Othman Laraki]]></category>
		<category><![CDATA[Professor Marti Hearst]]></category>
		<category><![CDATA[Social Data]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[Twitter]]></category>
		<category><![CDATA[UC Berkeley]]></category>
		<category><![CDATA[university]]></category>
		<category><![CDATA[University of California at Berkeley]]></category>
		<category><![CDATA[VP growth]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=278089</guid>
		<description><![CDATA[Twitter's big data sets come Cal.]]></description>
				<content:encoded><![CDATA[<p><a href="http://allthingsd.com/20121214/twitter-takes-big-data-to-school/twitter_education/" rel="attachment wp-att-278090"><img src="http://allthingsd.com/files/2012/12/twitter_education.jpeg" alt="twitter_education" width="354" height="142" class="alignright size-full wp-image-278090" /></a>It&#8217;s that time of year again. The tinsel is hung, annoying music is playing nonstop and university classes are winding down while students prep for exams before the holidays.</p>
<p>That includes the students from Professor Marti Hearst&#8217;s class at the University of California at Berkeley, all of whom just finished off a semester entrenched in data sets sliced straight from the Twitter fire hose, after taking a course called <a href="http://blogs.ischool.berkeley.edu/i290-abdt-s12/about/">I290: &#8220;Analyzing Big Data with Twitter.&#8221;</a></p>
<p>It was essentially a crash course in Twitter&#8217;s big-data practices. The engineering and math students enrolled spent half the semester in a lecture hall listening to a host of guest product managers and engineers from Twitter, who explained the company&#8217;s approaches to analyzing and dealing with the massive amount of data that flows through Twitter&#8217;s pipes every single moment.</p>
<p>For data nerds (and wonks like me who follow the company), the course was a treat, as Twitter top brass like VP of Growth Othman Laraki stopped in to deliver lectures on some of the company&#8217;s best practices and philosophies when, for example, it comes to growth strategy. Other visitors included data systems engineers, search and relevance team members, and a former Twitter anti-spam/security expert. </p>
<p><a href="http://allthingsd.com/20121214/twitter-takes-big-data-to-school/twitter_berkeley/" rel="attachment wp-att-278093"><img src="http://allthingsd.com/files/2012/12/twitter_berkeley-380x253.jpg" alt="twitter_berkeley" width="380" height="253" class="alignleft size-medium wp-image-278093" /></a>When the course neared its end last week, some 40 students from Professor Hearst&#8217;s course <a href="http://engineering.twitter.com/2012/12/class-project-analyzing-big-data-with.html?utm_source=dlvr.it&amp;utm_medium=twitter">visited Twitter&#8217;s headquarters in San Francisco</a> and presented their small group projects created using Twitter data.</p>
<p>Bummed you missed out on the course? Fret not. Professor Hearst posted <a href="http://blogs.ischool.berkeley.edu/i290-abdt-s12/">video lectures from the semester</a> on her blog at the UC Berkeley School of Information. It&#8217;s all super-geeky stuff, but fascinating to watch if you&#8217;re at all interested in how Twitter makes sense of all the data that comes its way on a regular basis. (I recommend Laraki&#8217;s talk in particular &#8212; Twitter&#8217;s approach to growth is key to watch over the next few years, as the company continues to scale.)</p>
<p>Or, you know, you could just skip it and party like a true undergrad for all of winter break.</p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20121214/twitter-takes-big-data-to-school/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Oracle Acquires DataRaker, a Big Data Supplier to Utilities</title>
		<link>http://allthingsd.com/20121213/oracle-acquires-dataraker-a-big-data-supplier-to-utilities/</link>
		<comments>http://allthingsd.com/20121213/oracle-acquires-dataraker-a-big-data-supplier-to-utilities/#comments</comments>
		<pubDate>Thu, 13 Dec 2012 14:59:36 +0000</pubDate>
		<dc:creator>Arik Hesseldahl</dc:creator>
				<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[DataRaker]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[M&A]]></category>
		<category><![CDATA[mergers and acquisitions]]></category>
		<category><![CDATA[Oracle]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=277711</guid>
		<description><![CDATA[A step for Oracle into analyzing data generated by smart meters.]]></description>
				<content:encoded><![CDATA[<p><a href="http://allthingsd.com/20121118/cisco-munches-meraki-for-1-2-billion/acquisitions_shark-2/" rel="attachment wp-att-270615"><img src="http://allthingsd.com/files/2012/11/acquisitions_shark1.jpg" alt="acquisitions_shark" width="380" height="260" class="alignright size-full wp-image-270615" /></a>Oracle says it will acquire DataRaker, a player in the business of big data analytics that specializes in working with data generated by smart meters used by the gas and electrical industries.</p>
<p>Since DataRaker is relatively small and privately held, financial terms aren&#8217;t being disclosed. By my count, and it may be wrong, this is Oracle&#8217;s ninth acquisition of 2012.</p>
<p>DataRaker supplies cloud-based analytics software to electrical and gas utilities that are using smart meters. It combines data gathered from those meters with data gathered from older legacy systems and provides analysis that helps control costs, saving both customers and the utility money. </p>
<p>Here&#8217;s Oracle&#8217;s announcement. </p>
<blockquote class="memo"><p>Oracle Buys DataRaker</p>
<p>Adds Cloud-based Analytics Platform to Oracle Utilities Solutions to Transform Big Data into Actionable Intelligence</p>
<p>Redwood Shores, CA – December 13, 2012</p>
<p>News Facts<br />
Oracle today announced that it has entered into an agreement to acquire DataRaker, a provider of a cloud-based analytics platform that enables electric, gas and water utilities to leverage vast amounts of data to optimize operational efficiency and improve the customer experience.</p>
<p>Leading-edge utilities are investing in infrastructure to collect massive amounts of data from millions of distributed smart meters and sensors, and they require modern technologies to analyze and understand the insights provided by this data.</p>
<p>DataRaker’s proven solutions enable customers to gain immediate benefits from smart devices, by transforming meter, customer and network data into insights that can have a dramatic impact on organizational performance.</p>
<p>The combination of Oracle and DataRaker’s cloud-based solutions is expected to provide utilities with the most complete solution to harness the benefits of utility Big Data to improve operational performance and enhance customer experience.</p>
<p>More information on this announcement can be found at http://www.oracle.com/DataRaker<br />
Supporting Quotes</p>
<p>“Big Data created by smart meters and sensors has presented utilities with an enormous opportunity to improve operations and deliver better customer service by acting on the unique insights that can only be found by understanding the massive amounts of data coming from their customers and networks,” said Rodger Smith, Senior Vice President and General Manager, Oracle Utilities. “With DataRaker, Oracle can provide customers a complete and integrated set of products to further unlock efficiencies and create data insights that maximize business value.”</p>
<p>“Oracle’s proposed acquisition of DataRaker represents a strong endorsement of DataRaker’s proven, scalable and high-performance platform. We are excited to be a part of Oracle, and look forward to combining our resources to drive new innovations that will continue to deliver value to customers,” said Rick Brakken, CEO and Co-Founder, DataRaker.</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20121213/oracle-acquires-dataraker-a-big-data-supplier-to-utilities/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A New Storage Paradigm for Big Data</title>
		<link>http://allthingsd.com/20121212/a-new-storage-paradigm-for-big-data/</link>
		<comments>http://allthingsd.com/20121212/a-new-storage-paradigm-for-big-data/#comments</comments>
		<pubDate>Wed, 12 Dec 2012 23:59:47 +0000</pubDate>
		<dc:creator>Janae Stow Lee</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[Voices]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Janae Stow Lee]]></category>
		<category><![CDATA[Quantum]]></category>
		<category><![CDATA[wide area storage]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=277471</guid>
		<description><![CDATA[That terabyte per day per oilfield of 3D seismic data being collected today might turn into the next decade's oil find]]></description>
				<content:encoded><![CDATA[<p><div id="attachment_277487" class="wp-caption alignright" style="width: 390px"><img src="http://allthingsd.com/files/2012/12/clouds380.jpg" alt="clouds380" width="380" height="285" class="size-full wp-image-277487" /><p class="wp-caption-text"><span class="media-attribution">Image by <a href="http://www.shutterstock.com/gallery-877822p1.html">timquo</a></span></p></div>You can&#8217;t open either a technology or business journal these days without seeing an article on the tremendous opportunity Big Data offers in driving new sources of revenue and efficiency. What&#8217;s the BIG deal? Yes, data has value. That&#8217;s hardly a new concept. Since the time of the first shop, businesspeople have been saving their customer lists, product specifications, and key employee data on stone tablets, papyrus, paper and now digital storage.</p>
<p>The BIG deal is the disruptive value caused by the advent of high performance digital data capture. This technology has made data collection almost free. Internet-based marketing systems auto-magically capture masses of information about prospective customer preferences. Flash-enabled digital movie cameras can be emptied every night and re-used, a far cry from the film model in which every frame was burned on expensive media &#8212; which then had to be manually processed and edited. Digital capture systems also enable capture of exponentially more data per event. Social media sites and businesses increasingly create, store and analyze HD video instead of text, at ten to one hundred times more granularity of data per customer or product. The availability of compute power to manipulate this data to create business advantage enables companies like WalMart to maintain and frequently utilize multi-petabyte databases to analyze customer patterns and speed decision making. For the first time, solutions are also becoming available which allow for productive analysis of video as well. Finally, the resulting content is being stored forever: after all, that terabyte per day per oilfield of 3D seismic data being collected today might turn into the next decade&#8217;s oil find; or today&#8217;s genomic profile might be tomorrow&#8217;s cure for cancer.</p>
<p>As a result of this &#8220;free&#8221; data capture, increasing information granularity, more frequent usage and extended data value, businesses, research institutions and governments are growing enormous stores of large unstructured data (increasingly video) that needs to be stored and managed. This result presents a number of challenging data storage problems: extreme scalability; affordability (in general); managing the balance between cost and easy online access; maximizing application and user access; and assuring data durability.</p>
<p>Like the cavalry, a new set of &#8220;wide area storage&#8221; solutions based on second generation object storage are arriving just in time to help enterprises manage these issues. Object storage is a unique storage architecture utilizing a kind of valet parking ticket system to store and retrieve data. The creator of a piece of data (an object) hands the object to the storage in exchange for an object identifier (the digital equivalent of a valet parking ticket). When the data is later needed, the user hands the system the object identifier, and data is returned. The power of this model is that it is highly scalable &#8212; many objects can be stored and retrieved in parallel &#8212; and retrieval can be independent of the original application or physical location. Any application or user who gains authorized access to the electronic key can use the data. Historically, this technology has been used to store data in large, containerized archives, but these were limited by the size and performance of the object storage &#8220;box.&#8221;</p>
<p>What makes the latest generation architecture different is its ability to copy and disperse objects very efficiently across large numbers of independent processing and storage elements, which can even be distributed widely &#8212; geographically providing disaster recovery protection without the need for traditional replication. This wide area storage essentially frees object storage from the constraints of a box or even a site. Wide area storage is similar to storing data in the internet or cloud &#8212; data is freed from the traditional boundaries of expensive storage hardware and processes. This results in capabilities which are perfectly suited to the challenges of Big Data: almost infinite scale; low cost per petabyte; the ability to afford 100 percent online data storage; global multi-application access; and a content store which can live forever without ever needing to endure a disruptive data migration.</p>
<p>How does it achieve this? Well first, as previously described, the underlying object storage is natively extremely scalable; unlike systems with centralized indices, if you want to add more data to a wide area storage system, you simply add more objects, which are then dispersed across more scaled out components to add access, performance or storage. And the hardware architecture is scale-out, which means if you need more storage, more performance or more communications bandwidth, you just plug in more pre-packaged storage, processors or network capacity, and the system takes care of the rest. Growth is simple.</p>
<p>Second, much like the systems we&#8217;ve all read about at Google, wide area storage systems are built under the assumption that individual hardware components will fail. Because objects are copied and dispersed across many storage and geographic resources, a multitude of components can fail while data is continuously available. This &#8220;failure-assumed&#8221; model allows for wide area storage to operate on lower cost, off-the-shelf disk and processor technologies, which translates to much lower capital cost than traditional disk storage. The ability to defer replacement of failed components also results in lower support and operating costs. And because multiple users and sites can share the system in common, the overhead of storage and disaster protection is shared across all users. No archive silos.</p>
<p>Third, the native cloud-like access model of these new offerings embeds the capability for &#8220;wide&#8221; geographic and application access, supporting a wide range of uses – from streaming data (like video, sensor information, and genomic sequences) to parallel processing (like Hadoop for analytics). Some solution vendors are also providing easier access for existing applications, including policy-based tiering from traditional disk or the ability to appear like a NAS archive.</p>
<p>Finally, and perhaps most intriguing, these new solutions offer the capability for content, once stored, to never need to be migrated again. This may be the most critical element of all. For any user or technology manager who has endured the pain of unloading a broken, filled or obsolescent NAS or traditional block storage array in order to migrate to the next big thing, with next generation object storage, you never need do this again. This is going to be critical when your Big Data store is holding hundreds of petabytes of high value data. Forever.</p>
<p><em>Janae Stow Lee is senior vice president for Quantum&#8217;s file system and archive products.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20121212/a-new-storage-paradigm-for-big-data/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Exclusive: Cloudera Closes Massive $65 Million Funding Round at $700 Million Valuation</title>
		<link>http://allthingsd.com/20121206/exclusive-cloudera-closes-massive-65-million-funding-round-at-700-million-valuation/</link>
		<comments>http://allthingsd.com/20121206/exclusive-cloudera-closes-massive-65-million-funding-round-at-700-million-valuation/#comments</comments>
		<pubDate>Thu, 06 Dec 2012 20:06:16 +0000</pubDate>
		<dc:creator>Arik Hesseldahl</dc:creator>
				<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Accel Partners]]></category>
		<category><![CDATA[Amr Awadallah]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Christophe Bisciglia]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[financing]]></category>
		<category><![CDATA[funding]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Ignition Partners]]></category>
		<category><![CDATA[In-Q-Tel]]></category>
		<category><![CDATA[Jeff Hammerbacher]]></category>
		<category><![CDATA[Meritech Capital Partners]]></category>
		<category><![CDATA[Mike Olson]]></category>
		<category><![CDATA[Ping Li]]></category>
		<category><![CDATA[venture capital]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=275769</guid>
		<description><![CDATA[This Hadoop thing? It just might just be going places.]]></description>
				<content:encoded><![CDATA[<p><a href="http://allthingsd.com/20121206/exclusive-cloudera-closes-massive-65-million-funding-round-at-700-million-valuation/cloudera_380/" rel="attachment wp-att-275837"><img src="http://allthingsd.com/files/2012/12/cloudera_380.png" alt="" title="cloudera_380" width="379" height="285" class="alignright size-full wp-image-275837" /></a>Whenever you hear the phrase &#8220;big data&#8221; these days, and depending on whom you&#8217;re talking to, the word Hadoop usually isn&#8217;t far behind. When companies and large organizations seek to tackle the biggest of their big data problems, they usually use Hadoop, the open source software framework that essentially makes those problems easier to manage.</p>
<p>Hadoop has become a <a href="http://allthingsd.com/20110629/everyone-loves-hadoop-so-cloudera-makes-it-easier-to-manage/">big thing</a>, and in fact there are numerous flavors of it and <a href="http://allthingsd.com/20110830/exclusive-hadoop-companies-multiply-as-mapr-lands-20m-in-funding/">numerous companies</a> offering their own version of it, while others are building things to work with it.</p>
<p>The biggest and best known company in the Hadoop world is <a href="http://allthingsd.com/20110412/big-data-start-up-cloudera-kicks-hadoop-up-a-notch/">Cloudera</a>. Started in 2008 by a trio of engineers from Facebook, Google and Yahoo (Jeff Hammerbacher, Christophe Bisciglia and Amr Awadallah) plus CEO Mike Olson, a former Oracle exec, it has in four years gone from the start-up that few really understood to the company you have to talk to if you want to stand a chance wrestling your data challenges to the ground.</p>
<p>Today, the company will take a significant step forward. <strong>AllThingsD</strong> has learned that Cloudera has landed a $65 million Series E round of venture capital funding. The funding round is being led by Accel Partners. Accel is not a new investor in Cloudera &#8212; it led its $5 million Series A round in 2009 &#8212; and it is leading a round that includes all repeat investors, including Greylock Partners, Ignition Partners, In-Q-Tel and Meritech Capital Partners.</p>
<p>The round brings Cloudera&#8217;s total capital raised to about $140 million. And according to sources familiar with the terms, this deal values Cloudera at about $700 million.</p>
<p>The round was to be announced after a closed meeting of Cloudera employees at its Palo Alto headquarters led by CEO Mike Olson and other senior executives. The meeting was scheduled to take place at noon PT.</p>
<p>The company also said it plans to expand its European operations with the opening of new offices in the United Kingdom in the first quarter of 2013.</p>
<p>Coming as it does on the heels of Cloudera&#8217;s hiring of a new CFO &#8212; Jim Frankola, the former CFO of Yodlee and Ariba who started his career at IBM, <a href="http://www.cloudera.com/content/cloudera/en/about/press-center/press-releases/release.html?ReleaseID=1747794">joined on Oct. 3</a> &#8212; the questions about a possible public offering of Cloudera shares will begin to increase.</p>
<p>I asked Olson and Frankola about it in an interview Monday. They both said they&#8217;re interested in the long-term future for Cloudera, and if that includes an IPO, then great. &#8220;From the very beginning we&#8217;ve seen the opportunity here that is big enough to create a long-term business,&#8221; Olson said. &#8220;You don&#8217;t often get a chance to step up to the plate in a ballpark that is as exciting as this one,&#8221; Olson told me.</p>
<p>Frankola hearkened back to his days in corporate finance at Big Blue. &#8220;At IBM we got to make really big, multibillion dollar bets, and then you go to smaller companies and you sometimes have to play it safe. But the space that we&#8217;re playing in is one of the biggest bets there is in technology.&#8221;</p>
<p>I also talked with Accel Partner Ping Li, who has long been a Cloudera fan and created <a href="http://allthingsd.com/20111108/cloudera-lands-40-million-from-ignition-accel-launches-100-million-big-data-fund/">Accel&#8217;s Big Data Fund</a> in part to fund companies doing interesting things with Hadoop. &#8220;These guys pretty much created the big data wave when they decided to take Hadoop out of the bowels of the data center and into the enterprise,&#8221; he said. &#8220;They really created the idea that there&#8217;s all this data and Hadoop is a really critical layer in the stack.&#8221;</p>
<p>How&#8217;s the business doing? No one would say exactly, though Frankola said Cloudera has been seeing its revenue and number of customers roughly doubling every year. Li put it this way: &#8220;You don&#8217;t see this type of growth in software and infrastructure companies that often.&#8221;</p>
<p>When you consider that four years ago people sort of scratched their heads and wondered what they would actually use Hadoop for, the list of companies that are Cloudera customers is impressive. It includes AOL, CBS, eBay, Expedia, J.P. Morgan Chase, Monsanto, Nokia, Research In Motion and the Walt Disney Company. </p>
<p>Cloudera is sometimes compared &#8212; not inaccurately, but perhaps imprecisely &#8212; to Red Hat, which built a business on helping companies deploy and manage the free Linux operating system. Like Linux, Hadoop is technically free, and any company can go out and download it for free. The trick comes in knowing what to do with it, and managing it, and getting value out of it. And that&#8217;s where Cloudera&#8217;s money-making value comes into play. It sells its own software that runs on top of Hadoop, plus it also distributes its own flavor of Hadoop that it supports and helps its customers manage on a subscription basis.</p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20121206/exclusive-cloudera-closes-massive-65-million-funding-round-at-700-million-valuation/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Renaissance of Enterprise Computing</title>
		<link>http://allthingsd.com/20121205/the-renaissance-of-enterprise-computing/</link>
		<comments>http://allthingsd.com/20121205/the-renaissance-of-enterprise-computing/#comments</comments>
		<pubDate>Wed, 05 Dec 2012 19:02:48 +0000</pubDate>
		<dc:creator>Peter Levine</dc:creator>
				<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[Mobile]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Voices]]></category>
		<category><![CDATA[Actifio]]></category>
		<category><![CDATA[Andreessen Horowitz]]></category>
		<category><![CDATA[Apptio]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[box]]></category>
		<category><![CDATA[Bromium]]></category>
		<category><![CDATA[Cisco]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[CRM]]></category>
		<category><![CDATA[enterprise computing]]></category>
		<category><![CDATA[enterprise software]]></category>
		<category><![CDATA[GitHub]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[Mixpanel]]></category>
		<category><![CDATA[Okta]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Petr Levine]]></category>
		<category><![CDATA[Platfora]]></category>
		<category><![CDATA[SAAS]]></category>
		<category><![CDATA[Salesforce]]></category>
		<category><![CDATA[Tidemark]]></category>
		<category><![CDATA[VMware]]></category>
		<category><![CDATA[Windows]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=275405</guid>
		<description><![CDATA[There was a time when enterprise computing was almost exclusively dominated by Microsoft, Oracle and Cisco.]]></description>
				<content:encoded><![CDATA[<p><img src="http://allthingsd.com/files/2012/12/vitruvian3802-380x285.jpg" alt="" title="vitruvian3802" width="380" height="285" class="alignright size-Featured wp-image-275467" /><br />
<blockquote class="small">“Everything that can be invented has been invented.”<br />
—Charles H. Duell, Commissioner, U.S. patent office, 1899</p></blockquote>
<p>Last month, <a href="http://www.youtube.com/watch?v=b9n7GulqdsU&#038;feature=youtu.be">we gathered 75 of the top CIOs from around the country</a> to discuss the new generation of enterprise software and the redefined role of the CIO. These CIOs are dealing with an unprecedented level of experimentation and innovative new approaches focused on unsolved problems in enterprise software. The end result will be a complete remaking of the entire enterprise software stack at the intersection of cloud, mobile and SaaS. </p>
<p>All of the CIOs are also facing a changed environment, one where every department within an organization makes its own software buying decisions, outside the purview of the CIO. This “departmentalization of applications” &#8212; from <a href="https://www.box.com/">Box</a> for collaboration to <a href="https://github.com">GitHub</a> for software development to <a href="http://www.tidemark.net">Tidemark</a> for Enterprise Performance Management &#8212; means the CIO not only needs to figure out how to enable the department and employee to leverage these software products, but also meet the security and compliance requirements of the larger corporate environment &#8212; which, by the way, <a href="http://www.bromium.com">Bromium</a> and <a href="http://www.okta.com">Okta</a> allow you to do. These CIOs know that they can adapt or organizations will adapt without them. </p>
<p>Their jobs weren’t always so difficult. For those of you old enough to remember, there was a time when enterprise computing was almost exclusively dominated by Microsoft, Oracle and Cisco. It was a time when on-premise, Windows-based applications were the de facto standard and there was no alternative. The enterprise was so entrenched that challenging the status quo was viewed as suicidal and very stupid. So hardened was the thinking that most innovation in the enterprise was relegated to mere feature extensions of existing solutions.</p>
<p>Fast-forward to today and the world of enterprise computing has done a 180. Traditional IT is being blown to bits as cloud infrastructure, Software-as-a-Service and mobile computing become the new standards. We are experiencing innovation and usage as never seen before. It is truly a renaissance of massive scale. Hundreds of billions of dollars are up for grabs as buyers shift to new architectures and away from old, as new users and new markets embrace the availability and ease by which they can consume technology. </p>
<h4 class="subhed">On the Road to a Revolution</h4>
<p>VMware and Salesforce catalyzed this movement from unlikely origins. Both were little known and under-funded, but against all conventional wisdom each visualized a new world order &#8212; a world in which the data center was virtual and where applications would run off-premise, eliminating op-ex and painful software upgrades. The world watched but there were few believers. “Suicidal,” people said. “Why would I ever permit my precious customer data to reside outside my firewall?”</p>
<p>But momentum grew. VMware figured out how to effectively break apart the functionality of software from the hardware it resides on, driving a new set of economics into data centers. Salesforce began expanding beyond CRM, demonstrating the wider viability of subscription-based payments and the customer benefits of constant iteration. Customers began to believe that this new vision might actually come true. From a single virtual server and a single customer relationship app, both companies paved the way for a new world order. </p>
<p>Every part of the business software stack is now being remade &#8212; from infrastructure to applications to mobile to analytics &#8212; with every incumbent in danger of having its core business eroded. And, sure, incumbents will try to buy innovative products and will try to develop their own competing technologies, but the reality is that this new paradigm disrupts the entirety of these businesses. Overcoming a foundational shift cannot be met by a simple product buy or even a strategy change &#8212; the new breed of enterprise software start-ups has different revenue recognition policies, different sales models and different go-to-market models, and engineering processes than incumbents. We are talking about transformations occurring here simultaneously in technology and business models! It’s an entirely new approach to IT. </p>
<h4 class="subhed">The Departmentalization of Applications</h4>
<p>Buyers are clamoring for this new approach. None of our portfolio companies use Oracle. Some use Microsoft, but the majority opt for Google or an open source package. In our own Executive Briefing Center, where we connect and facilitate exchange amongst global brands and the rising stars in tech, we’re finding that even enterprise CIOs are looking beyond mature players to new and emerging technology companies, especially in areas like cloud computing, mobile, big data and SaaS. These are the early indicators of a more permanent shift in IT consumption habits. This shift is resulting in software applications that are targeted for specific business functions. <a href="http://www.apptio.com">Apptio</a>, for example, has built a world-class application that specifically targets the CIO as a customer. <a href="https://mixpanel.com">Mixpanel</a> has an application that is in the big data space, but specifically targets analytics for mobile applications. This shift is what I am calling the “departmentalization of applications”. </p>
<p>And entrepreneurs know that incumbents are vulnerable. We see a tremendous number of entrepreneurs bringing a new approach to this crusty old enterprise software market. We see entrepreneurs like Ben Werther of <a href="http://www.platfora.com">Platfora</a>, who is passionate about up-ending the Business Intelligence market, and Ash Ashutosh of <a href="http://www.actifio.com">Actifio</a>, who is creating the next generation storage software. </p>
<p>These are entrepreneurs who choose to do the hard work of building software for companies to use, and the software they are creating is elegant, fast, does what it’s supposed to and priced fairly. This is an unbeatable value proposition. For everyone except perhaps the incumbents, this is a great time to be involved with enterprise software. </p>
<p><em>Peter Levine is a partner at Andreessen Horowitz and blogs at <a href="http://peter.a16z.com/">http://peter.a16z.com/</a>. He has been a lecturer at both MIT and Stanford business schools and was the former CEO of XenSource, which was acquired by Citrix in 2007. Prior to XenSource, Peter was EVP of Strategic and Platform Operations at Veritas Software, where he helped grow the organization from no revenue to more than $1.5 billion, and from 20 employees to over 6,000.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20121205/the-renaissance-of-enterprise-computing/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Eight Questions for Rick Smolan About the Human Face of Big Data</title>
		<link>http://allthingsd.com/20121204/eight-questions-for-rick-smolan-about-the-human-face-of-big-data/</link>
		<comments>http://allthingsd.com/20121204/eight-questions-for-rick-smolan-about-the-human-face-of-big-data/#comments</comments>
		<pubDate>Tue, 04 Dec 2012 12:56:49 +0000</pubDate>
		<dc:creator>Arik Hesseldahl</dc:creator>
				<category><![CDATA[Commerce]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[Media]]></category>
		<category><![CDATA[Mobile]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Product News]]></category>
		<category><![CDATA[Social]]></category>
		<category><![CDATA[Amazon]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[DVR]]></category>
		<category><![CDATA[electrical power]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[environment]]></category>
		<category><![CDATA[Gates Foundation]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[Japan]]></category>
		<category><![CDATA[Nigeria]]></category>
		<category><![CDATA[photography]]></category>
		<category><![CDATA[polio]]></category>
		<category><![CDATA[Rick Smolan]]></category>
		<category><![CDATA[Seven Questions]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=274748</guid>
		<description><![CDATA[A defining book that makes a previously nebulous concept understandable.]]></description>
				<content:encoded><![CDATA[<p><a href="http://allthingsd.com/20121204/eight-questions-for-rick-smolan-about-the-human-face-of-big-data/bigdatanyc/" rel="attachment wp-att-274776"><img src="http://allthingsd.com/files/2012/12/bigdatanyc-380x285.jpeg" alt="" title="bigdatanyc" width="380" height="285" class="alignright size-Featured wp-image-274776" /></a>If you work anywhere near anything that might be described as &#8220;big data&#8221; and have ever had trouble explaining to someone you care about why what you do matters, the obvious gift to give this holiday season is &#8220;The Human Face of Big Data.&#8221;</p>
<p><a href="http://allthingsd.com/20120913/rick-smolans-newest-project-will-try-to-breathe-life-into-big-data/">Weighing in at 7.5 pounds</a>, it is an ambitious, jaw-dropping effort helmed by former Time, Life and National Geographic photographer Rick Smolan &#8212; he of the &#8220;Day in the Life&#8221; series of photography books, as well as &#8220;<a href="http://www.myamericaathome.com/customcover/inside.php">America at Home</a>,&#8221; &#8220;<a href="http://en.wikipedia.org/wiki/America_24/7">America 24/7</a>&#8221; and &#8220;<a href="http://books.google.com/books/about/24_hours_in_cyberspace.html?id=abpeAAAAIAAJ">24 Hours in Cyberspace</a>.&#8221; Smolan&#8217;s new book attempts to demystify &#8212; largely successfully &#8212; the nebulous concept: What is big data?</p>
<p>It was exactly the question that Smolan was asking when he first hit upon the idea for the book while attending the <a href="http://allthingsd.com/category/d/d9/"><strong>D9</strong> conference in 2011</a>. Hearing the phrase &#8220;big data&#8221; uttered in so many conversations, he had no idea what it meant. Asking at first yielded unclear answers, yet he persisted, eventually landing on the idea.</p>
<p>Today, the book is landing on the desks of world leaders, dignitaries and other notable people around the world: Among those on the list: President Obama, the Dalai Lama, Pope Benedict XVI and Chinese Premier Wen Jiabao, and also Twitter CEO Jack Dorsey; Daniel Tunkelang, chief data scientist at LinkedIn; and actor Robin Williams. Among the images they&#8217;ll see upon opening it is the blended image of 1,400 different shots of New York&#8217;s Times Square taken across 15 hours. Big data is about people: What they do, where they go, who they know and so on. The stories about how data, once harnessed, solves problems and in some ways creates new ones, is its overarching theme.</p>
<p>There&#8217;s also a smartphone app for iPhone and Android that is launching today. It&#8217;s an interactive viewer app from Aurasma that aims to bring the book’s content to life, accessing videos and animations by pointing the camera at images on certain pages flagged within the book. On top of that, there&#8217;s a $2.99 iPad app that enables readers to take a deeper dive with some of the stories, using videos, charts and animated infographics. </p>
<p>I talked with Smolan about the book yesterday by phone, after spending more than a few hours perusing an advance copy over the weekend. Here&#8217;s a little of what we talked about: </p>
<p><a href="http://allthingsd.com/20121204/eight-questions-for-rick-smolan-about-the-human-face-of-big-data/smolan-big-data/" rel="attachment wp-att-274781"><img src="http://allthingsd.com/files/2012/12/smolan-big-data.jpeg" alt="" title="smolan-big-data" width="270" height="180" class="alignright size-full wp-image-274781" /></a><strong>AllThingsD: So where did you get the idea for a book on big data? It&#8217;s a phrase that doesn&#8217;t necessarily jump out at me as part of the title of a bestseller.</strong></p>
<p><strong>Smolan</strong>: I was at <strong>D: All Things Digital</strong> in 2011, and I kept hearing the phrase big data, and I kept asking people what it meant, because I felt stupid and because it sounded like one of those marketing phrases. The first person I talked to said, &#8220;It&#8217;s so much information it won&#8217;t sit on your personal computer.&#8221; Well, that wasn&#8217;t very interesting. The next one said, &#8220;It&#8217;s taking information from one place and overlapping it with information from another, and finding these patterns.&#8221; And that wasn&#8217;t interesting, either. The third person said, &#8220;It&#8217;s like watching the planet grow a nervous system.&#8221; And that sounded interesting. Basically, we&#8217;re seeding the world with low-cost sensors, and we&#8217;ve all become sensors with our cellphones. And instead of doing random samplings, we can almost survey every single person on the planet in real time &#8212; where they are, what they&#8217;re doing, how fast they&#8217;re going, what they&#8217;re spending money on. The ability to gather that information, process it, visualize it and then respond to it while it&#8217;s still happening is something we&#8217;ve never had the ability to do before.</p>
<p><strong>Some of the material in the book I&#8217;m familiar with. The first image I saw when I opened it was one I recognized from <a href=http://senseable.mit.edu/nyte/>MIT&#8217;s Sensable City Lab</a>, and I also recognize big data anecdotes from IBM, like the one where they harnessed medical data to <a href=http://allthingsd.com/20110616/video-an-ibm-film-about-chocolate-and-babies-and-ducks/>detect infections in premature infants</a>. In this way, it seems it&#8217;s a little different from your previous books.</strong></p>
<p>It&#8217;s sort of a combination of original photography and curation. I think that putting all the information in one place and weaving it together, with these wonderful essays that I think are just as strong as the pictures. I&#8217;ve been getting notes from people like Marissa Mayer and Jack Dorsey saying that this is the first time they&#8217;ve had something that helps them explain how important this is. Amazon called last week to say they sold out of copies of the book on the first day and people were ordering 50 or 60 copies at a time, which has never happened ever to any book I&#8217;ve done in 25 years. They were dumbfounded. The hard thing about the book world is that you never know whether 10 people or a million people will find it interesting. A lot of people have never heard about big data and the ones who have, have a lot of trouble explaining it to other people. So I&#8217;m hoping that this will become the thing the people who know give to their parents or their family as a way of saying &#8220;this is why what I do is important.&#8221;</p>
<p><strong>Obviously you&#8217;ve spent a lot of time thinking about all this during the last year, and you&#8217;ve probably been asked a million times if you think this is all creepy or intrusive in some way. Is it?</strong></p>
<p>I&#8217;m an optimist. Every new tool can be used for good or evil. The whole point of doing this project is to start a conversation about it all. The people who are thinking most about big data right now are corporations and governments. I&#8217;d like to broaden the conversation and I hope the book makes some kind of contribution. I&#8217;m worried that the only ones profiting from it right now are corporations. As individuals we have very little say about how our data is being used. I&#8217;m not worried about the privacy implications of it so much. But it seems to me that as an individual, if I&#8217;m the one generating the data, I should have some kind of say in how it&#8217;s going to be used. </p>
<p><strong>Did you have a particular favorite anecdote or photograph?</strong></p>
<p>I just came back from Australia, and they have this expression down there: <a href=http://www.urbandictionary.com/define.php?term=gobsmack>Gobsmacked</a>. I think a lot of the pictures in the book convey that feeling. There are some that are funny, some that are just thought-provoking. There&#8217;s the case of the Environmental Systems Research Institute (ESRI) that creates these incredibly detailed satellite maps for governments. They found there were villages in Nigeria, which has the highest rate of polio resurgence in the world.  There are villages there that have never shown up on any map, no one in the government knew they were there. ESRI can recognize the shape of huts and pathways. The Gates Foundation has been trying to eradicate polio in places like Nigeria, and they have a very big effort there. They took the satellite maps and handed out 10,000 GPS-enabled cellphones to polio workers. They could see where they were in real time, and make sure they got to each of the houses. We spent a week travelling with the polio workers watching them do their work. I think the idea of using satellites to help cure polio is a pretty interesting concept. </p>
<p><strong>You have a lot of examples where understanding of big data is saving lives, which I think will surprise some people who don&#8217;t initially see it as having direct benefits for real people. What are some others?</strong></p>
<p>There&#8217;s the case of the recent earthquake in Japan. I heard a fascinating story by Kai Ryssdal on Marketplace Radio about how 43 seconds before the shaking actually began, all the bullet trains and factories in Japan stopped running. It was all automated. That country spent 15 years and half a billion dollars to build the system that automated all of this. Obviously the devastation was horrible, but the system worked. Then I read about a group of engineers in Palo Alto that had created a program called <a href=http://qcn.stanford.edu/>Quake Catcher</a> that uses the accelerometer in your laptop. Its the part in your laptop that detects when it&#8217;s been dropped and quickly moves the head on the drive drive before it smashes to the ground. It uses the same acceleromter to detect earthquakes. If it senses vibration and sees the same pattern over a 30-mile area, that&#8217;s an earthquake. On one side of the page, you have this huge half-billion dollar project, hardwired, dedicated parts that have to be replaced, lots of engineering time. And on the other you have this free ubiquitous crowdsourced mobile sensor system that has no profit motivation, and no cost. I love it. It&#8217;s a delightful story of people doing this to help each other. And the data just underpins it all. </p>
<p><strong>Is there anything in the book that has some practical, everyday value?</strong></p>
<p>Yes. There&#8217;s the example of Shwetak Patel, he&#8217;s a MacArthur Fellow and teaches at the University of Washington. He found a way to detect every device in the home and measure how much power it&#8217;s using. Every month we get a bill from the power company and we just pay it, we don&#8217;t even ask what it&#8217;s about. He&#8217;s created a sensor that can be plugged in anywhere in the house that detects the unique digital signature of everything that&#8217;s drawing power in the house &#8212; your computer, your toaster oven, whatever. I asked him if there was anything he had learned that would surprise the average American. He said it&#8217;s the DVR. The average American spends 11 percent of their monthly electrical bill on their DVR. It was designed in such a way that the hard drive never spins down, so even if you record only one show a week, it&#8217;s running the entire time and consuming power. So instead of drilling another oil well or burning more coal you could reduce America&#8217;s power bill by 5 percent just be redesigning the DVR. So many stories have this sense of delight: The data has been there all along, it&#8217;s just that no one was paying attention to it.</p>
<p><strong>You&#8217;ve also done iPad and smartphone apps to enhance the book. What can you tell me about that?</strong></p>
<p>I don&#8217;t know that anyone has ever done this with a book like this before, but there&#8217;s a free app you can download to your smartphone. Some of the pictures in the book have this little yellow key symbol in the corner. When you have the app, and you point the app at the page, it launches the page in the app. There are videos, there are Ted Talks. I think there are 22 or 23 videos. There&#8217;s an animated version of a story about pizza delivery guys in Midtown Manhattan. There&#8217;s also an iPad app, the profits from which go to charity: water, which is a nonprofit that&#8217;s working on bringing safe drinking water to people in developing nations. My goal here is to keep people turning the pages. </p>
<p><strong>What do you want people to be left with in the end?</strong></p>
<p>There&#8217;s an essay toward the back of the book called &#8220;Data Driven&#8221; by Jonathan Harris that has a really interesting thought. It&#8217;s that there is a relatively small group of people who are living in cities like San Francisco and New York, are mainly between the ages of 22 and 35, who are having an outsized effect on the rest of the human species. The kinds of societal changes that used to be the result of wars and famines are being brought about through software. …What I like about the essay is that EMC, which funded the book, had no right of review. I told them that this book wasn&#8217;t going to be all about cheerleading big data as the solution to all our problems. I said it was also going to sound a cautionary note because I think that right now governments and corporations are the ones having conversations about big data and that the average person isn&#8217;t. But it can have an effect on so many things in our lives, from our credit rating to our ability to get hired and our ability to do lots of things. I think it&#8217;s really important that we have this conversation now. </p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20121204/eight-questions-for-rick-smolan-about-the-human-face-of-big-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Sumo Logic, Generating Big Data From Log Files, Lands $30 Million From Accel</title>
		<link>http://allthingsd.com/20121128/sumo-logic-generating-big-data-from-log-files-lands-30-million-from-accel/</link>
		<comments>http://allthingsd.com/20121128/sumo-logic-generating-big-data-from-log-files-lands-30-million-from-accel/#comments</comments>
		<pubDate>Wed, 28 Nov 2012 13:05:57 +0000</pubDate>
		<dc:creator>Arik Hesseldahl</dc:creator>
				<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Accel Partners]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[Aneel Bhusri]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[Greylock Partners]]></category>
		<category><![CDATA[Limelight Networks]]></category>
		<category><![CDATA[logs]]></category>
		<category><![CDATA[Netflix]]></category>
		<category><![CDATA[Ooyala]]></category>
		<category><![CDATA[Ping Li]]></category>
		<category><![CDATA[Sumologic]]></category>
		<category><![CDATA[Sutter Hill Ventures]]></category>
		<category><![CDATA[Vance Loiselle]]></category>
		<category><![CDATA[Workday]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=273350</guid>
		<description><![CDATA[You'll never look at systems logs in quite the same way again.]]></description>
				<content:encoded><![CDATA[<p><a href="http://allthingsd.com/20120607/why-google-couldnt-pal-up-with-buddy-media/moneybags/" rel="attachment wp-att-217917"><img src="http://allthingsd.com/files/2012/06/moneybags.png" alt="" title="moneybags" width="380" height="285" class="alignright size-full wp-image-217917" /></a>When, exactly, was the last time you thought about the log file generated by one of your servers? It was probably when something wasn&#8217;t going right and you were trying to troubleshoot some mysterious digital misfire somewhere along a complicated chain of machines.</p>
<p>Managing systems logs is one of the more unglamorous tasks that come with managing IT infrastructure &#8212; not that there&#8217;s much glamour in the first place &#8212; but today is the sort of day when it takes on a cooler sheen.</p>
<p>Today, a fascinating and fast-growing start-up called Sumo Logic announced that it has taken a $30 million Series C round of venture capital funding led by Accel Partners. It is the latest investment by Accel&#8217;s Big Data fund, led by partner <a href="http://allthingsd.com/20110118/accels-ping-li-compares-the-cloud-to-the-mainframe/">Ping Li</a>.</p>
<p>The round brings Sumo Logic&#8217;s total capital raised to $50.5 million. Prior investors include Greylock Partners &#8212; Workday CEO Aneel Bhusri sits on the board &#8212; and Sutter Hill Ventures.</p>
<p>Sumo Logic&#8217;s approach to handling logs is to treat them like any other big-data problem. It uses cloud-based software to analyze, monitor and visualize data generated by machines in real time. System logs are just another one of those collections of data that used to be considered disposable or difficult to sort through. Sumo aims to make them a key source of insight into how well &#8212; or not &#8212; IT infrastructure is running.</p>
<p>Sumo often gets mentioned in the same breath as Splunk, the analytics company that specializes in <a href="http://allthingsd.com/20110919/seven-questions-for-splunk-ceo-godfrey-sullivan/">machine-generated data</a> and which had a <a href="http://allthingsd.com/20120419/and-its-off-splunk-rockets-108-percent-in-ipo-debut/">successful IPO</a> earlier this year. Splunk even spun off a company called Loggly that specializes in log management.</p>
<p>I talked with Sumo Logic&#8217;s CEO Vance Loiselle on Monday. He said that companies have been looking for years for a good way to easily make use of their log files. &#8220;In the last couple of years, we&#8217;ve seen more and more companies are generating tons of log data, but also all kinds of unstructured data,&#8221; Loiselle said. &#8220;That could be for servers that are or are not running well, or applications that you need to know specific analytic data about its performance, what your customers are using, and what kind of performance they&#8217;re seeing.&#8221;</p>
<p>Companies have been looking for a way to get ahold of that data and do a deep dive on the patterns that can be found in it, in order to decide if changes need to be made or to see how different parts of the business are performing. Doing that analytics work in the cloud, and thus saving on the purchase of extra storage hardware plus software, is sort of a no-brainer, but up to now that&#8217;s how this work has tended to be done.</p>
<p>The company launched only this February. More than 800 companies have tried the free service, and already more than 40 have traded up to the paid service. Among them are Netflix, Ooyala and Limelight Networks.</p>
<p>When you&#8217;re rolling out a new application, the last thing you need is some weird error cropping up somewhere in the chain of servers or on the network and getting in the way. Sumo Logic&#8217;s engine quickly crunches through the log data and can be used to pinpoint exactly where the problems are so they can be corrected, Loiselle said. It also works with log files generated by virtual machines.</p>
<p>Loiselle said the investment will help the company grow its sales and marketing operation, but also to do some of the heavy lifting needed to build up some new core features. &#8220;We need to bring in as many high-caliber inside and outside sales people as we can because the opportunity before us is so big. First we&#8217;ll scale up in North America, and then we&#8217;ll start thinking about what we&#8217;re going to do in Europe.&#8221; There&#8217;s also a huge need to hire more engineers, he said. &#8220;We&#8217;ve doubled the size of the engineering team over the last six months, and basically we have to double it again.&#8221;</p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20121128/sumo-logic-generating-big-data-from-log-files-lands-30-million-from-accel/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How IBM Is Watching How You Shop Online</title>
		<link>http://allthingsd.com/20121123/how-ibm-is-watching-how-you-shop-online/</link>
		<comments>http://allthingsd.com/20121123/how-ibm-is-watching-how-you-shop-online/#comments</comments>
		<pubDate>Fri, 23 Nov 2012 23:16:43 +0000</pubDate>
		<dc:creator>Arik Hesseldahl</dc:creator>
				<category><![CDATA[Commerce]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[Mobile]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Social]]></category>
		<category><![CDATA[Amazon]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Black Friday]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[commerce]]></category>
		<category><![CDATA[Craig Hayman]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[online shopping]]></category>
		<category><![CDATA[shopping]]></category>
		<category><![CDATA[Twitter]]></category>
		<category><![CDATA[Web]]></category>

		<guid isPermaLink="false">http://allthingsd.com/?p=272279</guid>
		<description><![CDATA[Bet you didn't know Big Blue was watching, did you?]]></description>
				<content:encoded><![CDATA[<p><a href="http://allthingsd.com/20110714/ibms-cloud-is-big-in-japan-with-two-new-data-centers/eyebeeem-feature/" rel="attachment wp-att-98049"><img src="http://allthingsd.com/files/2011/07/eyebeeem-feature-380x285.png" alt="" title="eyebeeem-feature" width="380" height="285" class="alignright size-Featured wp-image-98049" /></a>Starting yesterday and continuing into today, computing giant IBM has been putting out quick reports on the <a href="http://allthingsd.com/20121123/mobile-thursday-smartphone-shopping-is-still-tiny-but-its-this-years-big-online-buzzword/">state of online shopping</a>. </p>
<p>Apparently, this is now a officially a thing, so here are some stats taken from the latest snapshot as of 3 pm ET, because we just know you&#8217;re not shopping on a tablet, you&#8217;re on the edge of your seat waiting to hear about how many others are:</p>
<ul>
<li>Online sales are up 20 percent for this same time period over Black Friday 2011.</p>
<li>The number of consumers using a mobile device to visit a retailer&#8217;s site is at 28 percent, up from 18.1 percent in 2011.
<li>The number of consumers using their mobile device to make a purchase is 14.3 percent, up from 10.3 percent in 2011.
<li>Shoppers using the iPad led to more retail purchases more often per visit than other mobile devices, with conversion rates reaching 4.2 percent, higher than all other mobile devices.
<li>Shoppers referred from social networks like Facebook and Twitter generated 0.18 percent of all online sales on Black Friday.</ul>
<p>So, you might be wondering how IBM gets all this info. It&#8217;s all part of its strategic play in the world of big data, essentially helping companies make more sense of the huge troves of data they&#8217;ve gathered that were previously being ignored. Smarter Commerce is the area of IBM devoted to helping retailers better understand that data so they can come up with improved ideas concerning how to sell more stuff. </p>
<p>Where they gather that data is the IBM Benchmark. It&#8217;s a cloud-based digital analytics platform that soaks up digital information about how consumers respond to different ways of selling things online, 24 hours a day, seven days a week, all year long, from 500 different online retailers. IBM won&#8217;t name them &#8212; they joined the network under condition of anonymity &#8212; but Big Blue says the companies that participate include about half of the companies named on the <a href="http://www.internetretailer.com/top500/list/">Internet Retailer Top 100 list</a>. A lot of the technology comes from Coremetrics and Unica, acquisitions IBM made in 2010. </p>
<p>Last year, I talked about all this with Craig Hayman, IBM&#8217;s VP of the WebSphere, Application and Integration Middleware Software Division of the IBM Software Group. One quote from <a href="http://allthingsd.com/20110726/seven-questions-about-smarter-commerce-with-ibms-craig-hayman/">that conversation</a> sticks out in my memory; it bears repeating here:</p>
<blockquote class="memo"><p>&#8220;If you think about consumers, and you think about the amount of technology that they have at their hands, to reach out to read reviews and talk to friends and families, they’re incredibly empowered. There’s not one purchase decision that they make that is not impacted by some element of social networks. What does that do to the companies that have to deal with that by offering the best products and services, and you see companies are struggling to do that: To make the right offer at the right time with the right price. When they do it well, we all talk about how it went well; and when they do it badly, we talk about how annoying it was.&#8221;</p></blockquote>
<p>Now you know. Not only are retailers and your credit card companies watching you shop, so is IBM.</p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20121123/how-ibm-is-watching-how-you-shop-online/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>