<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Strata &#187; Tyler Bell</title>
	<atom:link href="http://strata.oreilly.com/tylerb/feed" rel="self" type="application/rss+xml" />
	<link>http://strata.oreilly.com</link>
	<description>Making Data Work</description>
	<lastBuildDate>Tue, 21 May 2013 19:54:11 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>That&#8217;s it &#8212; I&#8217;m taking my data and going home</title>
		<link>http://strata.oreilly.com/2013/02/thats-it-im-taking-my-data-and-going-home.html</link>
		<comments>http://strata.oreilly.com/2013/02/thats-it-im-taking-my-data-and-going-home.html#comments</comments>
		<pubDate>Thu, 21 Feb 2013 14:00:20 +0000</pubDate>
		<dc:creator>Tyler Bell</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[strata]]></category>
		<category><![CDATA[Strata SC 2013]]></category>
		<category><![CDATA[strata session sc 2013]]></category>
		<category><![CDATA[strataconf]]></category>

		<guid isPermaLink="false">http://strata.oreilly.com/?p=54720</guid>
		<description><![CDATA[Russia&#8217;s railway gauge is different from Western Europe&#8217;s. At the border of the former Soviet states, the Russian gauge of 1.524m meets the European &#38; American &#8216;Standard&#8217; gauge of 1.435m. The reasons for this literal disconnect arise from discussions between &#8230; ]]></description>
				<content:encoded><![CDATA[<p>Russia&#8217;s railway gauge is different from Western Europe&#8217;s. At the border of the former Soviet states, the Russian gauge of 1.524m meets the European &amp; American &#8216;Standard&#8217; gauge of 1.435m. The reasons for this literal disconnect arise from discussions between the Tsar and his War Minister. When asked the most effective way to prevent Russia&#8217;s own rail lines being used against them in times of invasion, the Minister suggested a different gauge to prevent supply trains rolling through the border. The artifact of this decision remains visible today at all rail crossings between Poland and Belarus or Slovakia and Ukraine. The rail cars are jacked up at the border, new wheels inserted underneath, and the car lowered again. It is about a 2-4 hour time burn for each crossing.</p>
<p>Per head, per crossing, over 170 years, is a heck of a lot of resource wasted. But to change it would entail changing the rail stock of the entire country and realigning about 225,000 km (140,000 mi) of track.</p>
<p>Talk about technical debt.</p>
<p>Data suffers from a similar disconnect. It really wasn&#8217;t until the advent of XML 15 years ago that we had an agreed (but not entirely satisfactory) mechanism for storing arbitrary data structures outside the application layer. This is as much a commentary on our technical priorities as it is a social indictment. We are simply not good at playing with others when it comes to data.</p>
<p><span id="more-54720"></span></p>
<p>Think otherwise? Cast back to last year&#8217;s tit-for-tat shortly after Facebook bought Instagram: Twitter responded by disabling Instagram&#8217;s ability to import their friends from Twitter, followed by Instagram&#8217;s disabling their Twitter cards integration, not as retaliation, of course, but rather because viewing images on Instagram itself is &#8220;a better experience&#8221; for consumers. A poor excuse really; this was a battle over users-as-data. Most commonly in these battles, the consumer pays the price.</p>
<p>Corporate attitudes towards data are still very focused on this idea of data as proprietary value.  My data is mine  and yours is yours (even if it was created by our users). The Promethean gift of big data tools has not lessened this attitude.  Companies have now the means to handle all the data they gather and keep it even closer to their bosom, and keep the rail gauges disconnected.</p>
<p>This attitude remains prominent but is fundamentally atavistic. The fact is, data is not natively a zero sum game. At <a href="http://factual.com">Factual</a> we manage data for 65 million small businesses and landmarks in 50 countries. The size and fluidity of this dataset demands the combined attentions of multiple organizations, so we&#8217;ve designed technical connectors to help with contributions, diff streams to broadcast edits, and a legal framework that ensures that if you improve our data, we share ownership. <a href="http://strataconf.com/strata2013/public/schedule/detail/27302">At Strata next week</a>, I will be going into detail about why we think a shared approach is the only approach here, and why it is bad principle to keep the passengers waiting.</p>
]]></content:encoded>
			<wfw:commentRss>http://strata.oreilly.com/2013/02/thats-it-im-taking-my-data-and-going-home.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Snap to the graph, not the grid</title>
		<link>http://strata.oreilly.com/2011/05/data-coordinates-location.html</link>
		<comments>http://strata.oreilly.com/2011/05/data-coordinates-location.html#comments</comments>
		<pubDate>Tue, 03 May 2011 13:00:00 +0000</pubDate>
		<dc:creator>Tyler Bell</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[@home]]></category>
		<category><![CDATA[coordinates]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[geo]]></category>
		<category><![CDATA[local]]></category>
		<category><![CDATA[points of interest]]></category>

		<guid isPermaLink="false">http://blogs.oreilly.com/radar/2011/05/data-coordinates-location.html</guid>
		<description><![CDATA[Coordinate pairs are regular and orderly, but they are entirely ambiguous when used to represent more conceptual places like states, cities, stores and neighborhoods. ]]></description>
				<content:encoded><![CDATA[<p>About 10 years ago, &#8220;geo&#8221; and &#8220;local&#8221; were two very distinct product verticals in most Internet companies. This was in part because what we now think of as local arose out of a Yellow Pages market, while the mapping use case arose from more enterprise, maps-and-navigation origins.</p>
<p>At the time, most Internet businesses were keen to perpetuate extant geo-local user experiences, rather than seeing them as crude, if profitable, interim steps to something better.  Nowhere was this more evident than at my former employer, Yahoo: as late as 2006, investment cases were still being pitched for <em>either</em> maps <em>or</em> local; the bulk of Yahoo&#8217;s geo engineering team resided in an entirely different country; and the mobile group had its own building with few, if any, connections to the rest of the company.  </p>
<p>Today, entrepreneurs and execs know that local, geo, and mobile are not distinct product groups and technology stacks, but rather essential components of a unified toolset that better connects people with the world around them.</p>
<p>Nothing better reflects the diversity of this geo toolkit than the products on display at the recent <a href="http://where2conf.com/where2011">Where 2.0 conference</a>.  The three-day event embraced the full spectrum of all things spatial: coding, licensing, marketing, daily deals, 3D visualization, and a bit of mapping thrown in for the traditionalists in the audience.</p>
<p>In this context I gave a short presentation on &#8220;<a href="http://where2conf.com/where2011/public/schedule/detail/17201">Big Data, Big Local</a>.&#8221; I&#8217;ve embedded the slides below.  Although I anticipate receiving an award for including Borat, Homer Simpson, Bender, Winston Churchill, porn shops, kittens, surfing, and unicorns coherently in a single presentation on geo &mdash; the deck does not stand well on its own.  Go figure.</p>
<p align="center">
<p>I wanted therefore to provide a brief overview here, and perhaps tackle it more fully in subsequent posts.  This is not intended to be a detailed argument, simply an accompanying narrative to the deck.</p>
<p>The general tenor of the discussion focuses upon how we have traditionally employed coordinates as the final word in how we position things and people in the physical world.  The problem with this approach is that a coordinate pair lacks context &mdash; they are regular and orderly, which is great for machines, but are entirely ambiguous when used to represent more conceptual places like states, cities, stores and neighborhoods. That&#8217;s not so good for people.  In fact, a single coordinate pair can be used to represent all these places, and their different associations, concurrently.</p>
<p>Social location has moved us toward a new form of positioning: by business or points of interest (POI).  These geo-referenced entities exist across the world in an irregular and poorly typed graph, but they are rich with context.  People exist and events happen at places, not coordinates &mdash; social location applications &#8220;snap&#8221; us to these nodes, and their context becomes ours.</p>
<p>We are stretching the traditional use of business listings beyond their limits.  Looking back, however, you can see how they&#8217;ve been continuously extended to fit new models of bringing consumers and businesses closer together.  You&#8217;ll soon see listings data act as hooks in the mobile payments space, and they will become increasingly pivotal in creating rich topological graphs among people, brands, and physical places.</p>
<p>We can do great things with business listings and POI as geo-commercial beacons, but they come with their own problems, two of which I cover in the deck.  The first is a more pedestrian example of name canonicalization: stores in the same retail / restaurant chain are represented today in too many forms. The evidence presented by variants on &#8220;Subway Sandwiches&#8221; in <a href="http://www.slideshare.net/TylerBell/big-data-big-local/12">slide 12</a> (below) would suggest that current data providers expend comparatively little effort normalizing this content.  The problem may exist because this information is currently intended to serve an extant, limited use case &mdash; make the name discoverable using free text search and present to users &mdash; rather than drive new, data-driven markets.</p>
<p class="image-box-580">
<a href="http://www.slideshare.net/TylerBell/big-data-big-local/12"><img src="http://radar.oreilly.com/2011/04/28/0511-tylerb-slide12.png" width="580" border="0" alt="Poor / Absent Canonicalization" /></a></p>
<p>The second, more important issue is that we are faced on the Internet with a multitude of electronic incarnations of one physical entity. While intended to enhance discoverability across properties, the current situation effects a near-complete inability to understand how users engage electronically with a physical place outside a single caisson such as Facebook, Twitter, or Yelp.  This need is increasingly important, because URLs are now often employed as surrogates for a place &mdash; they can be resolved to machine-readable attributes and in fact act exactly like nodes in a graph of <a href="http://radar.oreilly.com/2010/11/semantic-web-linked-data.html">linked data</a>. </p>
<p>This is a problem that requires solving, and we are tackling it at <a href="http://www.factual.com/">Factual</a>. I include some topical commentaries from Chris Dixon and Albert Wenger as artillery support in <a href="http://www.slideshare.net/TylerBell/big-data-big-local/18">slide 18</a>:</p>
<p class="image-box-580">
<a href="http://www.slideshare.net/TylerBell/big-data-big-local/18"><img src="http://radar.oreilly.com/2011/04/28/0511-tylerb-slide18.png" width="580" border="0" alt="The need for common namespaces" /></a></p>
<p>So the problem is that we are collectively going to need this normalization, very soon.  I conclude noting that Factual&#8217;s duty does not end with the creation of tab-delineated files. Instead, we are looking to create data-centric platforms that instill order on an increasingly chaotic Internet, and ensure that the result is meeting the future business needs of our contemporaries in the local space.</p>
<p></p>
<p><strong>Related:</strong></p>
<ul>
<li> <a href="http://www.youtube.com/watch?v=-j1ZraoILUc">Video: Tyler Bell on how big data will shape location</a></li>
<li> <a href="http://radar.oreilly.com/2011/03/messy-location-data.html">Why location data is a mess, and what can be done about it</a></li>
<li> <a href="http://radar.oreilly.com/2010/11/semantic-web-linked-data.html">Where the semantic web stumbled, linked data will succeed</a></li>
<li> <a href="http://radar.oreilly.com/2010/09/lets-pull-it-together.html">Toward a local syzygy: aligning deals, check-ins and places</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://strata.oreilly.com/2011/05/data-coordinates-location.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Big Data: An opportunity in search of a metaphor</title>
		<link>http://strata.oreilly.com/2011/02/big-data-metaphor.html</link>
		<comments>http://strata.oreilly.com/2011/02/big-data-metaphor.html#comments</comments>
		<pubDate>Thu, 10 Feb 2011 14:00:00 +0000</pubDate>
		<dc:creator>Tyler Bell</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[@home]]></category>
		<category><![CDATA[@top]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[language]]></category>
		<category><![CDATA[strataconf]]></category>

		<guid isPermaLink="false">http://blogs.oreilly.com/radar/2011/02/big-data-metaphor.html</guid>
		<description><![CDATA[ Big data is a massive opportunity, but the language used to describe it (&#34;goldrush,&#34; &#34;data deluge,&#34; &#34;firehose,&#34; etc.) reveals we&apos;re still searching for its identity. ]]></description>
				<content:encoded><![CDATA[<p><img src="http://radar.oreilly.com/2011/02/08/020811-stratajobboard.jpg" border="0" alt="Strata job board" width="300" style="float: right;margin: 3px 0 10px 10px" /><br />
The crowd at the <a href="http://strataconf.com/strata2011">Strata Conference</a> could be divided into two broad contingents: </p>
<ol>
<li>Those attending to learn more about data, having recently discovered its potential.</li>
<li>Long-time data enthusiasts watching with mixed emotions as their interest is legitimized, experiencing a feeling not unlike when a band that you&#8217;ve been following for years suddenly becomes popular. </li>
</ol>
<p>A data-oriented event like this, outside a specific vertical, could not have drawn a large crowd with this level of interest, even two years ago. Until recently, data was mainly an artifact of business processes.  It now takes center stage; organizationally, data has left the IT department and become the responsibility of the product team.</p>
<p>
Of course &#8220;data,&#8221; in its abstract sense, has not changed. But our ability to obtain, manipulate, and comprehend data certainly has. Today, data merits top billing due to a number of confluent factors, not least its increased accessibility via on-demand platforms and tools.   Server logs are the new cash-for-gold: act now to realize the neglected riches within your upper drive bay.</p>
<p>
But the idea of &#8220;big data&#8221; as a discipline, as a conference subject, or as a business, remains in its formative years and has yet to be satisfactorily defined.  This immaturity is perhaps best illustrated by the array of language employed to define big data&#8217;s merits and its associated challenges.  Commentators are employing very distinct wording to make the ill-defined idea of &#8220;big data&#8221; more familiar; their metaphors fall cleanly into three categories:</p>
<ul>
<li> <strong>Natural resources</strong> (&#8220;the new oil,&#8221; &#8220;goldrush&#8221; and of course &#8220;data mining&#8221;): Highlights the singular value inherent in data, tempered by the effort required to realize its potential.</li>
<li> <strong>Natural disasters</strong> (&#8220;data tornado,&#8221; &#8220;data deluge,&#8221; data tidal wave&#8221;): Frames data as a problem of near-biblical scale, with subtle undertones of assured disaster if proper and timely preparations are not considered.</li>
<li> <strong>Industrial devices</strong> (&#8220;data exhaust,&#8221; &#8220;firehose,&#8221; &#8220;Industrial Revolution&#8221;): A convenient grab-bag of terminologies that usually portrays data as a mechanism created and controlled by us, but one that will prove harmful if used incorrectly.</li>
</ul>
<p>If Strata&#8217;s Birds-of-a-Feather conference sessions are anything to go by, the idea of &#8220;big data&#8221; requires the definition and scope these metaphors attempt to provide. Over lunch you could have met with like-minded delegates to discuss big data analysis, cloud computing, Wikipedia, peer-to-peer collaboration, real-time location sharing, visualization, data philanthropy, Hadoop (natch&#8217;), data mining competitions, dev ops, data tools (but &#8220;not trivial visualizations&#8221;), Cassandra, NLP, GPU computing, or health care data.  There are two takeaways here: the first is that we are still figuring out what big data is and how to think about it; the second is that any alternative is probably an improvement on &#8220;big data.&#8221;</p>
<p>Strata is about &#8220;making data work&#8221; &mdash; the tenor of the conference was less of a &#8220;how-to&#8221; guide, and more about defining the problem and shaping the discussion.  Big data is a massive opportunity; we are searching for its identity and the language to define it.</p>
]]></content:encoded>
			<wfw:commentRss>http://strata.oreilly.com/2011/02/big-data-metaphor.html/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Where the semantic web stumbled, linked data will succeed</title>
		<link>http://strata.oreilly.com/2010/11/semantic-web-linked-data.html</link>
		<comments>http://strata.oreilly.com/2010/11/semantic-web-linked-data.html#comments</comments>
		<pubDate>Mon, 15 Nov 2010 14:00:00 +0000</pubDate>
		<dc:creator>Tyler Bell</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[@home]]></category>
		<category><![CDATA[@top]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[future of search]]></category>
		<category><![CDATA[geo]]></category>
		<category><![CDATA[linked data]]></category>
		<category><![CDATA[Semantic Web]]></category>

		<guid isPermaLink="false">http://blogs.oreilly.com/radar/2010/11/semantic-web-linked-data.html</guid>
		<description><![CDATA[Linked data can be realized without the purity of semantic annotation, but a focus on consumers gives it a better shot at adoption. It begs the question: Why invest in difficult technologies if consumer outcomes can be realized with current tools and knowledge? ]]></description>
				<content:encoded><![CDATA[<p>In the same way that the <a href="http://en.wikipedia.org/wiki/Holy_Roman_Empire">Holy Roman Empire</a> was neither holy nor Roman, Facebook&#8217;s  <a href="http://opengraphprotocol.org">OpenGraph Protocol</a> is neither open nor a protocol.  It is, however, an extremely straightforward and applicable standard for document metadata. From a strictly semantic viewpoint, OpenGraph is considered hardly worthy of comment: it is a frankenstandard, a mishmash of microformats and loosely-typed entities, lobbed casually into the semantic web world with hardly a backward glance.</p>
<p>But this is not important. While OpenGraph avoids, or outright ignores, many of the problematic issues surrounding semantic annotation (see Alex Iskold&#8217;s excellent <a href="http://radar.oreilly.com/2010/05/facebook-open-graph-and-the-se.html">commentary on OpenGraph here on Radar</a>), criticism focusing only on its technical purity is missing half of the equation. Facebook gets it right where other initiatives have failed. While OpenGraph is incomplete and imperfect, it is immediately usable and sympathetic with extant approaches.  Most importantly, OpenGraph is one component in a wider ecosystem. Its deployment benefits are apparent to the consumer and the developer: add the metatags, get the &#8220;likes,&#8221; know your customers.</p>
<p>Such consumer causality is critical to the adoption of any semantic mark-up.  We&#8217;ve seen it before with microformats, whose eventual popularity was driven by their ability to <a href="http://radar.oreilly.com/2009/05/google-rich-snippets-semantic-web.html">improve how a page is represented in search engine listings</a>, and not by an abstract desire to structure the unstructured.  Successful adoption will often entail sacrificing standardization and semantic purity for pragmatic ease-of-use; this is where the semantic web appears to have stumbled, and where <a href="http://linkeddata.org/">linked data</a> will most likely succeed.</p>
<p>Linked data intends to make the Web more interconnected and data-oriented.  Beyond this outcome, the term is less rigidly defined.  I would argue that linked data is more of an ethos than a standard, focused on providing context, assisting in disambiguation, and increasing serendipity within the user experience.  This idea of linked data can be delivered by a number of existing components that work together on the data, platform, and application levels:</p>
<ul>
<li><strong>Entity provision</strong>: Defining the who, what, where and when of the Internet, entities encapsulate meaning and provide context by type.  In its most basic sense, an entity is one row in a list of things organized by type &#8212; such as people, places, or products &#8212; each with a unique identifier.  Organizations that realize the benefits of linked data are releasing entities like never before, including the publication of 10,000 subject headings by the <a href="http://data.nytimes.com/">New York Times</a>, admin regions and postcodes from the UK&#8217;s <a href="http://blog.ordnancesurvey.co.uk/2010/11/linked-data-at-ordnance-survey/">Ordnance Survey</a>, placenames from <a href="http://developer.yahoo.com/geo/geoplanet/data/">Yahoo GeoPlanet</a>, and the data infrastructures being created by <a href="http://www.factual.com/">Factual</a> [disclosure: I've just signed on with Factual].</li>
<li><strong>Entity annotation</strong>: There are numerous formats for annotating entities when they exist in unstructured content, such as a web page or blog post.  Facebook&#8217;s OpenGraph is a form of entity annotation, as are HTML5 <a href="http://www.google.com/support/webmasters/bin/answer.py?answer=176035">microdata</a>, <a href="http://en.wikipedia.org/wiki/RDFa">RDFa</a>, and microformats such as <a href="http://en.wikipedia.org/wiki/Hcard">hcard</a>.  Microdata is the shiny, new player in the game, but see Evan Prodromou&#8217;s great <a href="http://evan.prodromou.name/RDFa_vs_microformats">post on RDFa v. microformats</a> for a breakdown of these two more established approaches.</li>
<li><strong>Endpoints and Introspection</strong>: Entities contribute best to a linked data ecosystem when each is associated with a Uniform Resource Identifier (URI), an Internet-accessible, machine readable endpoint.  These endpoints should provide <em>introspection</em>, the means to obtain the properties of that entity, including its relationship to others.  For example, the Ordnance Survey URI for the &#8220;City of Southampton&#8221; is <a href="http://data.ordnancesurvey.co.uk/id/7000000000037256">http://data.ordnancesurvey.co.uk/id/7000000000037256</a>. Its properties can be retrieved in machine-readable format (RDF/XML,Turtle and JSON) by appending an &#8220;rdf,&#8221; &#8220;ttl,&#8221; or &#8220;json&#8221; extension to the above.  To be properly open, URIs must be accessible outside a formal API and authentication mechanism, exposed to semantically-aware web crawlers and search tools such as <a href="http://developer.yahoo.com/search/boss/structureddata.html">Yahoo BOSS</a>. Under this definition, local business URLs, for example, can serve in-part as URIs &#8212; &#8216;view source&#8217; to see the semi-structured data in these listings from <a href="http://www.yelp.com/biz/cin-cin-wine-bar-los-gatos-2">Yelp</a> (using hcard and OpenGraph), and <a href="http://foursquare.com/venue/18645">Foursquare</a> (using microdata and OpenGraph).</li>
<li><strong>Entity extraction</strong>: Some linked data enthusiasts long for the day when all content is annotated so that it can be understood equally well by machines and humans.  Until we get to that happy place, we will continue to rely on entity extraction technologies that parse unstructured content for recognizable entities, and make contextually intelligent identifications of their type and identifier.  <a href="http://en.wikipedia.org/wiki/Named_entity_recognition">Named entity recognition</a> (NER) is one approach that employs the above entity lists, which may also be combined with heuristic approaches designed to recognize entities that lie outside of a known entity list.  Yahoo, Google and Microsoft are all hugely interested in this area, and we&#8217;ll see an increasing number of startups like <a href="http://www.headup.com/">Semantinet</a>  emerge with ever-improving precision and recall.  If you want to see how entity extraction works first-hand, check out Reuters-owned <a href="http://viewer.opencalais.com/">Open Calais</a> and experiment with their form-based tool.</li>
<li><strong>Entity concordance and crosswalking</strong>: The <a href="http://gigaom.com/2010/05/07/the-great-open-database-of-place-pages-in-the-sky/">multitude of place namespaces</a> illustrates how a single entity, such as a local business, will reside in multiple lists.   Because the &#8220;unique&#8221; (U) in a URI is unique only to a given namespace, a world driven by linked data requires systems that explicitly match a single entity across namespaces.  Examples of crosswalking services include: <a href="http://blog.placecast.net/post/489490648/opening-the-placecast-match-api">Placecast&#8217;s Match API</a>, which returns the Placecast IDs of any place when supplied with an <a href="http://en.wikipedia.org/wiki/HCard">hcard</a> equivalent; <a href="http://developer.yahoo.com/geo/geoplanet/guide/api-reference.html#api-concordance">Yahoo&#8217;s Concordance</a>, which returns the Where on Earth Identifier (WOEID) of a place using as input the place ID of one of fourteen external resources, including OpenStreetMap and Geonames; and the <a href="http://www.guardian.co.uk/open-platform/blog/linked-data-open-platform"> Guardian Content API</a>, which allows users to search Guardian content using non-Guardian identifiers. These systems are the unsung heroes of the linked data world, facilitating interoperability by establishing links between identical entities across namespaces.  Huge, unrealized value exists within these applications, and we need more of them.</li>
<li><strong>Relationships</strong>: Entities are only part of the story. The real power of the semantic web is realized in knowing how entities of different types relate to each other: actors to movies, employees to companies, politicians to donors, restaurants to neighborhoods, or brands to stores.  The power of all graphs &#8212; these networks of entities &#8212; is not in the entities themselves (the nodes), but how they relate together (the edges).  However, I may be alone in believing that we need to nail the problem of multiple instances of the same entity, via concordance and crosswalking, before we can tap properly into the rich vein that entity relationships offer.</li>
</ul>
<p>The approaches outlined above combine to help publishers and application developers provide intelligent, deep and serendipitous consumer experiences. Examples include the semantic handset from <a href="http://techcrunch.com/2010/10/27/aro-mobile/">Aro Mobile</a>, the BBC&#8217;s <a href="http://www.bbc.co.uk/blogs/bbcinternet/2010/07/the_world_cup_and_a_call_to_ac.html">World Cup experience</a>, and <a href="http://www.insidefacebook.com/2010/11/09/aggregated-mentions-machine-reading/"> aggregating references on your Facebook news feed</a>.</p>
<p>Linked data will triumph in this space because efforts to date focus less on the <em>how</em> and more on the <em>why</em>.  RDF, SPARQL, OWL, and triple stores are onerous. URIs, micro-formats, RDFa, and JSON, less so.  Why invest in difficult technologies if consumer outcomes can be realized with extant tools and knowledge?  We have the means to realize linked data now &#8212; the pieces of the puzzle are there and we (just) need to put them together.</p>
<p>Linked data is, at last, bringing the discussion around to the user. The consumer &#8220;end&#8221; trumps the semantic &#8220;means.&#8221; </p>
<p></p>
<p><strong>Related:</strong></p>
<ul>
<li> <a href="http://radar.oreilly.com/2010/05/facebook-open-graph-and-the-se.html">Facebook Open Graph: A new take on semantic web</a></li>
<li> <a href="http://radar.oreilly.com/2010/08/integrating-the-semantic-web-w.html">Linked data is opening 800 years of UK legal info</a></li>
<li><a href="http://radar.oreilly.com/2009/05/google-rich-snippets-semantic-web.html">Google&#8217;s Rich Snippets and the Semantic Web</a></li>
</ul>
<p></p>
]]></content:encoded>
			<wfw:commentRss>http://strata.oreilly.com/2010/11/semantic-web-linked-data.html/feed</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
	</channel>
</rss>
