<?xml version="1.0" encoding="UTF-8" ?>
<rss xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">
        <channel>
        <title>structured-data - ReadWrite</title>
        <link>http://readwrite.com</link>
        <description />
        <language>en</language>
        <copyright>Copyright 2012 SAY Media, Inc.</copyright>
        <managingEditor>readwriteweb@gmail.com</managingEditor>
        <docs>http://blogs.law.harvard.edu/tech/rss</docs> 
        <lastBuildDate>Tue, 12 Mar 2013 10:33:00 -0700</lastBuildDate>
        <atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://rww.superfeedr.com/" />

                    <item>
                <title><![CDATA[Attacking Big Data Old-School Style - With VMware's SQLFire]]></title>
                <description><![CDATA[
                                        <img src="http://readwrite.com/files/styles/800_450sc/public/fields/shutterstock_6397828.jpg" />
                                        <p class="p1"><a href="http://www.vmware.com/" target="_blank"><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/vmware_300x60_contributed.jpg" style="" />
			</span>
</a></p>
<p class="p1">As more and more information floods into the Internet, organizing and making sense of this <a href="http://readwrite.com/2012/03/05/big-data">Big Data</a> becomes more important and more difficult.</p>
<p class="p1">New database methods are emerging to help process unstructured data, but IT developers and database deployers also have to figure out how to deal with the world of legacy technology.</p>
<p class="p1">For the last 40 years, relational database programs (usually powered by <a href="http://en.wikipedia.org/wiki/SQL">SQL</a>-based management systems) have been the backbone of supplying businesses with organized rows and columns of data. The problem is that these legacy systems may not be able to work together to give businesses the information they need when they need it. Older programs may also have trouble processing data requests over long distances.</p>
<h2 class="p2">A New Way Of Thinking About Databases</h2>
<p class="p1">A new way of thinking is needed. Over the past decade the push for “not only SQL” or <a href="http://en.wikipedia.org/wiki/NoSQL">NoSQL</a> database software has provided a pathway for businesses to connect bits and pieces of data from a variety of sources at very rapid speeds across different geographies.</p>
<p class="p3"><strong>(See also </strong><strong><a href="http://readwrite.com/2013/02/20/whats-next-for-taming-big-data">What's Next For Taming Big Data?</a><span style="line-height: 1.538em;">)</span></strong></p>
<p class="p1">Some businesses are spreading out the workloads using noSQL databases within cloud computing-based networks. Others are approaching the problem still using traditional SQL relational database software - and that’s perfectly OK.</p>
<p class="p1">Previous articles in this series (<a href="http://readwrite.com/series/taming-big-data/">Taming Big Data</a>) discuss the benefits of a noSQL database tool like <a href="http://vmware.com/go/gemfire">VMware’s vFabric GemFire</a>. But SQL database software retains a well-established community of tens of thousands of developers and integrators who may be reluctant to move beyond the SQL they know and love. What’s a company to do?</p>
<p class="p3"><strong>(See also </strong><strong><a href="http://readwrite.com/2013/02/27/cloud-based-gemfire-makes-it-easier-to-work-with-big-data">VMware's Cloud-Based GemFire Makes It Easier To Work With Big Data</a>.)</strong><strong><br /></strong></p>
<p class="p1">For SQL diehards, <a href="http://www.vmware.com/go/sqlfcomm">VMware’s vFabric SQLFire</a> SQLFire is a distributed SQL database typically used for online transactions. The software is more modern than most traditional relational database management systems.</p>
<h2 class="p2">What Can SQLFire Do For Me?</h2>
<p class="p1">SQLFire functions and performs much like <a href="http://readwrite.com/2013/02/27/cloud-based-gemfire-makes-it-easier-to-work-with-big-data">GemFire</a> under the hood. SQLFire uses GemFire's data grid engine, which lets both programs capture data and then replicate and partition the information "in-memory” on the server. But instead of having to learn GemFire commands and controls, SQLFire has a user interface and programming framework that will be familiar to developers used to programming in a SQL interface and with SQL tools.</p>
<p class="p1">Backups are enabled through virtual copies on other connected servers, although data can be stored long-term on disks as needed.</p>
<p class="p1">Unlike other embedded databases, SQLFire allows several servers to store replicated and partitioned tables, persist data to disk, communicate directly with other servers and participate in distributed queries.</p>
<p class="p1">For traditional IT developers and database deployers, the SQLFire interface makes it easier to write applications and take advantage of GemFire’s underlying noSQL technology. Developers and integrators who know SQL well will have an easy time adapting SQLFire to new projects.</p>
<p class="p1">SQLFire is perfect for classic Web transactions, especially where there is a need for fast speeds and a requirement to dig deep into clusters of data.</p>
<h2 class="p2">Business Case For SQLFire</h2>
<p class="p1">In addition to making SQL developers feel comfortable, SQLFire can work across multiple networks and geographies. This comes in handy when enterprises need information at the moment it becomes available on multiple continents.</p>
<p class="p1">For example, a large regional bank in the Northeastern United States collects large amounts of data that helps it maintain its regional and branch offices. The bank also monitors customer transactions at tellers and various ATMs.</p>
<p class="p1">Bank management was interested in measuring the different types of transactions being handled at each of type of station, what types of accounts they were accessing and the various times of day the transactions took place.</p>
<p class="p1">Historically, the bank could attach an individual database to each branch, but in today's global environment the company decided it needed to measure all of these data points at the same time for each office. The company tested vFabric SQLFire against its own systems and found the existing server took 20 minutes to complete the queries while the SQLFire server completed its task in less than a minute.</p>
<h2 class="p2">Deploying SQLFire In The Enterprise</h2>
<p class="p1">In the enterprise sQLFire is generally found on inexpensive computer servers in database clusters. A typical use case would find SQLFire helping eliminate potential data bottlenecks in new mobile and Web environments. Another common deployment option for SQLFire is to integrate it with existing traditional databases or analytics programs.</p>
<p class="p1">The software can also be interfaced through an API using programming languages such as Java or <a href="http://www.springsource.org/">Spring</a>. SQLFire is also compatible with Java database (JDBC) or ADO.NET.</p>
<p class="p1">As companies look for new ways to make data accessible and provide a consistent view of that information, it's important to have tools that suit the needs of all kinds of developers and IT managers.</p>
<p class="p1">VMware’s <a href="http://www.vmware.com/products/application-platform/vfabric-gemfire/overview.html">GemFire</a> and <a href="http://www.vmware.com/products/application-platform/vfabric-sqlfire/overview.html">SQLFire</a> software are designed to address just those needs - allowing companies to move beyond concerns over speed and scale and tackle Big Data applications head on.</p>
<p class="p1"><a href="http://www.vmware.com/" target="_blank"><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/vmware_300x60_contributed.jpg" style="" />
			</span>
</a></p>
                    ]]></description>
                <link>http://readwrite.com/2013/03/12/attacking-big-data-old-school-style-with-vmware-sqlfire</link>
                <guid>http://readwrite.com/2013/03/12/attacking-big-data-old-school-style-with-vmware-sqlfire</guid>
                <category>Taming Big Data</category>
                <pubDate>Tue, 12 Mar 2013 10:33:00 -0700</pubDate>
                <author></author>
            </item>
                    <item>
                <title><![CDATA[2013: Say Goodbye To The Traditional Data Center]]></title>
                <description><![CDATA[
                                        <img src="http://readwrite.com/files/styles/800_450sc/public/fields/DataCenter%202.jpg" />
                                        <p>In 2013, traditional data centers will begin to lose their dominant status within the data-management food chain. They will increasingly be replaced&nbsp;by big-data software and lower-cost, ARM-based systems-on-chips.&nbsp;</p>
<p>When thinking about the future of data centers, the problem is one of scale. For the past few decades, relational databases and the attendant hardware that runs them have been able to manage pretty much anything a company could throw at them, but those days are coming to an end.</p>
<h2>When Relational Ruled The Land</h2>
<p>In the beginning, and for the first 20 years or so, data was heavily transactional, and was managed in discrete&nbsp;and very secure<strong>&nbsp;</strong>ways. Speed was less important than making sure the data was safe as houses.</p>
<p>In the 1990s, data began to be used in a slightly different way, as comapnies placed analytical demands on the data being gathered. Instead of being retreived in discrete packages, data became as a strategic asset to be analyzed, leading to the disciplines of business intelligence.&nbsp;Databases grew into massive data warehouses, and parallel querying arose as the only way to effectively manage the staggering workloads placed on information technology.</p>
<p>Through the early years of electronic data, growth in the volume of data may have been rapid, but data tools and infrastructure were pretty much able to keep pace.</p>
<p>That's not so true anymore. Software soon will not be able to cope with the overwhelming volume of data being generated, says Mike Hoskins, chief technology officer of <a href="http://www.pervasive.com/" target="_blank">Pervasive Software</a>. What's coming is a real break in how data is managed.</p>
<h2>Breaking The Old Model</h2>
<p>To give an idea of what kind of scale we're talking about, Hoskins points to U.S. retailer <a href="http://www.walmart.com/" target="_blank">Wal-Mart</a>'s estimated 1-petabyte data store.</p>
<p>"That's the accumulation of 40 years of Wal-Mart sized business," he said. "<a href="http://www.facebook.com" target="_blank">Facebook</a>? Facebook generates that much data in a week."</p>
<p>There's always a collection of data behind each transaction. But in e-commerce today, a customer can be clicking around quite a bit before buying, which leads to useful data sets tens, hundreds or thousands of times larger than "so-and-so bought widget X with credit card Y." Add the fact that the machines handling these activities are also recording machine-to-machine transactions, and the data workload explodes beyond the capacity of any traditional data center.&nbsp;</p>
<p>"We are reaching the end of the useful life" of our data centers, Hoskins said. "The bottom line is, it's a death march."</p>
<p>Even if conventional software could manage this explosion, no company could afford it. Not to mention the energy costs invovled in buying, running and cooling the hardware.</p>
<p>Indeed, it is innovation in hardware that's going to provide the evolutionary break that Big Data requires. Servers with <a href="http://readwrite.com/2012/12/11/cheating-deathwatch-arm-holdings-holds-out" target="_blank">ARM</a>-based processors, which absorb something like 20 times less power than Intel-based processors, are&nbsp;<a title="http://readwrite.com/2012/11/01/arm-vs-intel-servers-the-size-of-a-smartphone" href="http://readwrite.com/2012/11/01/arm-vs-intel-servers-the-size-of-a-smartphone">the next wave in data center infrastructure</a>. Lower power requirements, after all, mean less resistance and less heat. Less heat means less money wasted on cooling and the ability to compress ARM-based systems closer together.</p>
<p>As energy and general hardware costs coem down, hardware is lined up to take care of the new data workloads of this new massive scale of data.</p>
<h2>First Hardware - Then Software</h2>
<p>On the software side, Big Data will increasingly be handled by&nbsp;Hadoop systems that can store data and manage and analyze Facebook-scale loads.</p>
<p>If you're wondering why this is supposed to be big news, think about it this way: Relational databases have been handling data of all shapes and sizes for decades, and now there will be a certain level of data that the traditional data center architecture will simply be unable to handle.&nbsp;It's the first stratification of data management.&nbsp;</p>
<p>On one level of data management,&nbsp;relational databases will still be around, supporting smaller, less complex and more tactical workloads. But on this new level, whole new architectures will be created to deal with this scale.</p>
<div>Big Data in the form of Hadoop-based architectures is but the first step into the future.&nbsp;In the past, data managers had to heavily pre-process data to get it to fit within a certain schema for use in a relational database. Today, they're&nbsp;foregoing the pre-processing and are shoving the unformatted data into commodity Hadoop clusters. To perform analytical work, data managers are pulling refined data back into databases and other analytical tools.&nbsp;</div>
<h2>What's The Data Center Endgame?</h2>
<p>This half-way approach is not the end game, though.</p>
<p>Eventually, Hoskins believes, tools will be built into the Hadoop framework that will enable data managers to run applications and analysis right where the data lives, inside the Hadoop clusters.</p>
<p>It's no accident then that the latest iteration of one of Hadoop's core components - <a href="http://hadoop.apache.org/docs/r0.23.0/hadoop-yarn/hadoop-yarn-site/YARN.html" target="_blank">MapReduce 2.0, code-named YARN</a> - includes the beginnings of a framework that will let developers build exactly those kinds of tools inside Hadoop. This is something that the VP of Apache Hadoop Arun Murthy confirmed to me early this year at the <a href="http://strataconf.com/strata2012" target="_blank">Strata&nbsp;Conference</a>&nbsp;in Santa Clara, California. When the YARN application framework is robust enough, Hadoop will be able to let developers code those applications.</p>
<p>This will be the new way of working with data as it gets too big for relational databases to handle: a new architecture of low-cost, low-power servers that will keep applications and data as close to each other as possible, in order to maximize efficiency and speed.</p>
<p>"Relational database technology has had a good run," Hoskins said. But the days of the relational database being a part of every data solution are fading fast, as a new kind of data center becomes the new sheriff in town.</p>
<p><em>Image courtesy of&nbsp;<a href="http://www.shutterstock.com">Shutterstock</a>.</em></p>
                    ]]></description>
                <link>http://readwrite.com/2012/12/20/2013-cloud-trends-say-goodbye-to-the-traditional-data-center</link>
                <guid>http://readwrite.com/2012/12/20/2013-cloud-trends-say-goodbye-to-the-traditional-data-center</guid>
                <category>Predictions</category>
                <pubDate>Thu, 20 Dec 2012 06:00:00 -0800</pubDate>
                <author>Brian Proffitt</author>
            </item>
                    <item>
                <title><![CDATA[Looking For The Next $1 Billion Open Source Company]]></title>
                <description><![CDATA[
                                        <img src="http://readwrite.com/files/styles/800_450sc/public/fields/shutterstock_billiondollarcloud_0.jpg" />
                                        <p>It's been just under seven months since Red Hat became the world's first $1 billion open-source company. Now the question is who will follow suit and become the next open source company to hit this milestone?</p>
<p>Before sorting through the list of likely candidates, it's important to define what we're talking about when we say "$1 &nbsp;billion company." Primarily, we're talking actual annual revenue, not market valuation. That's a key difference, since under the valuation definition, <a title="" href="http://www.redhat.com">Red Hat</a> would not have been the first billion-dollar open source company; MySQL AB would have had that distinction, when Sun Microsystems bought it for $1 billion in 2008.</p>
<p>And, when we're talking open source, the idea is a company that bases its business on the sale and distribution of open-source software. Things get a bit tricky here, since many companies could be assigned the "open source" moniker - and some might argue against it. This is particularly true of companies that follow the so-called "open core model," which typically release a free version of the software as a "community" edition as well as a more feature-rich but closed edition that's available for a licensing fee or support subscription.</p>
<p>Red Hat is not open core, because even though it sells a "closed" enterprise version, Red Hat Enterprise Linux, and helps distribute a free version of the product, Fedora, the truth is that the source code for Red Hat Enterprise Linux is still freely available, so one can, if desired, use RHEL without support.</p>
<p>While it would be possible to reject open-core model companies, the fact is that this would sharply limit the field of potential candidates for the next $1 billion open source company, so I'm going to keep them in mind and note them as such.</p>
<p>Our three top candidates - all private companies - definitely vary in the kind of software they offer, but each one has an opportunity to be the next open-source company that crosses the billion-dollar line.</p>
<h1>The Database Candidate</h1>
<p>With all the hullaballoo about Big Data (see next candidate), it's important not to forget there are still a lot of fast and flexible offerings in database-land that still follow the old rules of managing data. Big data doesn't mean that relational databases are dead; in fact, such databases are more important then ever.</p>
<p><a title="" href="http://www.enterprisedb.com/">EnterpriseDB</a> is in the business of distributing commercial products built on one such open source database, <a title="" href="http://www.postgresql.org">PostgreSQL</a>, a direct competitor to the <a title="" href="http://www.mysql.com">MySQL</a> mentioned earlier that was snatched up by Sun and subsequently acquired by <a title="" href="http://www.oracle.com/index.html">Oracle</a>.<span class="embedded-Media-image img-caption-r">
				<img src="http://readwrite.com/files/fields/24740_EDB_logo_rgb_150_0.jpg" style="" />
			</span>
</p>
<p>On its own, PostgreSQL (or Postgres as geeks call it), is hot right now. The open source Postgres is getting a lot of new features and the pace of development is perceived as faster than its MySQL counterpart. Plus, there is a definite interest in Postgres driven by a movement away from MySQL. While MySQL is still very powerful software, there is a lot of tension in the marketplace about Oracle's stewardship of that project, since Oracle dropped the ball on several of the open source projects it acquired from Sun and many feel that eventually MySQL will suffer from the same problem, with Oracle either not pursuing development as much as it could or (as some fear) closing MySQL altogether.</p>
<p>While MySQL's loss is PostgreSQL's gain, Postgres is getting attention on its own merits. EnterpriseDB, which offers an enterprise version known as Postgres Plus, is making big inroads with customers like Sony Entertainment.</p>
<p>Data on a smaller scale will always be more prevalent than Big Data, which means Postgres and the vendors that package it have a nice, wide market into which to play. That gives EnterpriseDB a clear path to becoming the first $1 billion open source database company.</p>
<h1>The Big Data Candidate</h1>
<p>Nevertheless, you can't ignore the hype... Big data is a big driver for commerce right now, and any big data vendor has to be considered for billion-dollar revenue status.</p>
<p>The technology that's leading the big data charge is the open source data framework known as <a title="" href="http://hadoop.apache.org">Apache Hadoop</a>. Hadoop is not a database, but rather a distributed storage tool that enables programmers devise jobs in Java code to search for desired information - without formal database structure.</p>
<p><span class="embedded-Media-image img-caption-l">
				<img src="http://readwrite.com/files/partner-cloudera.png" style="" />
			</span>
Hadoop is not the only big data tech, but right now it's the top technology in the sector. And right now the biggest Hadoop company is <a title="" href="http://www.cloudera.com">Cloudera</a>.</p>
<p>Though <a title="" href="http://www.mapr.com">MapR</a> and <a title="" href="http://hortonworks.com">Hortonworks</a> are strong Hadoop contenders, Cloudera has managed to establish an early lead in this sector. Cloudera has developed its own tools to make it easier to seek information from Hadoop stores and is building a rich partner ecosystem to enable different applications of Hadoop-stored data.</p>
<p>If Hadoop continues its rocket-ship rise, it's not hard to envision Cloudera becoming the first $1 billion open source Big Data company.</p>
<h1>The Cloud Computing Candidate</h1>
<p>In the world of open source cloud computing, <a title="" href="http://www.openstack.org">OpenStack</a> is the name that gets the most attention, thanks to the combined marketing prowess of <a title="" href="http://www.rackspace.com">Rackspace</a>, <a title="" href="http://www.hp.com">HP</a>, <a title="" href="http://www.dell.com">Dell</a>&nbsp;and Red Hat, just to name a few.</p>
<p>But OpenStack is not a company or even a product - it's a project. Eventually, one of the OpenStack vendors may come up with a commercial product and run with it, but if its someone like Red Hat or HP, they've already made their mark. <a title="" href="http://www.pistoncloud.com">Piston Cloud</a> or <a title="" href="https://www.suse.com">SUSE</a> may also succeed as billion-dollar candidates, but right now the OpenStack system is a free-for-all and no one seems to have a commanding lead.</p>
<p><span class="embedded-Media-image img-caption-r">
				<img src="http://readwrite.com/files/eucalyptus-300x32.png" style="" />
			</span>
So I'm betting on an open source cloud player that's not even in the OpenStack ecosystem: <a title="" href="http://www.eucalyptus.com">Eucalyptus</a>.</p>
<p>Eucalyptus gets the nod because of OpenStack's lack of product and the fact the other open product in this space, <a title="" href="http://www.cloudstack.org">CloudStack</a>, has <a title="" href="http://www.citrix.com/lang/English/home.asp">Citrix</a> as its flagship commercial vendor and it's not really an open source company. Eucalyptus also gets marks for being very well connected to the <a title="" href="http://aws.amazon.com">Amazon Web Services</a> ecosystem. While you may argue that sticking to AWS frameworks is really not all that open (in fact, I have made that very argument), there's no denying that AWS is killing in this space and a lot of customers interested in private or hybrid cloud computing are very content to keep swimming in Amazon's pond.</p>
<p>But there is more to Eucalyptus than being in the right place at the right time. It has got a whip-smart team, led by Marten Mickos, who - in one of those "small world" things - was the CEO of MySQL AB, the same company Oracle now has in its portfolio and was <em>valued</em> at $1 billion upon acquisition.</p>
<p>With all of these stars aligned, Eucalyptus could soon be the first billion-dollar open source cloud computing company.</p>
<p>Sure, other candidates may be in the running for the next billion-dollar open source company. And there is always a chance that someone new, like <a title="" href="http://www.bigswitch.com">Big Switch Networks</a> and its virtual networking systems, could come out of nowhere and snatch the title.</p>
<p>But these three candidates, based on what I see right now in the open source world, are presently the most likely contenders for becoming the next $1 billion open source company.</p>
<p><em>Lead image courtesy of <a href="http://www.shutterstock.com">Shutterstock</a>.</em></p>
                    ]]></description>
                <link>http://readwrite.com/2012/10/23/looking-for-the-next-1-billion-open-source-company</link>
                <guid>http://readwrite.com/2012/10/23/looking-for-the-next-1-billion-open-source-company</guid>
                <category>Cloud Computing</category>
                <pubDate>Tue, 23 Oct 2012 05:00:00 -0700</pubDate>
                <author>Brian Proffitt</author>
            </item>
                    <item>
                <title><![CDATA[Graphing the Occupy Movement's Use of Social Networks]]></title>
                <description><![CDATA[
                                        <p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/occupy-150.png" style="" />
			</span>
Whether you think the protestors camping out in various city parks around the world is justified or not, it is interesting to see this <a href="http://www.technologyreview.com/blog/editors/27324/?p1=A1">analysis published in Technology Review today</a>. They used a tool from SocialFlow that examined a pile of Twitter data. Did you know the first use of their hashtag was in a July 13 Adbusters blog post? </p>
<p><a href="http://www.readwriteweb.com/occupy-after.jpg"><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/assets_c/2011/11/occupy-after-thumb-610x432-35761.jpg" style="" />
			</span>
</a><br />
(Click to enlarge the screen. There is a "before" picture posted on TechReview's site too.)</p>

<p>Of course, by October things were a bit different in the Twitterverse, as on the streets of New York and elsewhere. The above screen shot shows centers of influence from Huffington Post and Keith Olbermann, with smaller nodes from the New York Post, CNN and the Associated Press. Where is the coverage by the Grey Lady? Too small to see with my eyes. That in itself shows you how the message was spread and who was relevant and who wasn't. Fascinating. <br />
</p>
                    ]]></description>
                <link>http://readwrite.com/2011/11/09/graphing_the_occupy_movements_use_of_social_networ</link>
                <guid>http://readwrite.com/2011/11/09/graphing_the_occupy_movements_use_of_social_networ</guid>
                <category>Analysis</category>
                <pubDate>Wed, 09 Nov 2011 04:30:00 -0800</pubDate>
                <author>David Strom</author>
            </item>
                    <item>
                <title><![CDATA[Take A Look At the Geeky Goodness Cooking Up At the MIT Media Lab]]></title>
                <description><![CDATA[
                                        <p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/mit_mood_meter.jpg" style="" />
			</span>
</p>

<p>The Massachusetts Institute of Technology may be the birthplace of the American geek. Within MIT, its Media Lab drills down to the heart of the next wave of technology from creating buildings with 3D printing to prosthetic limbs to gesture-based user interfaces. For instance, the MIT Media Lab was where the idea for the technology seen in the movie Minority Report originated.</p>

<p>The unofficial motto of the MIT Media Lab is "demo or die." It is akin to the classic academic model of "publish or perish," except that students and faculty at the Media Lab are encouraged to actually create the products they are thinking up, as opposed to pontificating upon them in research papers. See below to check out some of the amazing waves of technology that will be bursting out of the Media Lab in the future. </p>
<h2>Interactive Robots & Holographic Imaging</h2>
The phrase "these are not the droids you are looking for" keeps passing through my brain when I think of the robots that are being created at the Media Lab. Two prominent examples are MDS prototypes - mobile, dexterous and social. See the picture of the two robots below. They can move about, have hands that can interact with the world around them and show a range of emotion by being social. 

<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/mit_robots.jpg" style="" />
			</span>
</p>

<p>There is a little orange robot called Combusto. It can show emotion and has a wide array of movement. Think Wall-E with fur. Combusto is powered with an Android smartphone that has a unique chipboard in it that helps determines Combusto's functions.</p>

<p>Right next to the robot lab, there is a team working on holographic 3D imaging. Another line from Star Wars comes to mind: "Help me Obi-Wan Kenobi, you are my only hope." One of the goals for the holographic imaging team would be to make it feasible to make those Star Wars-style holographs a reality. </p>

<p>Throughout the Media Lab, students, researchers and faculty are using the Microsoft Kinect for gesture-based moving and tracking. The Kinect API is one of the most innovative consumer technologies to hit the market in a long time and its affordability makes for perfect use in research settings.</p>

<h2>Tangible Media</h2>

<p>If you put the ideas of holographic images together with the notion of tangible media, you get the computer that Tom Cruise uses in Minority Report. The company that is trying to make that vision a reality is called Oblong, and it originated at the Media Lab. </p>

<p>Tangible media is the idea of being able to manipulate a user interface either through gesture or interacting with a physical object connected through the Internet. This can also be done through Kinect, although the department also has a series of infrared cameras that can track movement and intention with more precision. The cameras are used to localize an item (say, a person) in a 3D space. </p>

<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/mit_tangible_media.jpg" style="" />
			</span>
</p>

<p>Tangible media is described as where the input is also the output. Say I have a gesture-based camera watching my movements. If I am using building blocks in physical space, those same building blocks should move in the digital space as well. </p>

<p>Capacitive touch, which is perhaps the biggest breakthrough in the smartphone revolution, also has origins with projects in the media labs. Indirectly, the iPhone and touch-based devices have roots at MIT.</p>

<h2>Robotic Opera, 3D Printing & More</h2>

<p>It is hard to keep track of everything that is happening at the MIT Media Lab without being an insider. There are teams working on prosthetic limbs that reduce the impact of missing legs and let a person move around normally. There is a storytelling center looking at new ways to present digital media. There is a camera center that is working on how best to utilize the powerful cameras now attached to every person's hip via cellphone. There are teams working on creating different densities of material coming through a 3D printer that can make the building blocks of structures with concrete. The innovators behind Rock Band came from the MIT Media Lab. These days, the music group is working on robotic operas. </p>

<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/mit_3d_printing.jpg" style="" />
			</span>
</p>

<p>The MIT Media Lab is majority funded by sponsors or "members" that are companies working on specific functions. For instance, DirecTV helps fund some of the interactive television projects that are being worked on. The building is designed to be an open center of collaboration where everybody can see what the others are working on. The idea is to get designers and scientists together to create technology that will have an impact on the world. </p>
                    ]]></description>
                <link>http://readwrite.com/2011/10/18/take_a_look_at_the_geeky_goodness_cooking_up_at_th</link>
                <guid>http://readwrite.com/2011/10/18/take_a_look_at_the_geeky_goodness_cooking_up_at_th</guid>
                <category>Features</category>
                <pubDate>Tue, 18 Oct 2011 05:30:00 -0700</pubDate>
                <author>Dan Rowinski</author>
            </item>
                    <item>
                <title><![CDATA[New Efforts to Help the Virtual Botanist ]]></title>
                <description><![CDATA[
                                        <p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/botany2011.jpg" style="" />
			</span>
At their annual conference this week in St. Louis, an international group of botanists are working on two efforts to integrate the Web into their efforts. Called the <a href="http://usvirtualherbarium.org/">US Virtual Herbarium</a> and the <a href="http://www.opensciencenetwork.net">Open Science Network for Ethnobiology</a>, both are trying to make education and use of plant materials easier for scientists around the globe. </p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/virtualherbarium.jpg" style="" />
			</span>
<i>Map source: Ben Legler, University of Washington and Derick Poindexter, Appalachian State University</I></p>

<p>The two are completely different efforts, but the idea is to extend botanical knowledge through various Web-based materials and allow scientists and students to collaborate online. A herbarium is typically a huge collection of dried plants maintained by large universities and botanical gardens - two of the largest are found at the Bronx, NY and St. Louis, Missouri. To give you an idea of the size of these things, the St. Louis collection contains more than 5 million plants and is housed in several buildings. There are several hundred others located all over the country as you can see by the map above. </p>

<p>Led by Mary Barkworth, a professor at the Utah State University, the project aims to coordinate all of these heretofore independent herbaria efforts around the country. The virtual edition has just gotten started, and the researchers are figuring out common data markups and trying to catalog the individual plant species that each institution has on file, which Barkworth estimates is in the several millions. Part of the problem is that the plant collections go offline, as universities change their funding and science emphasis, and are either tossed aside or dispersed to interested collectors. So, tracking these herbaria down isn't a simple task. Another problem is that these collections aren't static: researchers are continually adding to their collections. </p>

<p>Eventually, the entire catalog will be imaged and placed on the public Internet so that researchers from anywhere can find a particular leaf, fern or seed. Right now there is the <a href="http://sciweb.nybg.org/science2/IndexHerbariorum.asp">Index Herbarium maintained by the Bronx NY Botanical Garden here</a>, but it is more a collection of pointers to institutions, rather than a database of the actual plant materials itself.</p>

<p>The Open Science network is an entirely different effort, but also presented its progress at the biology conference this week. Here more than 40 academic institutions are trying to help science educators improve their curricula with all sorts of virtual tours: you can take a recorded video walk in the woods with a noted bio prof and hear him provide commentary and descriptions of plant life, for example. It aims to make science more hands-on and interactive and engaging. There are links to lesson plans, lectures, and summer field courses that students can take. <br />
</p>
                    ]]></description>
                <link>http://readwrite.com/2011/07/11/new_efforts_to_help_the_virtual_botanist</link>
                <guid>http://readwrite.com/2011/07/11/new_efforts_to_help_the_virtual_botanist</guid>
                <category>Analysis</category>
                <pubDate>Mon, 11 Jul 2011 21:00:00 -0700</pubDate>
                <author>David Strom</author>
            </item>
                    <item>
                <title><![CDATA[Whatever Happened To... Google Base]]></title>
                <description><![CDATA[
                                        <p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/google_lego2.jpg" style="" />
			</span>
<em>The dream of creating a structured database of the world's content has eluded Google's grasp. But at least it got a bunch of useful retail data out of it.</em></p>
<p>The Web world is nothing if not Darwinian: it's survival of the fittest and products need to evolve with the times. Some Web products fly and some don't. Those that don't fly either die out, or evolve into something new. The latter is what happened to <a href="http://base.google.com/">Google Base</a>, which in 2011 is a shadow of its former self - and is even about to lose its API. It did however spin off a more successful offering, in the form of the <a href="http://www.google.com/merchants/default">Google Merchant Center</a> for retailers. In this post, we look back on the initial vision for Google Base and then analyze what it actually evolved into.</p>

<h2>How Google Base Started Out in 2005</h2>
<p>For those of you whose memories don't go back as far as Web 2.0, Google Base is an online database that <a href="http://www.readwriteweb.com/archives/web_20_weekly_w_43.php">launched in November 2005</a>.  At that time it enabled you to upload any type of information, along with &quot;attributes&quot; that described it further. Early examples of Google Base content included reviews, events, products and jobs. Some - but far from all - of the data was then surfaced in Google properties like search, Froogle and Local.</p>
<!--start:nonyt--><p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/google_base_2006.jpg" style="" />
			</span>
<br />
<em>A snapshot of Google Base homepage, circa January 2006 (source: <a href="http://web.archive.org/web/20060125085445/http://base.google.com/">Wayback Machine</a>)</em></p><!--end:nonyt-->
<p>Despite a clunky user interface and initial mixed reviews, Google Base was a promising product in late 2005 and early 2006 because of its potential to <strong>better structure Web content</strong>. At the time Google appeared to be aiming to use the product to improve its main search engine, but also to create other Google properties in verticals like e-commerce and classified ads. So there was a lot of talk at the time about it challenging eBay and Craigslist. </p>
<p>This is how I summarized Google Base back in <a href="http://www.zdnet.com/blog/web2explorer/will-google-base-be-the-worlds-largest-xml-database/61">November 2005</a>:</p>
<blockquote>
  <p>Google Base may not be pretty to look at and it may be a centralized database, but the potential is there to turn it into a hugely valuable directory of structured content. Plus if they add APIs and start aggregating outside RSS feeds, then they could easily extend Google Base and remove the issues around it being a 'walled garden'.</p>
</blockquote>
<p>In about April 2006, Google began to <a href="http://www.readwriteweb.com/archives/google_base_beg.php">expand the number of vertical niches</a> it covered in Google Base. The product was expanding, but how good was the data it was receiving and was there any sign of it being productively used?</p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/GoogleBase_search.jpg" style="" />
			</span>
</p>
<h2>Google Base in 2011 &amp; its More Successful Spin-off</h2>
<p>Fast forward five years to April 2011 and <a href="http://base.google.com/">Google Base</a> still exists (and incidentally still labeled a &quot;beta&quot; product!), but it's been scaled back in two big ways. </p>
<p>Firstly, commercial product data has been separated out into a new Google property called <a href="http://www.google.com/merchants/default">Google Merchant Center</a>, which <a href="http://googlebase.blogspot.com/2009/09/introducing-google-merchant-center.html">launched in September 2009</a>. Google Base is now restricted to &quot;non-product data.&quot; Data in the Merchant Center is being used in <a href="http://www.google.com/commercesearch/#utm_campaign=3.0launch&amp;utm_source=en-na-us-ogblog-3.0launch&amp;utm_medium=blog">Google Commerce Search</a>, the third generation of which was <a href="http://googleblog.blogspot.com/2011/03/google-commerce-search-30-you-wont.html">announced last week</a>. As the name suggests, Google Commerce Search is a search engine for retail - a market which Google obviously sees huge potential in.</p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/google_base_2011.jpg" style="" />
			</span>
</p>
<p>The  other big difference between the 2011 Google Base and its 2005/2006 version: there's no Web frontend for people to search the data. Whether Google ever even wanted to make Google Base into a long-term frontend product is debatable, although what's not debatable is that it was a user interface mess from the start.</p>
<h2>What Happened to the Structured Content Dream?</h2>
<p>Looking at how Google Base evolved, one has to ask: was retail data the only valuable thing that came out of Google Base? Did the grand dream of all types of structured content never pan out?</p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/google_base_2011b.jpg" style="" />
			</span>
The main problem with structured content has always been that you  can't rely on people to upload reliable data for a variety of reasons ranging from motivation to accuracy. As I wisecracked back in 2005, &quot;Google could in fact be building the world's largest database of structured shite.&quot;</p>
<p>That's not to say that structured content isn't alive and well. But nowadays the main initiative around this is the W3C's <a href="http://www.readwriteweb.com/archives/the_state_of_linked_data_in_2010.php">Linked Data</a>, which is getting good uptake from governments and commercial companies alike. </p>
<p>Google is obviously still innovating with new types of data, structured and otherwise, but it no longer views Google Base as a good source of that data. Or at least, not good enough to actively foster - because it's about to deprecate the <a href="http://code.google.com/apis/base/">Google Base API</a>.</p>
<h2>Google Base to Lose its API </h2>
<p> In December last year, <a href="http://googlemerchantblog.blogspot.com/2010/12/new-shopping-apis-and-deprecation-of.html">Google announced</a> that it is deprecating the Google Base Data API and replacing it with two new APIs: the Content API for Shopping and the Search API for Shopping. The Google Base Data API will be &quot;fully retired&quot; on June 1, 2011. </p>
<p>What this means for Google Base is unclear, but it's obvious that Google is far more interested in developing the shopping spin-off, Merchant Center. It appears to be giving up on actively developing its database for other types of structured content. In <a href="http://code.google.com/apis/base/faq_depr.html">an FAQ</a> about the API deprecation, Google stated:</p>
<blockquote>
  <p>&quot;The new Shopping APIs won't support use cases that require uploading & searching data for non-shopping applications like Real Estate, Events, Jobs, Activities and other custom structured content applications.&quot;</p>
</blockquote>
<p>While Google Base lives on, it's a much diluted product. Retail data was clearly the only valuable user-generated content that Google got out of it. Let us know in the comments whether you've used Google Base, in the past or now. What are your thoughts on its evolution and future?</p>
<!--start:nonyt--><p><em>Photo credit: <a href="http://en.wikipedia.org/wiki/File:GoogleBase.PNG">Wikipedia</a></em></p><!--end:nonyt-->
                    ]]></description>
                <link>http://readwrite.com/2011/04/05/whatever_happened_to_google_base</link>
                <guid>http://readwrite.com/2011/04/05/whatever_happened_to_google_base</guid>
                <category>Google</category>
                <pubDate>Tue, 05 Apr 2011 15:00:33 -0700</pubDate>
                <author>Richard MacManus</author>
            </item>
                    <item>
                <title><![CDATA[Top 10 Semantic Web Products of 2010]]></title>
                <description><![CDATA[
                                        <p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/Best_of_2010.png" style="" />
			</span>
Every year ReadWriteWeb selects the top 10 products or developments across a range of categories. We kick off the 2010 'Best Of' series with our selection of the top 10 Semantic Web products and implementations of the year.</p>
<p>This year we've chosen 5 products by semantically charged startups and 5 implementations by large organizations. The startups represent the cutting edge of Semantic Web. Each has made an impact on the Internet this year, with user growth and innovation. The organizations we've selected - which include Facebook, Google and the BBC - offered the best examples of large scale deployment of semantic technology.</p>
<p>&nbsp;</p>
<div class="super-pullquote">
<h3>ReadWriteWeb's 2010 In Review:</h3>
<ul>
<li><a href="http://www.readwriteweb.com/archives/top_10_semantic_web_products_of_2010.php">Top 10 Semantic Web Products of 2010</a></li>
<li><a href="http://www.readwriteweb.com/archives/top_10_rss_and_syndication_technologies_of_2010.php">Top 10 RSS and Syndication Technologies of 2010</a></li>
<li><a href="http://www.readwriteweb.com/archives/best_bigco_of_2011_facebook.php">Best BigCo of 2010: Facebook</a></li>
<li><a href="http://www.readwriteweb.com/archives/app_stores_top_trends_of_2010.php">Top Trends of 2010: App Stores</a></li>
<li><a href="http://www.readwriteweb.com/archives/most_promising_company_for_2011_simplegeo.php">Most Promising Company For 2011: SimpleGeo</a></li>
<li><a href="http://www.readwriteweb.com/archives/internet_tv_top_trends_of_2010.php">Top Trends of 2010: Internet TV</a></li>
<li><a href="http://www.readwriteweb.com/archives/top_10_startups_of_2010.php">Top 10 Startups of 2010</a></li>
<li><a href="http://www.readwriteweb.com/archives/privacy_top_trends_of_2010.php">Top Trends of 2010: Privacy</a></li>
</ul>
</div>
<!--end:nonyt-->
<p><strong>A note on terminology:</strong>we are using 'Semantic Web' and 'Semantic technology' somewhat interchangeably, although many people believe that the term Semantic Web (upper case) should only be applied to W3C-approved technologies such as RDF and SPARQL. The fact is that a good portion of our top 10 use technologies that are either not approved by the W3C (the Web's governing body, led by Sir Tim Berners-Lee), or they've been tweaked in some way - for example, Facebook's use of RDFa. So we've chosen to use the term 'Semantic Web' in its broader, more inclusive, sense. In a nutshell, these are products that add meaning and context to data.</p>
<p>&nbsp;</p>
<p>Here then is our list of the top 10 Semantic Web products or implementations of 2010 (in no particular order).</p>
<h2>Freebase</h2>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/googlemetaweb_jul10.jpg" style="" />
			</span>
In July <a href="http://www.readwriteweb.com/archives/google_buys_semantic_web_database_metaweb.php">Google acquired</a> one of the leading Semantic Web companies, <a href="http://metaweb.com/">Metaweb</a>. Metaweb runs <a href="http://freebase.com/">Freebase</a>, an open, semantically marked up database of information. It looks similar to Wikipedia, but Freebase is all about structured data and what you can do with it.</p>
<p>Google already had a relationship with Freebase, pulling in its information to <a href="http://www.google.com/support/news/bin/answer.py?answer=144273">provide intelligent search results</a> within Google News. With the acquisition of Metaweb, Google can now leverage the company's tools and data even further, especially within basic Web search results.</p>
<p>Freebase was one of our <a href="http://www.readwriteweb.com/archives/top_10_semantic_web_products_of_2009p2.php">top 10 Semantic Web products last year</a> and being acquired by Google validates its potential.</p>
<h2>GetGlue</h2>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/madmen_150.jpg" style="" />
			</span>
This year was <a href="http://www.readwriteweb.com/archives/how_getglue_taps_into_our_emotions.php">a turning point</a> for <a href="http://getglue.com/">GetGlue</a>, the service where users "check in" to watching TV shows, reading books, listening to music and more. Last November, GetGlue changed its branding and <a href="http://www.readwriteweb.com/archives/getgluecom_distributed_networking_recommendation.php">launched a new website</a>. It changed almost overnight from a geeky browser add-on called Blue Organizer to a destination website called GetGlue. Mobile applications followed soon after, enabling its users to interact with GetGlue while watching TV or at an entertainment venue.</p>
<p><a href="http://www.readwriteweb.com/archives/great_scott_how_inventive_companies_like_getglue_b.php">The changes</a> have been good for GetGlue. It's experienced strong growth this year, reaching over 600,000 users by the end of September.</p>
<p><em>Disclosure: GetGlue's founder and CEO, Alex Iskold, used to be a regular contributer to RWW.</em></p>
<h2>Flipboard</h2>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/flipboard_logo_NEW.png" style="" />
			</span>
The launch of the iPad in 2010 triggered a new round of innovation in the startup community. Few startups utilized the touchscreen UI to create a unique user experience more than <a href="http://flipboard.com">Flipboard</a>, a magazine reading application <a href="http://www.readwriteweb.com/archives/how_flipboard_was_created_its_plans_beyond_ipad.php">built specifically for the iPad</a>.</p>
<p>It turns out that Flipboard isn't just a pretty face, it's also using Semantic technologies.</p>
<p>In July, <a href="http://www.readwriteweb.com/archives/flipboard_new_social_ipad_magazine_will_be_powered_by_semantic_data.php">Flipboard acquired semantic technology startup Ellerdale</a>, whose intelligent data-parsing algorithms had previously been used to create a real-time search engine and trends tracker. Ellerdale's technology was used by Flipboard to design a more personalized real-time experience - determining what social updates are important <em>to you </em>and presenting them in its now familiar magazine-like format.</p>
<h2>Hunch</h2>
<p><a href="http://hunch.com/">Hunch</a> started out as a Q&amp;A service, but in August it re-positioned as <a href="http://www.readwriteweb.com/archives/hunch_internet_personalization_service.php">a personalization service</a>. It's a recommendation engine that shows you movies you want to see, books you want to read, vacation destinations you want to go to, and much more. The company is on <a href="http://www.readwriteweb.com/archives/mapping_people_to_products_hunch_getglue.php">a mission</a> to "map every person on the Internet to every object on the Internet, be that a product, a service, or a person."</p>
<p>Co-founder <a href="http://www.readwriteweb.com/archives/how_hunch_went_from_qa_to_guessing_your_preference.php">Caterina Fake told us in October</a> that Hunch uses a decision tree model, as an alternative to search, to provide more personalized information to users.</p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/hunch_movies.jpg" style="" />
			</span>
</p>
<h2>Apture</h2>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/apture_logo_feb09.jpg" style="" />
			</span>
<a href="http://apture.com">Apture</a> is a semantic contextual search service which <a href="http://www.readwriteweb.com/archives/apture_for_your_browser_like_augmented_reality_for.php">continues to iterate strongly</a> (it made our top 10 list last year, too). In August, Apture launched <a href="http://apture.com/extension/">Apture Highlights</a>, a plug-in that allows you to dive deep into any topic you discover on almost any page around the web.</p>
<p>When we first noticed Apture several years ago, it was a service that required publishers to load up linked pop-up widgets with multimedia of their own choosing. The company removed that barrier to entry with its August release. Everything is now automated and it's available almost everywhere. Indeed we liked it so much, we started using Apture on ReadWriteWeb (there is no commercial relationship, we just think the product adds to our site's user experience).</p>
<!--start:nonyt-->
<!-- <p><em><strong>Next Page: <a href="http://www.readwriteweb.com/archives/top_10_semantic_web_products_of_2010p2.php">Top 5 big organization implementations of Semantic Web technology</a></strong>. Featuring Facebook's Open Graph, Google's semantic search, and more...</em></p> -->
<!--end:nonyt-->
<p>&nbsp;</p>
<!--nextpage-->
<h2>Facebook</h2>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/facebook_opengraph_150.gif" style="" />
			</span>
Arguably the biggest Semantic Web news of the year came in April, when Facebook announced a large-scale new platform called the <a href="http://developers.facebook.com/docs/opengraph">Open Graph</a>. The stated goal of the Open Graph protocol was to enable publishers to "integrate [their] Web pages into the social graph." Essentially, each web page can now become an 'object' in Facebook's <a href="http://www.readwriteweb.com/archives/social_graph_concepts_and_issues.php">social graph</a> (which is Facebook's term for how people connect to each other in its network). This means that pages can be referenced and connected across social network user profiles, blog posts, search results, Facebook's News Feed, and more.</p>
<p>The Open Graph is a wide-ranging platform which includes features such as 'Like' buttons and publisher plug-ins. It also includes a simple, RDF-based markup. This requires publishers to include at least 4 metadata properties in each object: title, type, image, URL. There are a few additional properties which may be optionally added, such as site_name and description.</p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/fb_rww_properties2.gif" style="" />
			</span>
</p>
<p><em>See also: <a href="http://www.readwriteweb.com/archives/facebook_open_graph_the_definitive_guide_for_publishers_users_and_competitors.php">Facebook Open Graph: The Definitive Guide For Publishers, Users and Competitors</a></em></p>
<h2>Google Squared</h2>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/GoogleSquaredLogo.jpg" style="" />
			</span>
The holy grail in web search technology is to be able to ask a simple question, in natural language, and get a simple answer. In May, Google <a href="http://googleblog.blogspot.com/2010/05/understanding-web-to-find-short-answers.html">announced that Google Squared</a> was coming to its search results. Google Squared, which <a href="http://googleblog.blogspot.com/2009/05/more-search-options-and-other-updates.html">launched</a> in 2009, adds additional information to search results.</p>
<p>The functionality was added to Google's traditional search results in two ways. Firstly, simple queries such as <a href="http://www.google.com/search?q=catherine+zeta-jones+date+of+birth">Catherine Zeta-Jones' date of birth</a> elicited useful data within the search results:</p>
<center><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/squared-example-result.png" style="" />
			</span>
</center>
<p>By clicking "show sources" on the Squared-provided result, a list of sources appears showing you how Google arrived at this answer.</p>
<p>Secondly, Google Squared is being used to provide a new feature in Google's sidebar (another innovation by the search giant in 2010): "Something different". This feature provides a list of related searches that may be of interest, determined by looking at your current search term.</p>
<p>This year Google also <a href="http://www.readwriteweb.com/archives/google_semantic_web_push_rich_snippets_usage_grow.php">reported strong growth in its Rich Snippets feature</a>, which adds extra information to Google search results too - in this case, data like review ratings.</p>
<h2>Best Buy</h2>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/bestbuy_logo.jpg" style="" />
			</span>
One of the themes of 2010 was the increasing usage of <a href="http://www.readwriteweb.com/archives/w3c_pleased_with_semantic_web_adoption.php">Semantic Web technologies by large commercial companies</a> like Facebook and Google. Leading U.S. retailer, <a href="http://www.bestbuy.com/">Best Buy</a>, was another large company to impress in 2010 with its adoption of semantic technologies. Specifically, Best Buy used a Semantic Web markup language called RDFa to add semantics to its webpages.</p>
<p><a href="http://jay.beweep.com/">Jay Myers</a>, Lead Web Development Engineer at BestBuy.com, <a href="http://www.readwriteweb.com/archives/how_best_buy_is_using_the_semantic_web.php">told ReadWriteWeb in an interview</a> earlier this year that the primary goal of using semantic technologies was to increase the visibility of Best Buy's products and services. With data such as store name, address, store hours and GEO data being marked up using RDFa, search engines are now able to identify each of those data components more easily and put them into context. The use of semantic technology, Myers told us, led to increased traffic and better service to its customers.</p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/best_buy_rdfa.jpg" style="" />
			</span>
</p>
<h2>Data.gov.uk</h2>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/20100120-th56xif1uqf1g8xia23gk3mehb.jpg" style="" />
			</span>
In January, <a href="http://data.gov.uk">Data.gov.uk</a> launched to make non-personal data held by the U.K. government available for software developers. It arrived <a href="http://www.readwriteweb.com/archives/datagov_finally_launches_looks_nice_but_short_on_d.php">six months after the U.S. government launched its Data.gov site</a>, but from the start the U.K. site had more than three times as much data. At launch, Data.gov.uk had nearly 3,000 data sets available for developers to build mashups with. By the end of the year, that had increased to over 4,600.</p>
<p>Data.gov.uk was one of the highlights of the year in Linked Data, which is when organizations or governments upload data to the Web in a format enabling it to be re-used and built on. Linked Data is a subset of the wider Semantic Web movement.</p>
<p><em>See also: <a href="http://www.readwriteweb.com/archives/the_state_of_linked_data_in_2010.php">The State of Linked Data in 2010</a></em></p>
<h2>BBC World Cup Website</h2>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/worldcup_final10.jpg" style="" />
			</span>
The biggest sporting event of the year was the soccer World Cup, which was widely covered in the media. The <a href="http://news.bbc.co.uk/sport2/hi/football/world_cup_2010/default.stm">BBC World Cup 2010 website</a> used "dynamic semantic publishing" technology to enhance its daily World Cup reporting.</p>
<p>The site featured over 700 webpages and was powered by a semantic publishing framework. It boasted a comprehensive ontology (a map of concepts), that output "automated metadata-driven web pages" created on-the-fly. It was an impressive demonstration of how a large, mainstream website can add meaning and structure.</p>
<p>There you have it, ReadWriteWeb's selection of the top 10 Semantic Web products and implementations of 2010! Let us know in the comments whether you agree or not with our top 10.</p>
                    ]]></description>
                <link>http://readwrite.com/2010/12/29/top_10_semantic_web_products_of_2010</link>
                <guid>http://readwrite.com/2010/12/29/top_10_semantic_web_products_of_2010</guid>
                <category>2010 in Review</category>
                <pubDate>Wed, 29 Dec 2010 07:17:00 -0800</pubDate>
                <author>Richard MacManus</author>
            </item>
                    <item>
                <title><![CDATA[Internet Founder Tim Berners-Lee Details 4 Concerns About Future of Mobile Web (Nokia World 2010)]]></title>
                <description><![CDATA[
                                        <p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/4149/4992849630_ff600cdfe0_m.jpg" style="" />
			</span>
This morning at Nokia World 2010 in London, Sir Tim Berners-Lee, widely known as the inventor of the Web, addressed the audience in a keynote speech where he spoke about the future of mobile technology, including both the positive impacts it brings as well as the areas of concern. After encouraging developers to build for the Web, so as to deliver applications that work on all types of devices, even the ones that haven't been invented yet, he then proceeded to detail areas which need addressing, specifically privacy, accountability, network neutrality and the 80% of the world that doesn't have access to the World Wide Web. </p>
<h2>The Mobile Web Today: Location is Just "Tip of the Iceberg"</h2>

<p>Berners-Lee began his keynote by discussing the improvements we've seen in technology in recent years, most notably the ability of our devices to be location-aware. However, he says, "location-awareness is just the tip of the iceberg." Devices already know so much about you: your geographical position, which way is up, which direction you're headed, etc., but future devices may know more than this. For example, they may know about your medical information and your physical state. Perhaps they could tell when you're excited by measuring heart rate increases, he said. </p>

<p>Another major improvement which is impacting the Web is the explosion of data available online, a project which he has heavily contributed to, here in the U.K. with data.gov.uk and its U.S. counterpart data.gov. Not so long ago, less than 10 years ago, in fact, "data" on the Web consisted of governments uploading a scanned document, like a spreadsheet that would be posted as a PDF on a government-hosted website. If anyone ever wanted to do anything with that data, they would have to re-type the information. Today, that same type of data is more accessible - the raw data itself is available and, says Berners-Lee, there's a race between governments and other organizations as to who can provide the best and most interesting data. </p>

<p>As for how data access relates to mobile, Berners-Lee explained that data drives development on mobile, just as it does on the Web as a whole. Even a basic calendaring type of application is data-driven. By combining the aspects of mobile technology, like location-awareness, with the semantic Web of data, entirely new types of mobile applications can exist. Most recently, augmented reality applications are an example of this pairing, tying together location with data to identify points of interest just by aiming your mobile phone at something like a landmark or building. </p>

<h2>4 Concerns about the Future of Mobile</h2>

<p>All that being said, Berners-Lee also mentioned that there are concerns which we need to be aware of when moving forward with mobile technology. They are as follows:</p>

<h2>1. Privacy</h2>

<p>The challenge of privacy is one many companies, both mobile and otherwise, have been dealing with in recent months. However, on mobile phones, the problem that has not been worked out yet is how to allow a user to share their location while still making it easy for them to understand when they're sharing critical information, how much control they have over that information and who can access that data. The challenge here is how to do all this without getting in the way of the user's experience. </p>

<p>The solution, says Berners-Lee, is that we may need to re-adjust our ideas about privacy. "I think that we'll end up having to think about privacy from a different point of view," he said. </p>

<h2>2. Accountability</h2> 

<p>Along the same lines of user privacy, is the idea that companies that want access to our critical information have a responsibility to build systems that respect that data. "Responsible" companies that are accountable for how they use our data are key. Clearly, this is a struggle many companies are dealing with now, and no one has a winning formula yet.  </p>

<h2>3. Neutrality</h2>

<p>A perennial favorite topic for Berners-Lee is the idea of network neutrality, referring to regulations that forbid prioritizing the speed or access with which one company's data is available over another's. Companies that want you to use their services have an incentive to end neutrality for their own benefits - for example, those that provide voice services may want to slow down access to VoIP services. </p>

<p>Here Berners-Lee was the most passionate, saying point-blank that "the moment you let neutrality go, you lose the Web as it is - you lose the idea that you can click a link and go anywhere." </p>

<h2>4. Bringing Web Access to the Rest of the World</h2>

<p>The last point also involved a project in which Berners-Lee is involved: providing Web access to the 80% of the world that doesn't go online. He works on this issue through the foundation at webfoundation.org, which examines the challenges in this area. Surprisingly, lack of signal with which to log onto the Web is not the main thing holding back the spread of the Web. 80% of the world has <em>access </em>to the Web, but, for some reason, chooses not to use it. </p>

<p>The cost of data is partially to blame in many cases for this, and for those who cannot afford data plans through their carriers, they're limited to SMS for sharing information. But SMS is very constraining, says Berners-Lee. What's needed instead are better, more low-cost data plans for mobile phones. Carriers should want to offer these plans because once people get a taste of what a data plan can provide, they're potential customers for an upgrade to a more expensive plan that offers even more data and would generate more revenue for carriers.</p>

<p>Affordability of Web access is an area which Nokia thinks about when building their technologies. For example, Nokia's Ovi Maps service uses compression so as not to need data access when zooming in and out, unlike competing service from Google Maps. Nokia's messaging services also compress data and as, Mary McDowell, Nokia's EVP of Mobile Phones, mentioned in the keynote speech following Berners-Lee's, Nokia's recent acquisition of mobile Internet company Novarra was primarily for access to its proxy-browsing browser technology, which saves on clicks, while also providing faster and more efficient access to the Web. This is an important technology for emerging markets where data plans are pricey, but needed. </p>

<p><em><p>Disclosure: Nokia paid for this reporter's travel and accommodations to Nokia World 2010. </p></em></p>
                    ]]></description>
                <link>http://readwrite.com/2010/09/14/internet_founder_tim_berners-lee_details_4_concerns_about_future_of_mobile</link>
                <guid>http://readwrite.com/2010/09/14/internet_founder_tim_berners-lee_details_4_concerns_about_future_of_mobile</guid>
                <category>Location</category>
                <pubDate>Tue, 14 Sep 2010 20:12:04 -0700</pubDate>
                <author>Sarah Perez</author>
            </item>
                    <item>
                <title><![CDATA[PARC Releases New Semantic Technology (in Form of an Outlook Plugin)]]></title>
                <description><![CDATA[
                                        <p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/meshin_logo.png" style="" />
			</span>
The Palo Alto Research Center is releasing new semantic technology, based on Xerox PARC IP, in the form of an Outlook plugin called <a href="http://www.meshin.com/">Meshin</a>. At first glance, Meshin looks like the ugly stepsister to a similar Outlook tool called <a href="http://www.xobni.com">Xobni</a>, as it also loads into an email sidebar window, displaying sections dedicated to recent conversations and a summary of attachments shared back and forth via email, among other things. But what makes Meshin different is the engine powering it underneath: a semantic technology that uses "natural language processing" to understand entities, how they connect and what they mean.</p>

<p><em><strong>Invites available! Click through for link.</em></strong></p>


<p>The engineers freely admit that Meshin's user interface (UI) is currently the Achilles' Heel of the app. It's nowhere near as polished and put together as competitor Xobni's, for example. But they'll fix that, they promise. "We're hiring a UI designer," they tell us. </p>

<p>Focusing on the looks, though, is missing the big picture. <a href="http://www.meshin.com/">Meshin</a> is different from other email-based contact management systems including not only Xobni, but analysis engines like <a href="http://www.gist.com">Gist</a>, too. Where those companies hinge on the person - <em>here's their title, where they work, their emails, attachments, their blog posts, their last Twitter update, etc.</em> - Meshin actually analyzes the information found in the information streams it examines. It then extracts related conversations, related messages, related people and other semantically understood data. And it does so by looking beyond keywords. It knows what things <em>mean. </em>It knows if a word is referring to a person, place or thing. It can also surface related links and news from the Web for any given entity. </p>

<em><p>Read More about <a href="http://www.readwriteweb.com/fastsearch?search=xobni&x=0&y=0">Xobni</a> and <a href="http://www.readwriteweb.com/fastsearch?search=gist&x=0&y=0">Gist</a>. </p>
</em>

<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/meshin_related_messages.png" style="" />
			</span>
</p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/meshin_web_links.png" style="" />
			</span>
</p>


<p>Already, the engine behind Meshin isn't limited to email messages alone. For example, if you subscribe to RSS feeds within Outlook, those are also understood as being a part of the relationship map with another person. If you subscribe to Twitter feeds within Outlook, again, those are analyzed, along with the other streams. </p>

<p>Meshin arose from a Xerox-funded project inside PARC whose goal is to commercialize older PARC IP for a broader audience. The project has been in development for only a year, with a small core team and support from PARC researchers. The long-term goal for Meshin is to extend itself beyond Outlook, in order to connect other types of information streams together.&#160; </p>

<p>The researchers are contemplating where they should take the technology next - another email platform? An RSS reader? A standalone product? Should they open up Meshin APIs (application programming interfaces) for developers to use within their own applications and services? All these models are a possibility, but first the engineers wanted to just get the technology out there, in the hands of users. </p>

<p>We're helping them with that by distributing invites to the private beta. For access, <a href="http://bit.ly/d8YfgW">click here</a>. </p>

<p>
<object width="480" height="385"><param name="movie" value="http://www.youtube.com/v/chGeOYUKATg?fs=1&amp;hl=en_US"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/chGeOYUKATg?fs=1&amp;hl=en_US" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="480" height="385"></embed></object>
                    ]]></description>
                <link>http://readwrite.com/2010/09/06/parc_releases_new_semantic_technology_in_form_of_an_outlook_plugin</link>
                <guid>http://readwrite.com/2010/09/06/parc_releases_new_semantic_technology_in_form_of_an_outlook_plugin</guid>
                <category>Product Reviews</category>
                <pubDate>Mon, 06 Sep 2010 00:45:52 -0700</pubDate>
                <author>Sarah Perez</author>
            </item>
                    <item>
                <title><![CDATA[SPARQLZ Shines as a Vision for Linked Data Made Easy]]></title>
                <description><![CDATA[
                                        <p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/sparqlzlogo-20100811-142822.jpg" style="" />
			</span>
<a href="http://www.sparqlz.com">SPARQLZ</a> is a stealth technology project aimed to provide a graphical user interface for everyday users to assemble, edit, share and mash-up modular, persistent, real-time searches across the web of Linked Data. It's a side project by an independent team within a large data corporation, with dreams of spinning their work off as a startup.   </p>

<p>It's a pretty hot idea: it's like <a href="http://pipes.yahoo.com">Yahoo Pipes</a>, for Linked Data - but easier to use and already populated with big sets of valuable information to mashup and parse.  Linked Data is a growing field of datasets that are categorized with standardized markup, tied together and easily cross referencable by machines.  The US and UK governments, news organizations, music data bases, social networks and other organizations are participating in the official W3C Linked Data community.  Now SPARQLZ aims to make all that data easy to construct future-facing search queries for.</p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/sparqlz2-20100811-121114.jpg" style="" />
			</span>
</p>

<p>SPARQLZ is named after the <a href="http://en.wikipedia.org/wiki/SPARQL">SPARQL</a> query language for structured data.  The service also uses technologies like <a href="http://en.wikipedia.org/wiki/Yahoo!_query_language">Yahoo Query Language</a> and real-time push format <a href="http://en.wikipedia.org/wiki/Pubsubhubbub">PubSubHubbub</a>.  Search results can be delivered to SMS, Email, Webhooks, a Feed Reader or other SPARQLs.</p>

<div class="pullquote">Linked Data is growing fast and has incredible potential as a development platform.  Sir Tim Berners-Lee, widely credited as the key inventor of the World Wide Web, is now focused on the growth of Linked Data as the web's next step.</div>On top of that technical stuff is an attempt to make things easy.  You want to know when a 3 bedroom house goes on sale in South East Portland, Oregon for under $250,000?  SPARQLZ says it will make setting up sophisticated alerts like that easy.  You want to know when a house like that goes on sale anywhere in the country where there's a high concentration of outdoor sports enthusiasts living and the temperature is within a range you're comfortable in?  Just snap some SPARQLs together and you can set up a search and alert for things like that, thanks to the availability of structured, linked data from government and private sources.

<p><strong>How would I use a service like this?</strong>  Just imagine stringing if/then statements together through <a href="http://richard.cyganiak.de/2007/10/lod/">the cloud of Linked Data</a> (see below).  As a technology publisher, I'd like to receive notification if and when SPARQLZ finds a photo (from <a href="http://flickr.com">Flickr</a> or <a href="http://CORDIS.com">CORDIS</a>) of a person from a known tech company (defined as the companies listed in <a href="http://crunchbase.com">Crunchbase</a>), that's headquartered in a country with a lower than world-median GNP (per the <em>CIA World Fact Book</em>).  Could SPARQLZ do that?  In theory, it could do things like that all day long - and you and I could trade queries like that back and forth like legos to piece together whatever kind of stream we were looking for.  It's a really exciting vision.</p>

<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/sparqlz2-20100811-121242.jpg" style="" />
			</span>
</p>

<p>Linked Data is growing fast and has incredible potential as a development platform.  Sir Tim Berners-Lee, widely credited as the key inventor of the World Wide Web, is now focused on the growth of Linked Data as the web's next step.</p>

<p>But in order to get beyond the borders of Wonky-land, Linked Data needs a good User Interface.  For many people, the ability to set up dynamic queries, mix and match them and have them deliver alerts to various devices could scratch an itch that many of us didn't know we had.  </p>

<p>Ok, so let's be honest.  This is probably not going to be a mainstream phenomenon.  But are there thousands, tens of thousands or maybe a larger number of people who could create value for themselves using a query construction and publishing model like this? Who could not or could not so easily create that kind of value on top of Linked Data today?  I think there are.  Maybe there are even millions of people who could capture some of the latent value in Linked Data thanks to a tool like this.</p>

<p><a href="http://www.SPARQLZ.com">SPARQLZ</a> is certainly intended as a way to democratize creation and use of something rich with value but previously too technically inaccessible for many people to use who might like to.  In that it's of the same vein as Blogger and WordPress were to text publishing, as YouTube is to video publishing and Twitter and Facebook are to social activity feeds.</p>

<p>Might this be the project that makes Linked Data hacking something that far more of us can engage in?  That would be great, but first the team will need to gather support and launch itself as a company.  For the sake not just of this small team of data-loving dreamers deep inside a big corporation - but for the sake of all us data-loving dreamers who would love to use their tool, I hope they can do it.</p>

<center><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/sparqlz3-20100811-142207.jpg" style="" />
			</span>
</center>

<center>Below: The Options, A Picture of the Linked Data Cloud</center>

<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/lod-datasets_2009-07-14_610.jpg" style="" />
			</span>
</p>

<p><em>The latest version of the <a href="http://richard.cyganiak.de/2007/10/lod/">Linking Open Data dataset cloud</a>, as at July 2009, maintained by Richard Cyganiak and Anja Jentzsch.</em></p>
                    ]]></description>
                <link>http://readwrite.com/2010/08/11/sparqlz</link>
                <guid>http://readwrite.com/2010/08/11/sparqlz</guid>
                <category>Data Services</category>
                <pubDate>Wed, 11 Aug 2010 07:31:07 -0700</pubDate>
                <author>Marshall Kirkpatrick</author>
            </item>
                    <item>
                <title><![CDATA[Mapping People to Products: Hunch & GetGlue]]></title>
                <description><![CDATA[
                                        <p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/neo_bullets.jpg" style="" />
			</span>
A few weeks ago I wrote that we've <a href="http://www.readwriteweb.com/archives/beyond_social_web_internet_of_things.php">moved to an era of the Web</a> that is <strong>beyond social</strong>. My contention is that successful services of this era of the Web will be ones that <strong>filter, structure and personalize</strong> the vast amount of data coming onto the Web. An example of this kind of application is <a href="http://hunch.com/">Hunch</a>, which this week re-launched as an <a href="http://www.readwriteweb.com/archives/hunch_internet_personalization_service.php">Internet personalization service</a>. Hunch is one of a number of modern web services aiming to connect you not only to other people, but to <strong>products and objects</strong>.</p>
<p>Hunch co-founder and Chief Product Office Caterina Fake <a href="http://www.wired.com/magazine/2010/07/ff_caterina_fake/2/">told Wired</a> in a recent profile that  &quot;the ultimate goal of the company is to map every person on the Internet to every object on the Internet, be that a product, a service, or a person.&quot;</p>
<p>I visited the Hunch web site today and answered more than 20 questions, in exchange for which I was offered a list of recommendations of magazines, books and TV shows. It's not a perfect list - I doubt I'll ever watch <em>(connect to, follow)</em> The West Wing, for example, no matter who or what recommends it to me. Nevertheless, Hunch is onto something.</p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/hunch_movies.jpg" style="" />
			</span>
</p>
<h2>Why Hunch Exists</h2>
<p>The so-called Web 2.0 era of the Web was based on user-generated content and social networking around that. Services like YouTube, MySpace and Flickr (which was co-founded by Caterina Fake) were the success stories of that era. </p>
<p>But now, in 2010, there is <strong>too much</strong> user-generated content to manually process. What's more, social networking is practically dominated by one company: Facebook. We no longer rely so much on niche sites like Flickr, YouTube, Netflix, Amazon to connect to other people socially. Another aspect to consider is that there's a lot of new data streaming in from sensors, RFID tags and other Internet-connected objects. </p>
<p>The upshot is that we need web services that can help us process all of this data and connect us to the parts that are personally relevant to us.</p>
<h2>Opportunities For Startups</h2>
<p>The refreshing thing is that these trends are opening up huge opportunities for startups.</p>
<p><a href="http://getglue.com">GetGlue</a> is another example of a startup aiming to match social data to objects or media. It knows for example that I recently watched Inception and (mostly) liked it. GetGlue can use that piece of data about me, look at my history of other movie likes, connect that to the movie history and preferences that it knows about <em>other people</em> who liked Inception. Ultimately all of that social and 'like' data can be used by GetGlue to recommend other movies to me that I may like to see.</p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/getglue_inception2.jpg" style="" />
			</span>
</p>
<p>We're early in this era, but both Hunch and GetGlue are busy building up extensive databases about people and what they like (their &quot;taste&quot; data). Not only that, they're slowly perfecting recommendation engines that process this data - ultimately filtering, structuring and personalizing it.</p>
<p>Let us know what other 'beyond social' startups have caught your eye recently.</p>
                    ]]></description>
                <link>http://readwrite.com/2010/08/06/mapping_people_to_products_hunch_getglue</link>
                <guid>http://readwrite.com/2010/08/06/mapping_people_to_products_hunch_getglue</guid>
                <category>Recommendation Engines</category>
                <pubDate>Fri, 06 Aug 2010 00:00:00 -0700</pubDate>
                <author>Richard MacManus</author>
            </item>
                    <item>
                <title><![CDATA["The Almighty API," Crawling and The Programmable Web]]></title>
                <description><![CDATA[
                                        <p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/opendata.png" style="" />
			</span>
Today, applications increasingly depend on a rich ecosystem of APIs. Thousands of different services are variously tethered together to form new software offerings and enhance existing ones. The idea of a programmable Web is finally coming true.</p>
<p>While this is not trivial, I am nonetheless beginning to question the long-term effects of an API-centric worldview, a sort of blind faith in the almighty API, which has at best a difficult relationship with open data and big data concepts.</p>
<p><em>As CEO, guest author Shion Deysarkar is responsible for the overall business and development of <a href="http://www.80legs.com">80legs</a>. In a previous life, he ran a predictive modeling firm. He enjoys playing poker and soccer, but is only good at one of them.</em></p>
<h2>How Do We Access Data Today?</h2>
<p>There are two core ways to access data today – via a publisher or via a crawler. Each has a different role.</p>
<p>&nbsp;</p>
<div class="pullquote">APIs, at least as we think of them today, have many disadvantages. And before you grab your shovels and organize a mob to come after me, please understand that I'm not calling for the discontinuation of APIs.</div>
<p>Publishers have data and choose to make it publicly available through an API so that developers can easily design products powered by a given service.</p>
<p>&nbsp;</p>
<p>Crawlers on the other hand are used to proactively go out and grab data by yourself - scraping Web pages for whatever it is you're looking for, data that can then be used to build products and inform better product and marketing decisions.</p>
<p>There is something of a third option, as well: data aggregators like <a href="http://www.factual.com">Factual</a> and <a href="http://www.infochimps.org">Infochimps</a> and <a href="http://www.hoovers.com">Hoovers</a>. I'm not going to treat them as part of this post because they gain access to data like the rest of us – via APIs and/or crawlers. They facilitate the distribution of that data as part of their core business (most often using a marketplace concept or subscription), but the input mechanisms are no different.</p>
<p>And there is potentially even a fourth option – human curation of the kind that Factual and <a href="http://www.wolframalpha.com/">WolframAlpha</a> and <a href="http://crowdflower.com/">CrowdFlower</a> employ to acquire new data altogether. But all of these providers offer API access to their data, so I'm still going to bucket them as such.</p>
<p>APIs, at least as we think of them today, have many disadvantages. And before you grab your shovels and organize a mob to come after me, please understand that I'm not calling for the discontinuation of APIs.</p>
<p>At 80legs, we ourselves offer a popular API, which takes a particularly hybrid approach – providing programmatic access to the data acquired via crawls.</p>
<p>What I really want is a natural stratification based on who is good at what, essentially. Right now, we're asking APIs to do too much.</p>
<p>APIs are great for the real-time web, for example – they're great for staying up to speed, whether that means trending search data or retweet velocity. APIs are great for enhancing functionality – whether that's a <a href="http://www.klout.com">Klout</a> score or geolocation. APIs are great for integrating certain pieces of non-strategic infrastructure like invite codes (<a href="http://www.prefinery.com">Prefinery</a>) if you're a startup in beta, or <a href="http://www.freshbooks.com">Freshbooks</a>, if you're an accountant. They're also great for app-level integrations, like adding Facebook accounts to <a href="http://www.tweetdeck.com">Tweetdeck</a>, or sucking down content from Netflix.</p>
<p>But at a higher level, as all applications and services become more and more data-driven, it's important to understand the differences between these different methods for extracting data, regardless of where you net out philosophically.</p>
<p>This is a discussion that needs to take place, but too few people recognize the distinctions.</p>
<h2>Control, Control, Control</h2>
<p>Control and flexibility are the two most important elements to look at when it comes to the difference between an API and a crawl. I also spend some time at the end of this post talking about security and privacy, because I think there are big impacts in these areas for APIs and crawlers alike.</p>
<p>Cost might be a fourth facet to look at, but that's grounds for a different post because pricing varies so widely.</p>
<p>Let's start with control.</p>
<p>When using an API, publishers – companies like Amazon and LinkedIn, for example – control the entire process. Publishers provide you with an API account, which allows you a certain amount of calls, or requests for data per day. They also determine what kinds of content are made available, and in what context.</p>
<p>Publishers offer an API for many reasons. It's financially in their best interest to have products built on top of their data to increase developer loyalty and form a kind of API-dependency to their content. It's also useful as a way to accurately measure server usage and overall engagement, even if there's no money involved.</p>
<p>APIs can go down and become unavailable, they can go from free to paid, and their publishers can be acquired by larger companies that make all manner of changes. There's a lot of uncertainty in APIs, and many devs have learned this fact the hard way. Think back to Gnip rethinking their entire business model due to the relicensing of certain APIs.</p>
<p>But like moths, we so often head right back to the flame.</p>
<p>Crawlers act very differently. They allow much more control over the data acquisition process. This has many advantages.</p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/json.jpg" style="" />
			</span>
For starters, the format in which content is delivered can be a lifesaver if formatted properly, or prompt hours of additional work if not. APIs supply content in one format – the format chosen by the publisher. Say you need a XML file type but the company only delivers JSON through their API. You're either stuck or left spending hours re-formatting.</p>
<p>Crawlers let the choice-driven developer have his cake and eat it too. Formats are just another choice to make beforehand, instead of a hindrance.</p>
<p>Granted, standardization can be great in some cases – for example with sites like MySpace where each profile is customized and therefore rendered in HTML differently. MySpace APIs format the content to make it uniform, meaning that what was once difficult to work with as a developer (i.e. large discrepancies in the data), is now standardized and simple to use.</p>
<p>But the "one size fits all" mentality fails more often than you might think, especially once you step outside of the Web's largest sites – one size fits all rarely fits anyone well.</p>
<p>And it's not just format – crawling offers much more control when it comes to time and timing, scope, and cost, too.</p>
<!-- <p><em><strong>Next page: </strong>Flexibility and Availability</em></p> -->
<!--nextpage-->
<h2>Flexibility and Availability</h2>
<p>Data access choices are an important component of building any Web product, especially when it comes to flexibility and availability. Specs change, needs change - heck, markets change. Especially if you're a lean startup, out early plus iterate often is a way of life.</p>
<p>APIs only deliver content from the publisher's site. You're locked into a single interface's content sources and structure, without flexibility by definition, which can be very limiting. You're left with acquiring stand-alone datasets to supplement your evolving needs, or mashing up with another API to fill in holes.</p>
<p>Now, the very best API providers are great at adapting to developers' needs and evolving alongside them. Companies like <a href="http://www.yolink.com">Yolink</a>, for which its API is its bread and butter, are particularly responsive. But too often an API is left unattended, having been a mere box to check, instead of a strategic commitment.</p>
<p>Unimaginative APIs can also limit use cases unwittingly, because some of the furthest-flung (if more promising) applications just aren't supported in the calls or code. There's a huge difference between an API that wants to be heightened and explored and an API whose scope, if anything, constrains original thinking.</p>
<p>Crawlers on the other hand aren't specific to any one site's data, meaning that they can access content from any number of sources and compile it in one place, mixing and matching, comparing and contrasting to your heart's delight.</p>
<p>Crawls can be more open-ended and investigative as well, whereas an API is more about putting a square peg in a square hole. API's also don't offer competitive advantage; everyone has access to the same stuff. A clever crawl can help build a moat.</p>
<p>Finally, crawlers can reach far beyond the capabilities of an API. Millions of pieces of data are publicly available on the Web, and only a very small percentage of it is available via an API. At a certain point it's purely an issue of volume. Much of the Web is instantly crawl-able, and the amount of data available freely on the Web is growing more quickly than the number of APIs by an order of magnitude. The caveat is that you just have to know where to look.</p>
<h2>The Elephant in the Room - Security and Privacy</h2>
<p>Let's talk about privacy and data, because how the world evolves in this respect could have huge implications for APIs and crawlers alike.</p>
<p>As the recent Facebook data privacy concerns highlight, the security of people's data is a high priority, regardless of how it may or may not be acquired or sold. Further, users expect publishers to protect their data aggressively (whether they do is another matter).</p>
<p>This is a PR and perception issue as much as it is anything else.</p>
<p>Users worry that their data might get into the hands of people who will use it for malicious purposes, whether via an API or a crawler. I would argue that this is <a href="http://www.readwriteweb.com/archives/bulk_social_data_80legs.php">not always the case</a> because responsible crawling companies, at least, have strict licensing agreements with their clients to ensure data is used lawfully.</p>
<p>But the reality is that publishers are increasingly incentivized because of public policy issues to constrain API access. And the world's biggest crawler, Google, is starting to look evil, with the ominous question, "What exactly does Google know about me?" popping up at family dinners around the country.</p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/Tim_Berners_Lee_Knight.png" style="" />
			</span>
Some are <a href="http://www.futurelab.net/blogs/marketing-strategy-innovation/2010/05/facebook_utility_utilities_get.html">even arguing</a> that Facebook is bound to be federally regulated sooner or later because of its profligacy when it comes to data, and that would certainly have broad impacts.</p>
<p>APIs are not inherently more or less secure than crawlers, but in the current climate, especially with regards to privacy, we can expect companies large and small to make less and less data open and available (something that the linked data community has been ruing as well).</p>
<p>Security right now is a big X factor that is going to take some time to play out.</p>
<p>The nice thing about crawlers (depending on your perspective) is that they are harder to control, at least for now. But it is a reasonable thing to say that data responsibility and privacy issues are going to shape and reshape this conversation big time.</p>
<p>Today's Web is full of data that if kept within an API-driven paradigm suffers from less creative use, less flexibility, and less control (from a developer standpoint).</p>
<p>An endlessly crawl-able Web was in many ways what Tim Berners-Lee and WC3 intended for the Web all along. Content creators like publishers and social networks can create sites as they've always done, while data aggregators can access data in whatever format they like.</p>
<p>In fact, in an older but still applicable <a href="http://www.zdnet.com/blog/berlind/web-inventor-tim-berners-lee-unplugged-semantic-web-better-than-apis-for-data-access/518">interview</a> with Berners-Lee, he talks about why an open, linked data Web is by far preferable than APIs for data access.</p>
<p>There is a foundational, DNA-level need to share data. Without openness, you lose the full value and impede any future innovation in the process. APIs absolutely have benefits – but only when we are not beholden to them – when we can use them rationally, strategically, and carefully. And when data isn't at the crux of your site, service or application.</p>
<p>"We have an open API" is an overused phrase, especially as APIs are by no definition open or closed.</p>
<p>If you need certain attributes, like real-time or speed, certain capabilities, or certain pieces of infrastructure, there are thousands of amazing APIs out there. But if your business runs on data, crawling is the only way to go.</p>
<p><em><small>Top photo by <a href="http://www.flickr.com/photos/jmv/2098820908/">Jason V</a>. Tim Berners-Lee photo by the <a href="http://www.flickr.com/photos/knightfoundation/2859733452/">Knight Foundation</a></small></em></p>
                    ]]></description>
                <link>http://readwrite.com/2010/08/04/the_almighty_api_crawling_and_the_programmable_web</link>
                <guid>http://readwrite.com/2010/08/04/the_almighty_api_crawling_and_the_programmable_web</guid>
                <category>Structured Data</category>
                <pubDate>Wed, 04 Aug 2010 03:00:00 -0700</pubDate>
                <author>Guest Author</author>
            </item>
                    <item>
                <title><![CDATA[Beyond Social: Read/Write in The Era of Internet of Things]]></title>
                <description><![CDATA[
                                        <p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/neo_bullets.jpg" style="" />
			</span>
This blog was founded in 2003 on the philosophy of <a href="http://www.readwriteweb.com/archives/the_readwrite_w.php">a read/write Web</a> - a Web in which people can <strong>create content</strong> as easily as they <strong>consume it</strong>. This trend eventually came to be known as <a href="http://www.readwriteweb.com/archives/evolving_wikipe.php">Web 2.0</a> - although others preferred <a href="http://factoryjoe.com/blog/2009/05/18/the-open-social-web/">Social Web</a> - and was popularized by activities like blogging and social networking. </p>
<p>It would be easy to say that the 'social' element is still the primary part of today's Web, since the popular products of this era enable you to say what's on your mind (<a href="http://facebook.com">Facebook</a>), what's happening (<a href="http://twitter.com">Twitter</a>), or where you are (<a href="http://foursquare.com">Foursquare</a>). All of these are mostly social activities. But more significantly, these and other products output data that will increasingly be used to build personalized services for you.</p>
<p>The more data there is, the better Web services will be at delivering personal value to you. While part of this increase in data is coming from social data from the likes of Facebook and Twitter, much of it is coming from the <a href="http://www.readwriteweb.com/archives/internet-of-things/">Internet of Things</a> and <a href="http://www.readwriteweb.com/archives/structured-data/">data uploaded by governments and organizations</a>. In short: the read/write Web is <strong>now much more than the Social Web</strong>.</p>
<h2>How We Went Beyond Social</h2>
<p>So how did we arrive at a Web that is less about <em>social</em> and more about <em>you</em>?</p>
<div class="pullquote">It's not how much content you consume that is important, it's about what you do with data.</div>
<p>After the peak of Web 2.0, we (meaning all of us) began to get overwhelmed with the choice of content available. We thought we had to actually 'read' as much of that content as possible. So we watched YouTube, chatted on MySpace and Facebook, read blogs, followed lots of people on this new thing called Twitter, and so on. By the end of 2008, we were exhausted by all of this CONTENT. How could we possibly keep up?!</p>

<p>In 2010, we're still struggling to digest all of what social media throws at us. However,  a shift has been happening since 2009 which alleviates the problem. We've begun to realize that it's not <strong>how much content we consume</strong> that is important: it's <strong>what we do</strong> with all of the social and other data available to us. The social is still important, but the resulting data is - slowly - becoming more important because it can be analyzed, filtered, mashed up and personalized.</p>
<h2>Structured Data & Internet of Things</h2>
<p>Two relatively new trends are driving this change.</p>
<div class="pullquote">If I was an entrepreneur or developer, I wouldn't be thinking about social anymore. I'd be thinking: How can I use all of this data and build on top of it?
</div>
<p>The first is the increasing amount of data being uploaded to the Web by governments, organizations and people. Much of this data is being structured using <a href="http://www.readwriteweb.com/archives/semantic-web/">Semantic Web</a> technologies like RDFa or microformats. In other words, it is categorized and encoded with meaning that machines can process. Recent examples include <a href="http://www.readwriteweb.com/archives/datagov_finally_launches_looks_nice_but_short_on_d.php">U.S.</a> and <a href="http://www.readwriteweb.com/archives/uk_launches_open_data_site_puts_datagov_to_shame.php">U.K. government data</a>, <a href="http://www.readwriteweb.com/archives/how_best_buy_is_using_the_semantic_web.php">Best Buy's store and product data</a> and <a href="http://www.readwriteweb.com/archives/facebook_the_semantic_web.php">Facebook's Open Graph</a>.</p>

<p>And then we have the <a href="http://www.readwriteweb.com/archives/internet-of-things/">Internet of Things</a>: an evolving trend where real-world objects and 'things' are connected to the Internet via technologies such as <a href="http://www.readwriteweb.com/archives/sensors_next_big_wave_of_computing.php">sensors</a> and <a href="http://www.readwriteweb.com/archives/rfid_state_of_the_market.php">RFID tags</a> - everything from cars to houses to roads and more. The upshot is that the Web is about to experience a <a href="http://www.readwriteweb.com/archives/data_analytics_software_must_adapt.php">data explosion</a>, as billions of sensors and other data input and output devices upload exabytes of new data to the Web.</p>
<h2>How do We Use This Data?</h2>
<p>If we add together social data from the likes of Facebook and Twitter, data from governments and businesses, and data from sensors and RFID, this is a huge amount of data. Most of it isn't for "consuming." Rather, the value of all of this new Web data will be in how it's filtered, mixed together ("mashed up") and personalized in new Web services - most of which haven't yet been built. </p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/momcomp1.png" style="" />
			</span>
Adam Greenfield is one of the leading thinkers of the Internet of Things; <a href="http://www.readwriteweb.com/archives/interview_with_adam_greenfield_part1.php">I interviewed him</a> earlier this year about his book called <a href="http://www.studies-observations.com/everyware/">Everyware</a>. Greenfield recently <a href="http://speedbird.wordpress.com/2010/07/15/momcomp/">wrote a post</a> describing a near future scenario for non-technical people using the Web. He posited a use case where his mother would be able to plan a train trip to see her son, by creating an "ad-hoc service" that tapped into the Web and utilized real-time data sources. </p>
<p>In 2010, his mother would have to find and "read" several different applications in order to plot her travel schedule, and some of that information isn't even currently on the Web. Greenfield envisions a near future where his mother can essentially "write" her requirements into her mobile or other device, and the Web will deliver a personalized schedule to "read." You can view a diagram of Adam's concept <a href="http://speedbird.files.wordpress.com/2010/07/momcomp.pdf">here</a> (PDF).</p>
<h2>Don't Think Social, Think Data</h2>
<p>Successful products in the Web 2.0 era had a strong social element: YouTube, MySpace and Flickr were a few relatively early examples. In the current era of the Web, which began to form in early 2009, the focus has shifted from social to data-driven software. Successful products of this era of the Web will be ones that <strong>filter, structure and personalize</strong> this vast amount of data coming onto the Web.</p>
<p>So if I was an entrepreneur or developer wondering what to build for this era of the Web, I wouldn't be thinking social. I'd be thinking: How can I use all of this data and build on top of it? There are incredible opportunities out there for you.</p>
<p>This current era of the Web doesn't have a name, which is probably a good sign! One thing is for sure though: It's still a read/write Web - only now you're reading and writing data from much more than just social services. You're increasingly interacting with "things," organizations, governments - virtually anything that can connect to the Web.</p>
                    ]]></description>
                <link>http://readwrite.com/2010/07/18/beyond_social_web_internet_of_things</link>
                <guid>http://readwrite.com/2010/07/18/beyond_social_web_internet_of_things</guid>
                <category>Features</category>
                <pubDate>Sun, 18 Jul 2010 19:39:39 -0700</pubDate>
                <author>Richard MacManus</author>
            </item>
                    <item>
                <title><![CDATA[BBC World Cup Website Showcases Semantic Technologies]]></title>
                <description><![CDATA[
                                        <p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/worldcup_final10.jpg" style="" />
			</span>
The soccer World Cup has now ended, with Spain the victor.  England was unceremoniously dumped out before the quarter finals - but if there was a World Cup for the Semantic Web, then the BBC may have lifted the trophy for its country. <a href="http://www.bbc.co.uk/blogs/bbcinternet/2010/07/bbc_world_cup_2010_dynamic_sem.html">A post on the BBC Internet site</a> explains how the <a href="http://news.bbc.co.uk/sport2/hi/football/world_cup_2010/default.stm">BBC World Cup 2010 website</a> used &quot;dynamic semantic publishing&quot; technology.</p> 
<p>It's an impressive demonstration of how a large, mainstream website can have added meaning and structure.</p>
<div class="super-pullquote">ReadWriteWeb's Guide to The Semantic Web:
<ol> 
<li><a href="http://www.readwriteweb.com/archives/w3c_pleased_with_semantic_web_adoption.php">Semantic Web Adoption by Facebook, Best Buy & Others</a></li> 
  <li><a href="http://www.readwriteweb.com/archives/open_data_linked_data_semantic_web.php">It's All Semantics: Open Data, Linked Data & The Semantic Web</a></li> 
<li><a href="http://www.readwriteweb.com/archives/the_state_of_linked_data_in_2010.php">The State of Linked Data in 2010</a></li> 
<li><a href="http://www.readwriteweb.com/archives/top_10_semantic_web_products_of_2009.php">Top 10 Semantic Web Products of 2009</a></li> 
<li><a href="http://www.readwriteweb.com/archives/interview_with_tim_berners-lee_part_1.php">ReadWriteWeb Interview With Tim Berners-Lee</a></li> 
 
 
</ol> 
</div> 
<p>The BBC World Cup site featured over 700 webpages and was powered by a semantic publishing framework. The site boasted a comprehensive ontology (a map of concepts), that output &quot;automated metadata-driven web pages&quot; created on-the-fly.</p>
<p>Jem Rayfield, Senior Technical Architect, BBC News and Knowledge, explained further:</p>
<blockquote>
  <p>&quot;The underlying publishing framework does not author content directly; rather it publishes data about the content - metadata. The published metadata describes the world cup content at a fairly low-level of granularity, providing rich content relationships and semantic navigation. By querying this published metadata we are able to create dynamic page aggregations for teams, groups and players.&quot;</p>
</blockquote>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/bbc_world_cup2.jpg" style="" />
			</span>
</p>
<p>The basis of this system was an ontology  that described how World Cup facts related to each other. For example, "Frank Lampard" was part of the "England Squad" and the "England Squad" competed in "Group C" of the "FIFA World Cup 2010". The ontology also included &quot;journalist-authored assets&quot; such as stories, blogs, profiles, images, video and statistics.</p>
<p>The publishing platform had both manual and automated tagging features. BBC journalists  could, for example, tag Frank Lampard in <a href="http://news.bbc.co.uk/sport2/hi/football/world_cup_2010/8766423.stm">a story about the disallowed goal</a> from England's last-16 loss against Germany. This is a normal part of most modern-day publishing systems (we tag content in this manner here at ReadWriteWeb). But the BBC World Cup site also <em>automatically</em> analyzed journalist content and matched it &quot;against the World Cup ontology.&quot; It did this by using what it describes as a &quot;natural language and ontological determiner process.&quot; <em>[<b>Update:</b> IBM wrote in to inform us that the technology behind this was <a href="http://www-01.ibm.com/software/globalization/topics/languageware/index.jsp">IBM LanguageWare</a>.]</em> This is similar to software such as Thomson Reuters' <a href="http://www.opencalais.com/">Calais</a> or the new <a href="http://www.extractiv.com/">Extractiv</a> product that <a href="http://www.readwriteweb.com/archives/extractiv_launches_semantics_as_a_service_platform.php">we reviewed yesterday</a>. In the BBC's case, the resulting tags were moderated before being published.</p>

<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/bbc_diagram_595.png" style="" />
			</span>
</p>
<p>The BBC used Semantic Web technologies such as RDF and SPARQL to build their World Cup site. The stated goal was to achieve &quot;intelligent mapping of journalist assets to concepts and queries.&quot; </p>
<p>The  site reportedly served millions of page requests a day throughout the World Cup. The BBC may use this semantic publishing platform for other parts of the BBC sports site; and it will certainly deploy it again for the the London 2012 Olympics.</p>
<p>The official <a href="http://www.bbc.co.uk/blogs/bbcinternet/2010/07/bbc_world_cup_2010_dynamic_sem.html">explanatory post</a> has the technical details, should you wish to follow up. Let us know in the comments about other large scale Semantic Web deployments that you know of.</p>
<p><b>Update:</b> See also <a href="http://www.bbc.co.uk/blogs/bbcinternet/2010/07/the_world_cup_and_a_call_to_ac.html">The World Cup and a call to action around Linked Data</a>, by the BBC's John O'Donovan. Thanks <a href="http://blog.georgikobilarov.com/">Georgi Kobilarov</a> for the pointer.</p>
                    ]]></description>
                <link>http://readwrite.com/2010/07/12/bbc_world_cup_website_semantic_technology</link>
                <guid>http://readwrite.com/2010/07/12/bbc_world_cup_website_semantic_technology</guid>
                <category>NYT</category>
                <pubDate>Mon, 12 Jul 2010 22:27:08 -0700</pubDate>
                <author>Richard MacManus</author>
            </item>
                    <item>
                <title><![CDATA[Extractiv Launches "Semantics as a Service" Platform]]></title>
                <description><![CDATA[
                                        <p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/extractiv_logo.jpg" style="" />
			</span>
<a href="http://www.extractiv.com/">Extractiv</a> has quietly launched a service that crawls the Web for text on a specific topic, then transforms it into &quot;structured semantic data.&quot; It's a direct competitor to Thomson Reuters' <a href="http://www.opencalais.com/">Calais</a> product, which has been doing this for <a href="http://www.readwriteweb.com/archives/reuters_calais.php">a couple of years now</a>. This type of service is potentially valuable to media companies, search services and monitoring applications - because it turns messy, unorganized HTML content into data that is organized into categories and given other semantic 'meaning.'</p>
<p>I sat down with Extractiv CEO Shion Deysarkar at the recent <a href="http://semtech2010.semanticuniverse.com/">Semantic Technology conference</a> in San Francisco, to find out how Extractiv intends to compete with the more well-known and big media backed Calais.</p>
<h2>How Extractiv Works</h2>
<p>Extractiv is a joint venture between Houston-based web crawling service <a href="http://www.80legs.com/">80legs</a> and natural language processing company <a href="http://www.languagecomputer.com/">LCC</a> (which created <a href="http://swingly.com/">Swingly</a>, a <a href="http://www.readwriteweb.com/archives/the_robot_made_me_do_it_comparing_three_new_cyborg_q_and_a_services.php">Q&amp;A service</a>).</p> 
<p>Deysarkar explained that Extractiv uses technology from both of its parent companies, to crawl the Web for  content on a particular topic and then - using natural language processing - transform it into structured data. <a href="http://vimeo.com/13169145">This video</a>, produced by Extractiv, explains how the service might be used to crawl the Web for stories about smart phones over the past month. </p>
<p><object width="601" height="338"><param name="allowfullscreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="movie" value="http://vimeo.com/moogaloop.swf?clip_id=13169145&amp;server=vimeo.com&amp;show_title=1&amp;show_byline=1&amp;show_portrait=0&amp;color=00adef&amp;fullscreen=1" /><embed src="http://vimeo.com/moogaloop.swf?clip_id=13169145&amp;server=vimeo.com&amp;show_title=1&amp;show_byline=1&amp;show_portrait=0&amp;color=cc0000&amp;fullscreen=1" type="application/x-shockwave-flash" allowfullscreen="true" allowscriptaccess="always" width="601" height="338"></embed></object></p>
<p>The output of the crawl and analysis can be JSON or XML, two formats commonly used for structured data. Support for <a href="http://www.readwriteweb.com/archives/w3c_pleased_with_semantic_web_adoption.php">RDFa</a>, a popular Semantic Web standard, will be available &quot;soon&quot; according to the company. Extractive also offers an API, allowing customers to bypass the web  site.</p>
<div class="super-pullquote">ReadWriteWeb's Guide to The Semantic Web:
<ol> 
<li><a href="http://www.readwriteweb.com/archives/w3c_pleased_with_semantic_web_adoption.php">Semantic Web Adoption by Facebook, Best Buy & Others</a></li> 
  <li><a href="http://www.readwriteweb.com/archives/open_data_linked_data_semantic_web.php">It's All Semantics: Open Data, Linked Data & The Semantic Web</a></li> 
<li><a href="http://www.readwriteweb.com/archives/the_state_of_linked_data_in_2010.php">The State of Linked Data in 2010</a></li> 
<li><a href="http://www.readwriteweb.com/archives/top_10_semantic_web_products_of_2009.php">Top 10 Semantic Web Products of 2009</a></li> 
<li><a href="http://www.readwriteweb.com/archives/interview_with_tim_berners-lee_part_1.php">ReadWriteWeb Interview With Tim Berners-Lee</a></li> 
 

</ol> 
</div> 
<p>Extractiv is free to try, but if you'll be a moderate or heavy user of the service then you'll have to pay (the pricing is as yet unavailable on the web site).</p>
<h2>Extractiv vs Calais</h2>
<p>Deysarkar told ReadWriteWeb that Extractiv is targeting &quot;mid-market Calais customers&quot; -  such as media companies or those developing search applications, monitoring services, recommendation engines or aggregators. He also claimed that Extractiv goes beyond what Calais offers, because it can mine sentiment data (which is data about how people feel about products and services).</p>
<p>Extractiv also wants to &quot;provide access to more types of semantic information than any other provider.&quot; As CEO of partner company LCC, Andrew Hickl, put it, &quot;if you're interested in baseball pitchers, a generic type like PERSON just won't cut it.&quot; </p>
<p>At launch, Extractiv offers about 250 different types of named entities, but it aims to have more than 3000 different entity types by the end of the U.S. summer.</p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/extractiv_screenshot1.jpg" style="" />
			</span>
</p>
<h2>Preparing For the Future of the Web</h2>
<p>The product is not aimed at the consumer market, so it's not for the faint hearted and you need to know what to do with all of that XML or JSON data! It also remains to be seen how competitive it is with Calais, which is a proven performer and has many reputable companies as its customers. Some startups have <a href="http://www.readwriteweb.com/archives/eqentia.php">taken on Calais</a> before, but fallen short. </p>
<p>However, there is undoubtedly a need for products like Extractiv and Calais that turn the Web's unstructured data into meaningful, organized content. This is the future of the Web, because there is going to be a large increase in the quantity of data online over the next 5-10 years - and all of that data will need to be structured if we're going to be make the best use of it.</p>
                    ]]></description>
                <link>http://readwrite.com/2010/07/11/extractiv_launches_semantics_as_a_service_platform</link>
                <guid>http://readwrite.com/2010/07/11/extractiv_launches_semantics_as_a_service_platform</guid>
                <category>Semantic Web</category>
                <pubDate>Sun, 11 Jul 2010 18:58:13 -0700</pubDate>
                <author>Richard MacManus</author>
            </item>
                    <item>
                <title><![CDATA[5 Key Trends of 2010: Half-Year Report for The Web]]></title>
                <description><![CDATA[
                                        <p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/twitter_facebook_people2.jpg" style="" />
			</span>
It's now a little over 6 months into 2010, so a good time to reflect on the highlights of the year so far. At the beginning of the year, we identified some key trends to track: (in alphabetical order) <a href="http://www.readwriteweb.com/archives/augmented-reality/">Augmented Reality</a>,  <a href="http://www.readwriteweb.com/archives/internet-of-things/">Internet of Things</a>, <a href="http://www.readwriteweb.com/archives/mobile-services/">Mobile</a>, <a href="http://www.readwriteweb.com/archives/real-time-web/">Real-Time Web</a>, <a href="http://www.readwriteweb.com/archives/structured-data/">Structured Data</a>.</p>
<p>Mobile and Real-Time Web have been particularly eventful in 2010, as you'll see below. Augmented Reality and Internet of Things are both early stage trends, but have continued to edge towards the mainstream this year. The movement towards Structured Data has made significant progress in 2010, primarily thanks to RDFa and the adoption of that Semantic Web format by Facebook, Google and other big companies.</p>
<h2>Mobile</h2>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/ipad_150.jpg" style="" />
			</span>
The next generation of the iPhone operating system, <a href="http://www.readwriteweb.com/archives/apple_announces_iphone_40.php">iPhone OS 4</a>, was launched in April - it  included long-awaited support for multitasking. That was followed  by the June <a href="http://www.readwriteweb.com/archives/apple_unveils_iphone_4_for_199_available_june_24th.php">release of iPhone 4</a>, the new handset. </p>
<p>However, it hasn't all been about the iPhone this year. Perhaps even more notable (although less hyped) has been the continued growth and expansion of Google's Android competitor. It started in January, when <a href="http://www.readwriteweb.com/archives/live_blog_googles_android_press_gathering.php">Google announced the new Nexus One</a>. Google described it as "Where Web Meets Phone" and called it a "super phone" (ok, so there was hype there too...). Android, Google's mobile OS, has also experienced <a href="http://www.readwriteweb.com/archives/android_market_share_doubles_will_overtake_palm_soon.php">strong</a> <a href="http://www.readwriteweb.com/archives/android_steals_market_share_from_iphone.php">market share growth</a> this year.</p>
<p>The biggest news of the year, though, has undoubtedly been <a href="http://www.readwriteweb.com/archives/apple_announces.php">Apple announcing the iPad</a> in late January. This led to <a href="http://www.readwriteweb.com/archives/ipads_top_apps_and_early_trends.php">a flurry of new apps</a> and a revival of interest in digital magazines.</p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/foursquare_badges_small.jpg" style="" />
			</span>
In terms of non-device innovation, much of the attention has been on <a href="http://www.readwriteweb.com/archives/foursquare_location_platform.php">location-based social networking services</a> like Foursquare, Gowalla and Brightkite.</p>
<p>While <a href="http://www.readwriteweb.com/archives/poll_which_location-based_mobile_app_do_you_use_now.php">all three were popular</a> during the annual geekfest in Austin, SXSW Interactive, by the half-year mark Foursquare appears to have the most momentum.</p>
<h2>Real-Time Web</h2>
<p>Twitter and Facebook have dominated the Real-Time Web landscape so far in 2010. Facebook has been under pressure  for its <a href="http://www.readwriteweb.com/archives/facebooks_zuckerberg_says_the_age_of_privacy_is_ov.php">controversial privacy changes</a> (that is, more and more of Facebook is being made public), yet it continues to grow market share. Meanwhile Twitter has become more well-known in the mainstream, on the back of news such as Michael Jackson's passing and huge events like <a href="http://www.readwriteweb.com/archives/world_cup_becomes_most_popular_web_event_ever.php">the World Cup</a>.</p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/twitter_worldcup1.jpg" style="" />
			</span>
</p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/google_buzz_logo2.jpg" style="" />
			</span>
Google tried to get in on the real-time action in February, with the launch of a <a href="http://www.readwriteweb.com/tag/buzz">media sharing service called Google Buzz</a>. Buzz was mostly a damp squib, however. More successful for Google was its <a href="http://www.readwriteweb.com/archives/google_developing_real_time_index.php">integration</a> of Twitter, <a href="http://www.readwriteweb.com/archives/google_adds_myspace_status_updates_to_real-time_search_results.php">MySpace</a>, <a href="http://www.readwriteweb.com/archives/google_takes_first_shot_at_facebook_search_results.php">Facebook</a> and other real-time results into its core search product.</p>
<p>There have been some interesting startups to emerge in this space over 2010. For example, <a href="http://quora.com/">Quora</a> - <a href="http://www.readwriteweb.com/archives/facebooks_1st_cto_launches_his_next_company_screen.php">a real-time enabled Q&A site</a> created by Facebook's first CTO Adam D'Angelo. Other products to have caught our eye this year include <a href="http://www.readwriteweb.com/archives/collecta_brings_real-time_content_to_any_site_with.php">Collecta</a>, <a href="http://www.readwriteweb.com/archives/oneriot_brings_its_real-time_ads_to_the_web.php">OneRiot</a>, <a href="http://www.readwriteweb.com/archives/digital_intuition.php">My6Sense</a> and <a href="http://www.readwriteweb.com/archives/superfeedr_team_real-time.php">SuperFeedr</a>.</p>
<h2>Internet of Things</h2>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/ces_web_apps_2010.jpg" style="" />
			</span>
Internet of Things is when real-world objects are connected to the Internet, often via sensors, barcodes and RFID tags. This was a trend we began exploring in earnest in 2009, when much of the applications were experimental. </p>
<p>In 2010 we've seen commercial applications begin to arise, starting at the 2010 Consumer Electronics Show (CES) in January. The main trend we noticed from this CES was <a href="http://www.readwriteweb.com/archives/web_apps_meet_consumer_electronics_at_ces.php">web applications being ported to consumer electronics</a> - everything from the technology inside cars to Web-enabled TVs.</p>
<p>Many of our posts on Internet of Things this year have been explanatory, to help you prepare for the expected increase in commercial applications over the next few years. Check out <a href="http://www.readwriteweb.com/archives/2010_trend_sensors_mobile_phones.php">our post on sensors and mobile phones</a> for an example. Also find out <a href="http://www.readwriteweb.com/archives/sensors_next_big_wave_of_computing.php">why HP thinks sensors will lead to the next big wave of computing</a>.</p>
<h2>Augmented Reality</h2>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/AR_picto_jun10.jpg" style="" />
			</span>
The field of AR has been similarly experimental up till now, so we have again spent a lot of time explaining this trend and putting it in context. An example is Austrian company Wikitude <a href="http://www.readwriteweb.com/archives/wikitude_brings_augmented_worlds_to_the_iphone.php">bringing augmented "Worlds" to the iPhone</a>, in February. A wider audience saw AR in April, when the Discovery Channel promoted its docu-drama hit Deadliest Catch with <a href="http://www.readwriteweb.com/archives/discovery_channel_puts_ar_in_front_of_millions_of.php">a desktop-based AR ad campaign</a>.</p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/arsession2_may10.jpg" style="" />
			</span>
<br />
<em>Augmented Reality was a hot topic <a href="http://www.readwriteweb.com/archives/state_of_the_augmented_reality_union_from_the_rww_mobile_summit.php">at the RWW Mobile Summit</a>, in May in Mountain View, CA. Photo: Chris Cameron</em></p>
<h2>Structured Data</h2>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/facebook_opengraph_150.gif" style="" />
			</span>
If 2009 was the year of &quot;open data&quot; (when previously offline data is uploaded to the Web), then 2010 so far has been <a href="http://www.readwriteweb.com/archives/w3c_pleased_with_semantic_web_adoption.php">the year of RDFa</a>. RDFa is a lightweight way to add extra meaning to HTML web pages. <a href="http://www.readwriteweb.com/archives/facebook_the_semantic_web.php">Facebook is using it</a> in their Open Graph platform, which was announced in April. The stated goal of the Open Graph protocol is to enable publishers to "integrate [their] Web pages into the social graph."</p>
<p>Others that have climbed on board the structured data train include <a href="http://www.readwriteweb.com/archives/google_semantic_web_push_rich_snippets_usage_grow.php">Google</a> and retailers like <a href="http://www.readwriteweb.com/archives/how_best_buy_is_using_the_semantic_web.php">Best Buy</a> and Tesco. Twitter hasn't yet supported RDFa, but its <a href="http://www.readwriteweb.com/archives/how_twitter_annotations_could_bring_the_real-time_semantic_web_together.php">Twitter Annotations</a> project comes close.</p>
<p>The open data movement also continues to expand. In January, Data.gov.uk, a new web site dedicated to making non-personal data held by the U.K. government available for software developers, <a href="http://www.readwriteweb.com/archives/uk_launches_open_data_site_puts_datagov_to_shame.php">launched with the help of Sir Tim Berners-Lee</a>. </p>
<p>You can see that 2010 has been very active with innovation! The iPad, new mobile devices, the increasing market penetration of Facebook and Twitter, the rise of the Semantic Web - it's been a fascinating year so far! In the comments, let us know your personal highlight of 2010 so far.</p>
<p><em>Lead image:  <a href="http://www.flickr.com/photos/ari/4066952389/">Steve Rhodes</a></em></p>
<p><script type="text/javascript" charset="utf-8" src="http://static.polldaddy.com/p/3439245.js"></script>
<noscript>
	<a href="http://polldaddy.com/poll/3439245/">What's Most Impressed You on the Web in 2010?</a><span style="font-size:9px;"><a href="http://polldaddy.com/features-surveys/">survey software</a></span>
</noscript></p>
                    ]]></description>
                <link>http://readwrite.com/2010/07/06/5_key_trends_of_2010_half-year_report_for_the_web</link>
                <guid>http://readwrite.com/2010/07/06/5_key_trends_of_2010_half-year_report_for_the_web</guid>
                <category>Augmented Reality</category>
                <pubDate>Tue, 06 Jul 2010 03:15:00 -0700</pubDate>
                <author>Richard MacManus</author>
            </item>
                    <item>
                <title><![CDATA[Evri's Evolution From Search to Real-Time News Curation]]></title>
                <description><![CDATA[
                                        <p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/evri-logo.png" style="" />
			</span>
When semantic recommendations service <a href="http://www.evri.com/">Evri</a> <a href="http://www.readwriteweb.com/archives/evri_beta_launches_search_less.php">launched two years ago</a>, the product (backed by Microsoft co-founder Paul Allen) was seen by many as a type of search engine. Nowadays, Evri models itself as a <strong>topic-based news service</strong>;  in particular, tapping into the real-time streams of mixed media coming from Facebook, Twitter, YouTube and other sources.</p>
<p>At the recent <a href="http://semtech2010.semanticuniverse.com/">Semantic Technology conference</a>, I sat down with Evri CEO Will Hunsinger. He called Evri the &quot;topical equivalent of a Facebook stream.&quot;</p>
<p>The technology of Evri is much the same as it was two years ago - it connects together topics using Semantic Web algorithms - but the landscape in which Evri is playing has evolved. In 2010, real-time information streams dominate. So Evri now aims to be a <strong>curation service</strong>.</p>
<p>In a nutshell, Evri allows you to explore and track topics. Its homepage displays a current hot topic (at time of writing, it was Lance Armstrong and the Tour de France), with other trending topics offered in the menu (for example 'Gulf Oil Spill' is a featured trend and 'LeBron James' is listed as &quot;Trending in US &amp; World'). You may also enter your own topic into the search bar. Evri is like a mix between Google Trends and Google News, with liberal sprinklings of Twitter and Facebook.</p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/evri_july10.jpg" style="" />
			</span>
</p>
<h2>Whither Twine...</h2>
<p>Evri made the news earlier this year when it <a href="http://www.readwriteweb.com/archives/paul_allen_backed_semantic_service_evri_has_been_a.php">acquired failed semantic web bookmarking application, Twine</a>. Curiously, Hunsinger described Evri as &quot;the inverse of Twine&quot; - because Evri does all the work, rather than the user. <a href="http://www.twine.com">Twine</a> is an application that relies on its users to actively bookmark pages, a la the much more successful <a href="http://delicious.com">Delicious</a>. Evri automatically collates topical information and presents it the user.</p>
<p><em><b>Update:</b> Twine founder Nova Spivack wrote in and stated: "Twine was in fact highly automated as well, but in a different way than Evri - Twine used NLP [Natural language processing] to auto-tag every entry, generate summaries, and used graph algorithms to make recommendations."</em></p>
<p>Where Evri shares similarities with Twine is in the ability for users to track a topic. Hunsinger said that Evri users may &quot;follow a story as it evolves over time, and tune it.&quot; He described this as being like a &quot;mini blog&quot; for its users.</p>
<p>Twine appears to have been of most use to Evri for its underlying technology. Hunsinger told us that Evri is using technology it acquired from Twine to extend Evri's categories and for advanced filtering.</p>
<h2>What's Next</h2>
<p>Evri is not short on ideas and innovation - for example it announced a <a href="http://www.readwriteweb.com/archives/how_does_the_web_feel_evri_tells_you.php">Sentiment API</a> last August. However some of these ideas are slow to eventuate. Hunsinger said that the Sentiment API is not in commercial deployment yet, because it requires much more media in order to calculate sentiment and it tends to work best with well-known people (like Barack Obama). However, Evri is working on incorporating data about shares, tweets, and more in order to beef up its Sentiment analysis engine.</p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/evri_july10b.jpg" style="" />
			</span>
</p>
<p>The company is also currently  working on what Hunsinger described as &quot;Pandora-like recommendations,&quot; referring to the geographically-limited online music service <a href="http://www.pandora.com">Pandora</a>. </p>
<p>Other expansion plans include  launching one new channel per week, to extend Evri's topical coverage.</p>
<h2>Can Evri Compete as a Consumer App?</h2>
<p>Evri is an interesting product and is currently being used by media partners like Hearst and Canwest. </p>
<p>However, Evri will likely continue to struggle as a consumer offering. It's competing against a plethora of real-time news apps - everything from TweetDeck, Google News, Topix, Techmeme, and more. I wouldn't be surprised if Evri is eventually acquired by a big media company for its technology, much as Evri snapped up the struggling Twine for the same reason.</p>
                    ]]></description>
                <link>http://readwrite.com/2010/07/04/evri_real-time_news_curation</link>
                <guid>http://readwrite.com/2010/07/04/evri_real-time_news_curation</guid>
                <category>New Media</category>
                <pubDate>Sun, 04 Jul 2010 15:50:00 -0700</pubDate>
                <author>Richard MacManus</author>
            </item>
                    <item>
                <title><![CDATA[Facebook & The Semantic Web]]></title>
                <description><![CDATA[
                                        <p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/facebook_opengraph_150.gif" style="" />
			</span>
This week we've been exploring <a href="http://www.readwriteweb.com/archives/w3c_pleased_with_semantic_web_adoption.php">the emergence of the Semantic Web</a> among companies <a href="http://www.readwriteweb.com/archives/how_best_buy_is_using_the_semantic_web.php">like Best Buy</a> and <a href="http://www.readwriteweb.com/archives/google_semantic_web_push_rich_snippets_usage_grow.php">Google</a>. It's all thanks to <a href="http://www.w3.org/2010/02/rdfa/">RDFa</a>, code that is inserted into the HTML of web pages to add extra meaning. The increasing usage of RDFa was one of the main themes at the recent <a href="http://semtech2010.semanticuniverse.com/">Semantic Technology conference</a> in San Francisco.</p>
<p>There is perhaps no better example than <a href="http://www.facebook.com">Facebook</a>'s use of RDFa. We chatted to Facebook open standards evangelist David Recordon to find out more.</p>
<p>In April this year, Facebook announced a large-scale new platform called the <a href="http://developers.facebook.com/docs/opengraph">Open Graph</a>. The stated goal of the Open Graph protocol was to enable publishers to &quot;integrate [their] Web pages into the social graph.&quot; Essentially, each web page can now become an 'object' in Facebook's <a href="http://www.readwriteweb.com/archives/social_graph_concepts_and_issues.php">social graph</a> (which is Facebook's term for how people connect to each other in its network). This means that pages can be referenced and connected across social network user profiles,  blog posts, search results, Facebook's News Feed, and more.</p>
<h2> The Open Graph in a Nutshell</h2>
<p>The Open Graph is a wide-ranging platform which includes features such as 'Like' buttons and publisher plug-ins. It also includes a simple, RDF-based markup. This requires publishers to include at least 4 metadata properties in each object: title, type, image, URL. There are a few additional properties which may be optionally added, such as site_name and description. </p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/fb_rww_properties2.gif" style="" />
			</span>
</p>
<p>When the Open Graph was announced, ReadWriteWeb feature writer Alex Iskold wrote a comprehensive <a href="http://www.readwriteweb.com/archives/facebook_open_graph_the_definitive_guide_for_publishers_users_and_competitors.php">guide to the Open Graph for publishers, users and competitors</a>. Here's how he described the markup:</p>
<blockquote>
  <p> &quot;...the markup enables publishers to say what object is on the page - a movie, a book, a recording artist, an event, a sports team, etc. This automatically enables semantics, that is, an understanding that the user is not just interacting with a web page, but that he or she is liking a specific kind of thing. Semantics then leads to bucketing of the objects into categories like books, movies, music, etc., and gives rise to all sort of applications, including personalized recommendations.
    
  Perhaps even more importantly, the markup helps Facebook connect the users across common interests across different web sites.&quot;</p>
</blockquote>
  <p>The syntax that Facebook uses is fairly similar to RDFa, without being an exact match. Sandro Hawke from the <a href="http://www.w3.org/">W3C</a> (the Web's official standards body), <a href="http://www.readwriteweb.com/archives/w3c_pleased_with_semantic_web_adoption.php">told ReadWriteWeb last week</a> that Facebook is using RDFa "in an abbreviated, not really good modeling way.&quot;</p>
  <p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/facebook_opengraph_jun10a.jpg" style="" />
			</span>
</p>
  <h2>Facebook's  KISS (Keep it Simple Semantics)</h2>
<p>The reason that Facebook is using an abbreviated version of RDFa is that they want to make it <strong>as simple as possible for publishers to deploy it</strong>. And the W3C is fine with this, because it has a rules standard called <a href="http://www.w3.org/TR/2010/NOTE-rif-overview-20100622/">RIF</a> that can convert Open Graph code into proper RDFa if needed. Also the W3C and Semantic Web community seem to be having <a href="http://www.semanticweb.com/news/facebook_and_w3c_connect_rdf_schema_follows_article_page_types_added_too_160588.asp?c=rss">fruitful discussions</a> with Facebook.</p>
<p>We asked Facebook open standards evangelist <a href="http://davidrecordon.com/">David Recordon</a> why the company chose <em>not</em> to use exact RDFa syntax. He replied that simplicity was paramount, so they wanted to use as few properties as possible. </p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/3505/3854257463_c2e291776d_m.jpg" style="" />
			</span>
However, Recordon said that Facebook's code will actually help the adoption of RDFa. He thinks that RDFa will become a more interesting technology when more people are consuming it (via Facebook's variation of RDFa). </p>
<p> Facebook did not set out to support the Semantic Web, added Recordon, it just wanted simple code that was as close as possible to open standards. He told us that Facebook found the &quot;sweet spot&quot; between the complexity of existing Semantic Web standards and the simplicity required for tens of thousands of developers to place 'objects' inside their web pages.</p>
<h2>Open Graph Issues</h2>
<p>Facebook's Open Graph protocol has its issues, which Alex Iskold outlined in <a href="http://www.readwriteweb.com/archives/does_facebook_really_want_a_semantic_web.php">a follow-up analysis in May</a> (it should be noted that Iskold's startup, <a href="http://getglue.com/">GetGlue</a>, has been mining web page metadata for a while now - so it's partly competitive with Facebook, although it also <a href="http://www.readwriteweb.com/archives/facebook_open_graph_protocol_implementation.php">provides support for the Open Graph</a>). Some of Facebook's issues were due to poor implementation of the markup by Facebook partners, but Recordon told us that much of this has been fixed now. </p>
<div class="super-pullquote">ReadWriteWeb's Guide to The Semantic Web:
<ol> 
  <li><a href="http://www.readwriteweb.com/archives/open_data_linked_data_semantic_web.php">It's All Semantics: Open Data, Linked Data & The Semantic Web</a></li> 
<li><a href="http://www.readwriteweb.com/archives/the_state_of_linked_data_in_2010.php">The State of Linked Data in 2010</a></li> 
<li><a href="http://www.readwriteweb.com/archives/top_10_semantic_web_products_of_2009.php">Top 10 Semantic Web Products of 2009</a></li> 
<li><a href="http://www.readwriteweb.com/archives/interview_with_tim_berners-lee_part_1.php">ReadWriteWeb Interview With Tim Berners-Lee</a></li> 
 
<li><a href="http://www.readwriteweb.com/archives/semantic_web_patterns.php">Semantic Web Patterns: A Guide to Semantic Technologies</a></li> 
</ol> 
</div> 
<p>Perhaps a bigger problem is that currently only one object can be specified per web page, which prevents multiple topics or people on a single web page from being semantically marked up. Also, Iskold pointed out that there  is no way to disambiguate objects - so if two objects have the same name, they are considered the same thing by Facebook. For example, 'Paris' could refer to the city in France or the city in the USA (also it could be the name of an infamous socialite).</p>
<h2>Kick Starting the Semantic Web</h2>
<p>These issues ideally should be ironed out, but for now it's clear that Facebook wants Open Graph to be very simple so that as many publishers as possible implement it. That's a sensible business decision, although it means that we're only getting a limited version of the Semantic Web for now. However, the support of Semantic technologies by big companies like Facebook, Google and Best Buy will be enough to kick start this new era of a smarter and more structured Web of data.</p>
<p>What do you think, is Facebook doing enough to support Semantic Web standards? Let us know your thoughts in the comments.</p>
                    ]]></description>
                <link>http://readwrite.com/2010/07/01/facebook_the_semantic_web</link>
                <guid>http://readwrite.com/2010/07/01/facebook_the_semantic_web</guid>
                <category>SemTech 2010</category>
                <pubDate>Thu, 01 Jul 2010 18:12:00 -0700</pubDate>
                <author>Richard MacManus</author>
            </item>
                    <item>
                <title><![CDATA[How Best Buy is Using The Semantic Web]]></title>
                <description><![CDATA[
                                        <p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/bestbuy_logo.jpg" style="" />
			</span>
Yesterday we wrote about the increasing usage of <a href="http://www.readwriteweb.com/archives/w3c_pleased_with_semantic_web_adoption.php">Semantic Web technologies  by large commercial companies</a> like Facebook, Google and Best Buy. The <a href="http://www.readwriteweb.com/archives/semantic-web/">Semantic Web</a> is a Web of added meaning, which ultimately enables smarter and more personalized web apps to be built. In this post we explore how a leading U.S. retailer, <a href="http://www.bestbuy.com/">Best Buy</a>, is using a Semantic Web markup language called RDFa to add semantics to its webpages. <font style="float: right; margin-left: 10px;"><script type="text/javascript">
tweetmeme_url = 'http://www.readwriteweb.com/archives/how_best_buy_is_using_the_semantic_web.php';
tweetmeme_source = 'rww';
</script><script type="text/javascript" src="http://tweetmeme.com/i/scripts/button.js"></script></font></p>
<p>This is not just an academic exercise for Best Buy. As we will see, semantic technology has already led to increased traffic and better service to its customers. We spoke to <a href="http://jay.beweep.com/">Jay Myers</a>, Lead Web Development Engineer at BestBuy.com, to find out how. </p>
<div class="super-pullquote">ReadWriteWeb's Guide to The Semantic Web:
<ol> 
  <li><a href="http://www.readwriteweb.com/archives/open_data_linked_data_semantic_web.php">It's All Semantics: Open Data, Linked Data & The Semantic Web</a></li> 
<li><a href="http://www.readwriteweb.com/archives/the_state_of_linked_data_in_2010.php">The State of Linked Data in 2010</a></li> 
<li><a href="http://www.readwriteweb.com/archives/top_10_semantic_web_products_of_2009.php">Top 10 Semantic Web Products of 2009</a></li>
<li><a href="http://www.readwriteweb.com/archives/interview_with_tim_berners-lee_part_1.php">ReadWriteWeb Interview With Tim Berners-Lee</a></li>

<li><a href="http://www.readwriteweb.com/archives/semantic_web_patterns.php">Semantic Web Patterns: A Guide to Semantic Technologies</a></li>
</ol>
</div>
<p>Myers told us that the primary goal of using semantic technologies was to increase the visibility of its products and services. And with data such as store name,
  address,
  store hours and
GEO data being marked up using RDFa, search engines are now able to identify each of those data components more easily and put them into context.</p>
<p>A quick refresher on the terminology: just as the lingua franca of the Web is HTML (Hypertext Markup Language), RDF (Resource Description Framework) is commonly thought of as the primary language of the Semantic Web. RDFa is a kind of 'lite' version of RDF, which adds metadata to HTML (or XHTML) webpages.</p>
<p>The process of adding RDFa to Best Buy's webpages began two years ago, when the company began to look for ways to get more visibility to its stores on the Web. &quot;At that time,&quot; said Myers, &quot;it was difficult for users to find basic store information like store location and hours.&quot; </p>
<p>To solve this dilemma, Best Buy gave each store its own blog.</p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/best_buy_rdfa.jpg" style="" />
			</span>
</p>
<p>Best Buy employees entered information into the blogs every day, using online forms that output RDFa. Myers told us that the use of RDFa makes &quot;human input from our store employees more visible on the Web.&quot; </p>
<p>Best Buy is using <a href="http://www.heppnetz.de/projects/goodrelations/">Good Relations</a>, a Semantic Web vocabulary for e-commerce that describes product, price, and company data.</p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/bestbuy_rdfa2.jpg" style="" />
			</span>
</p>
<p>Myers remarked that &quot;there isn't a noticeable difference&quot; to the users of Best Buy's website, however all of the RDFa data is very visible to humans via search engine results and its store locator tool. The RDFa data is &quot;also great for machines,&quot; said Myers, which has resulted in &quot;a definite up tick in the amount of search traffic to these pages.&quot; At last week's <a href="http://semtech2010.semanticuniverse.com/">SemTech conference</a>, <a href="http://www.slideshare.net/jaymmyers/myers-jay-rev">Myers said</a> that it had resulted in a 30% increase in search traffic. He noted that Best Buy hadn't expected to see an SEO benefit, but it's been a boon to them since the company is &quot;very reliant on search engines&quot; for product discovery and store locations.</p>
<p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/images/bestbuy_rdfa3.jpg" style="" />
			</span>
</p>
<p>With Jay Myers at the development wheel, Best Buy's web presence will continue to be enhanced by the Semantic Web. RDFa can ultimately create rich relationships between products, which will in turn &quot;create a deeper visibility to additional products&quot; when a customer is shopping.</p>
<p>That seems like a distinct competitive advantage for Best Buy.</p>
<p><object id="__sse4636321" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=myersjayrev-100628131244-phpapp02&rel=0&stripped_title=myers-jay-rev" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed name="__sse4636321" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=myersjayrev-100628131244-phpapp02&rel=0&stripped_title=myers-jay-rev" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object></p>
                    ]]></description>
                <link>http://readwrite.com/2010/06/30/how_best_buy_is_using_the_semantic_web</link>
                <guid>http://readwrite.com/2010/06/30/how_best_buy_is_using_the_semantic_web</guid>
                <category>SemTech 2010</category>
                <pubDate>Wed, 30 Jun 2010 23:00:00 -0700</pubDate>
                <author>Richard MacManus</author>
            </item>
            </channel>
</rss>

