<?xml version="1.0" encoding="UTF-8" ?>
<rss xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">
        <channel>
        <title>nosql - ReadWrite</title>
        <link>http://readwrite.com</link>
        <description />
        <language>en</language>
        <copyright>Copyright 2012 SAY Media, Inc.</copyright>
        <managingEditor>readwriteweb@gmail.com</managingEditor>
        <docs>http://blogs.law.harvard.edu/tech/rss</docs> 
        <lastBuildDate>Mon, 25 Mar 2013 05:30:00 -0700</lastBuildDate>
        <atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://rww.superfeedr.com/" />

                    <item>
                <title><![CDATA[When NoSQL Databases Are — Yes — Good For You And Your Company]]></title>
                <description><![CDATA[
                                        <img src="http://readwrite.com/files/styles/800_450sc/public/fields/shutterstock_nosql.jpg" />
                                        <p>The proliferation of non-relational databases in the tech sector these days could lead you to think that these data management tools (also known as NoSQL databases) are eventually going to make traditional relational databases extinct.</p>
<p>Not so. Each of these database types is best suited for very different types of workloads, and that's going to prevent either one from tromping the other into the dust. Which means that IT and other managers are going to have to figure out which approach is best suited for the task at hand.</p>
<p>In this two-part series, I'll examine the capabilities of both NoSQL and relational databases to help you make the right decisions for your organization.</p>
<h2>"NoSQL"? More Like "Not Only SQL"</h2>
<p>Right off the bat, NoSQL databases are unique because they are usually independent from Structured Query Language (SQL) found in relational databases. Relational databases all use SQL as the domain-specific language for <em>ad hoc</em> queries, while non-relational databases have no such standard query language, so they can use whatever they want. That can, if need be, include SQL.</p>
<p>NoSQL databases are designed to excel in speed and volume. To pull this off, NoSQL software will use techniques that can scare the crap out of relational database users — such as not promising that all data is consistent within a system all of the time.</p>
<p>That's a key result of using relational databases, because when you are conducting a financial transaction, such as buying something on Amazon, databases have to be very sure that one account is debited the same amount that another account is debited at the same time.</p>
<p>Because so much of this back-and-forth read-write activity is needed in a single transaction, a relational database could never keep up with the speed and scaling necessary to make a company like Amazon work.</p>
<p>According to Bob Wiederhold, CEO of NoSQL database vendor <a title="http://www.couchbase.com" href="http://www.couchbase.com">Couchbase</a>, the architecture and features of NoSQL databases bring four key advantages to NoSQL users.</p>
<h2>Go Big Or Go Home</h2>
<p>Easier scalability is the first aspect highlighted by Wiederhold. NoSQL databases like Couchbase and 10Gen's MongoDB, he said, can be scaled up to handle much bigger data volumes with relative ease.</p>
<p>If your company suddenly finds itself deluged by overnight success, for example, with customers coming to your Web site by the droves, a relational database would have to be painstakingly replicated and re-partitioned in order to scale up to meet the new demand.</p>
<p>Wiederhold cited social and mobile gaming vendors as the big example of this kind of situation. An endorsement or a few well-timed tweets could spin up semi-dormant gaming servers and get them to capacity in mere hours. Because of the distributed nature of non-relational databases, to scale NoSQL all you need to do is add machines to the cluster to meet demand.</p>
<h2>There's No Need To Fear</h2>
<p>Performance is another way that NoSQL databases can excel. First, every time you add a new server to a NoSQL database cluster, there is performance scaling by virtue of the fact that you're throwing another processor into the equation.</p>
<p>Beyond the scaling advantages, the very architecture of NoSQL tools aids performance. If a relational database had tens or even hundreds of thousands of tables, data processing would generate far more locks on that data, and greatly degrade the performance of the database.</p>
<p>Because NoSQL databases have weaker data consistency models, they can trade off consistency for efficiency. In Wiederhold's social gaming example, if a user updated his or her profile, there's no real degradation of game performance if that profile's new info isn't updated across the entire database instantly. This means that resources can be dedicated to other things, like tracking down that orc that's about smack you around in-game.</p>
<h2>Objects Of Desire</h2>
<p>When applications developers have to work with data in relational databases, it can at times be troublesome due to data mapping and impedance issues. In NoSQL databases, this is not usually a problem, because data is not stored in the same manner.</p>
<p>With document-oriented NoSQL databases, for instance, data is stored in just that format: documents. And since documents are objects, after all, then programmers who tend to think in object-oriented terms are going to be much more familiar with manipulating such data.</p>
<p>That weaker consistency model helps programmers, tool, since their apps don't have to rigidly conform to data consistency requirements. That makes coding much simpler and (by extension) much faster.</p>
<h2>Downtime? What Downtime?</h2>
<p>Wiederhold's fourth example is a new one to the NoSQL advantage list. It's something that non-relational databases weren't specifically designed to do, but at which they've nonetheless turned out to be proficient.</p>
<p>Because of their distributed nature, Wiederhold says, NoSQL databases can be pretty much always on. This is a huge advantage for web- and mobile-based businesses that can't afford to be down for a single moment.</p>
<p>With some advanced planning, software updates and hardware upgrades can be performed while the database is still running hot. Try doing that with a relational database without taking it down, and you're in for a world of trouble.</p>
<h2>get section where name = Conclusion</h2>
<p>NoSQL databases' advantages sound great, and they are — for the right use case. If your company has a data set that will remain relatively constant in size, or that only grows slowly, you'll have little need to migrate to a non-relational system.</p>
<p>If sudden growth, agile development or high performance are something your organization will need for its data, then you should take a look at the NoSQL class of databases and see what might be a fit for you.</p>
<p><em>Image courtesy of <a href="http://www.shuttertock.com">Shutterstock</a><br /></em></p>
                    ]]></description>
                <link>http://readwrite.com/2013/03/25/when-nosql-databases-are-good-for-you</link>
                <guid>http://readwrite.com/2013/03/25/when-nosql-databases-are-good-for-you</guid>
                <category>NoSQL</category>
                <pubDate>Mon, 25 Mar 2013 05:30:00 -0700</pubDate>
                <author>Brian Proffitt</author>
            </item>
                    <item>
                <title><![CDATA[Attacking Big Data Old-School Style - With VMware's SQLFire]]></title>
                <description><![CDATA[
                                        <img src="http://readwrite.com/files/styles/800_450sc/public/fields/shutterstock_6397828.jpg" />
                                        <p class="p1"><a href="http://www.vmware.com/" target="_blank"><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/vmware_300x60_contributed.jpg" style="" />
			</span>
</a></p>
<p class="p1">As more and more information floods into the Internet, organizing and making sense of this <a href="http://readwrite.com/2012/03/05/big-data">Big Data</a> becomes more important and more difficult.</p>
<p class="p1">New database methods are emerging to help process unstructured data, but IT developers and database deployers also have to figure out how to deal with the world of legacy technology.</p>
<p class="p1">For the last 40 years, relational database programs (usually powered by <a href="http://en.wikipedia.org/wiki/SQL">SQL</a>-based management systems) have been the backbone of supplying businesses with organized rows and columns of data. The problem is that these legacy systems may not be able to work together to give businesses the information they need when they need it. Older programs may also have trouble processing data requests over long distances.</p>
<h2 class="p2">A New Way Of Thinking About Databases</h2>
<p class="p1">A new way of thinking is needed. Over the past decade the push for “not only SQL” or <a href="http://en.wikipedia.org/wiki/NoSQL">NoSQL</a> database software has provided a pathway for businesses to connect bits and pieces of data from a variety of sources at very rapid speeds across different geographies.</p>
<p class="p3"><strong>(See also </strong><strong><a href="http://readwrite.com/2013/02/20/whats-next-for-taming-big-data">What's Next For Taming Big Data?</a><span style="line-height: 1.538em;">)</span></strong></p>
<p class="p1">Some businesses are spreading out the workloads using noSQL databases within cloud computing-based networks. Others are approaching the problem still using traditional SQL relational database software - and that’s perfectly OK.</p>
<p class="p1">Previous articles in this series (<a href="http://readwrite.com/series/taming-big-data/">Taming Big Data</a>) discuss the benefits of a noSQL database tool like <a href="http://vmware.com/go/gemfire">VMware’s vFabric GemFire</a>. But SQL database software retains a well-established community of tens of thousands of developers and integrators who may be reluctant to move beyond the SQL they know and love. What’s a company to do?</p>
<p class="p3"><strong>(See also </strong><strong><a href="http://readwrite.com/2013/02/27/cloud-based-gemfire-makes-it-easier-to-work-with-big-data">VMware's Cloud-Based GemFire Makes It Easier To Work With Big Data</a>.)</strong><strong><br /></strong></p>
<p class="p1">For SQL diehards, <a href="http://www.vmware.com/go/sqlfcomm">VMware’s vFabric SQLFire</a> SQLFire is a distributed SQL database typically used for online transactions. The software is more modern than most traditional relational database management systems.</p>
<h2 class="p2">What Can SQLFire Do For Me?</h2>
<p class="p1">SQLFire functions and performs much like <a href="http://readwrite.com/2013/02/27/cloud-based-gemfire-makes-it-easier-to-work-with-big-data">GemFire</a> under the hood. SQLFire uses GemFire's data grid engine, which lets both programs capture data and then replicate and partition the information "in-memory” on the server. But instead of having to learn GemFire commands and controls, SQLFire has a user interface and programming framework that will be familiar to developers used to programming in a SQL interface and with SQL tools.</p>
<p class="p1">Backups are enabled through virtual copies on other connected servers, although data can be stored long-term on disks as needed.</p>
<p class="p1">Unlike other embedded databases, SQLFire allows several servers to store replicated and partitioned tables, persist data to disk, communicate directly with other servers and participate in distributed queries.</p>
<p class="p1">For traditional IT developers and database deployers, the SQLFire interface makes it easier to write applications and take advantage of GemFire’s underlying noSQL technology. Developers and integrators who know SQL well will have an easy time adapting SQLFire to new projects.</p>
<p class="p1">SQLFire is perfect for classic Web transactions, especially where there is a need for fast speeds and a requirement to dig deep into clusters of data.</p>
<h2 class="p2">Business Case For SQLFire</h2>
<p class="p1">In addition to making SQL developers feel comfortable, SQLFire can work across multiple networks and geographies. This comes in handy when enterprises need information at the moment it becomes available on multiple continents.</p>
<p class="p1">For example, a large regional bank in the Northeastern United States collects large amounts of data that helps it maintain its regional and branch offices. The bank also monitors customer transactions at tellers and various ATMs.</p>
<p class="p1">Bank management was interested in measuring the different types of transactions being handled at each of type of station, what types of accounts they were accessing and the various times of day the transactions took place.</p>
<p class="p1">Historically, the bank could attach an individual database to each branch, but in today's global environment the company decided it needed to measure all of these data points at the same time for each office. The company tested vFabric SQLFire against its own systems and found the existing server took 20 minutes to complete the queries while the SQLFire server completed its task in less than a minute.</p>
<h2 class="p2">Deploying SQLFire In The Enterprise</h2>
<p class="p1">In the enterprise sQLFire is generally found on inexpensive computer servers in database clusters. A typical use case would find SQLFire helping eliminate potential data bottlenecks in new mobile and Web environments. Another common deployment option for SQLFire is to integrate it with existing traditional databases or analytics programs.</p>
<p class="p1">The software can also be interfaced through an API using programming languages such as Java or <a href="http://www.springsource.org/">Spring</a>. SQLFire is also compatible with Java database (JDBC) or ADO.NET.</p>
<p class="p1">As companies look for new ways to make data accessible and provide a consistent view of that information, it's important to have tools that suit the needs of all kinds of developers and IT managers.</p>
<p class="p1">VMware’s <a href="http://www.vmware.com/products/application-platform/vfabric-gemfire/overview.html">GemFire</a> and <a href="http://www.vmware.com/products/application-platform/vfabric-sqlfire/overview.html">SQLFire</a> software are designed to address just those needs - allowing companies to move beyond concerns over speed and scale and tackle Big Data applications head on.</p>
<p class="p1"><a href="http://www.vmware.com/" target="_blank"><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/vmware_300x60_contributed.jpg" style="" />
			</span>
</a></p>
                    ]]></description>
                <link>http://readwrite.com/2013/03/12/attacking-big-data-old-school-style-with-vmware-sqlfire</link>
                <guid>http://readwrite.com/2013/03/12/attacking-big-data-old-school-style-with-vmware-sqlfire</guid>
                <category>Taming Big Data</category>
                <pubDate>Tue, 12 Mar 2013 10:33:00 -0700</pubDate>
                <author></author>
            </item>
                    <item>
                <title><![CDATA[FoundationDB's NoSQL Breakthrough Challenges Relational Database Dominance]]></title>
                <description><![CDATA[
                                        <img src="http://readwrite.com/files/styles/800_450sc/public/fields/FoundationDB%20Founders.jpeg" />
                                        <p>NoSQL databases are well-known for their speed and scalability - useful traits when dealing with the size and complexity of big data and hyper-fast transaction requirements. But one thing they have lacked has been strong data consistency: the ability to ensure that an update to data in one part of the database is immediately propagated to all other parts of the database.</p>
<p>A startup database vendor launched this week is making claims that its database, <a title="http://www.foundationdb.com" href="http://www.foundationdb.com">FoundationDB</a>, finally delivers on the promise of true data consistency for a NoSQL database, without a huge loss of speed or flexibility.</p>
<p>Understanding why this is such a big deal in the Big Data (or any) sector requires a little background on how NoSQL, or non-relational, databases work.</p>
<h2>Solving The ACID Test</h2>
<p>When talking about relational databases, like PostgreSQL, MariaDB, Oracle and the like, there's one acronym that keeps coming up: ACID. ACID stands for Atomic, Consistent, Isolated and Durable - core aspects that must apply to all data within a relational database. Data is broken down to atomic values (name, address_1, city...) while remaining consistent across the database, isolated from other transactions until the current transaction is finished, and durable in the sense that the data should never be lost.</p>
<p>The infrastructure of a relational database is well-suited to meet the ACID criteria for data: Data is held in tables connected by relational algebra, and transactions are performed in a way that is consistent with ACID principles.</p>
<p>But for non-relational databases, such as Bigtable, MongoDB or Dynamo, ACID has always been sacrificed for other qualities, like speed and scalability.</p>
<p>This tends to freak out some companies, stopping them from moving to NoSQL because they can't give up ACID. Especially the "C," because not having data consistency is a particularly terrifying prospect for companies dealing with financial transactions.</p>
<p>Yet non-relational databases are being used by firms like Amazon and Google every day, with great success. Amazon, in particular, needs to track millions of sales transaction on any given day - how does it get away with inconsistent data?</p>
<p>The short answer is, it has to. The trade-off would be a relational database that could never keep up with the speed and scaling necessary to make a company like Amazon work as it does now. Recall that non-relational databases are structured to sacrifice some aspect of ACID to gain something in return. In the case of Amazon, its non-relational DynamoDB database is willing to apply an "eventually consistent" approach to the data in order to gain speed and uptime for the system when a database server somewhere goes down (though Dynamo can also have strong consistency, an Amazon spokesperson informed us after this story went to press).</p>
<h2>Bringing Back Consistency</h2>
<p>It's not that having ACID compliance on a NoSQL database is impossible, explained David Rosenthal, one of FoundationDB's co-founders. It's just that most people think that applying ACID to NoSQL systems would come at a huge cost.</p>
<p>That's certainly what Werner Vogels, CTO of Amazon, thought in a <a href="http://queue.acm.org/detail.cfm?id=1466448" target="_blank">2008 paper that described the company's Dynamo database</a>&nbsp;and it's relationship to consistency.&nbsp;</p>
<blockquote>Data inconsistency in large-scale reliable distributed systems has to be tolerated for two reasons: improving read and write performance under highly concurrent conditions; and handling partition cases where a majority model would render part of the system unavailable even though the nodes are up and running.</blockquote>
<p>Translation: Requiring ACID on non-relational databases would make that database too slow and inflexible.</p>
<p>For the longest time, everyone using NoSQL systems was resigned to this eventual, or "weak," consistency model. After all, they had money to make and data to analyze. Who cares if consistency was not at the top of the priority list?</p>
<p>It turns out, quite a few people, including the founders of FoundationDB, Rosenthal, Nick Lavezzo and Dave Scherer.</p>
<h2>Inside FoundationDB</h2>
<p>After a successful start up with Visual Sciences, a technology that's now part of Adobe as the Adobe Insight product, the trio turned to developing another successful project, and hit on the lack of ACID-capable non-relational databases as a goal.</p>
<p>"We weren't satisfied with any of the data guarantees on non-relational systems," Rosenthal explained, even as they understood that the needs of many potential clients would preclude relational systems like MySQL or Oracle because of performance limitations.</p>
<p>Non-relational systems seemed to wear their weak consistency model like a badge of honor, but in the secret origin story of FoundationDB, the team saw weak consistency as a bug, not a feature.&nbsp;"Not having transactional integrity is not a good thing," Rosenthal emphasized.</p>
<p>They're not the only ones. Google's up-and-coming <a title="http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en/us/archive/spanner-osdi2012.pdf" href="http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en/us/archive/spanner-osdi2012.pdf">Spanner database</a>, a second-generation distributed database that could ultimately replace the search engine company's Bigtable systems, is being built on the premise that transactional integrity has to be a part of that database, too.</p>
<h2>Side Effects Include...</h2>
<p>Establishing consistency in transactions within a NoSQL database is worthy news in itself, but the implications extend beyond that core news.</p>
<p>FoundationDB uses a key-value-like storage engine core that's surrounded by layers of whatever data model that's needed, which will in turn enable developers to much more easily code their apps to reach into the FoundationDB. These layers, according to the founders, can't be used on other key-value systems, because without consistent transactions, it would not work.</p>
<p>Also, since data is going to be consistent, applications won't have to be built to "wait" for data to catch up within a given transaction - thus making apps less complex and easier to build.</p>
<p>The best news of all concerns the so-called performance penalty that many in the NoSQL world said will be incurred if ACID was applied to non-relational database systems. According to FoundationDB, performance is hampered by only 10%, which seems a very small price to pay for consistent transactions.</p>
<p>The FoundationDB database, which was launched into public beta on Monday, is <a title="http://www.foundationdb.com/get/" href="http://www.foundationdb.com/get/">available for download now</a>.</p>
<p><em>Image courtesy of FoundationDB.</em></p>
                    ]]></description>
                <link>http://readwrite.com/2013/03/08/foundationdbs-nosql-breakthrough-challenges-relational-database-dominance</link>
                <guid>http://readwrite.com/2013/03/08/foundationdbs-nosql-breakthrough-challenges-relational-database-dominance</guid>
                <category>NoSQL</category>
                <pubDate>Fri, 08 Mar 2013 04:02:00 -0800</pubDate>
                <author>Brian Proffitt</author>
            </item>
                    <item>
                <title><![CDATA[VMware's Cloud-Based GemFire Makes It Easier To Work With Big Data]]></title>
                <description><![CDATA[
                                        <img src="http://readwrite.com/files/styles/800_450sc/public/fields/VMware_Gemfire.png" />
                                        <p class="p1"><a href="http://www.vmware.com/" target="_blank"><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/vmware_300x60_contributed.jpg" style="" />
			</span>
</a></p>
<p class="p1">While companies of all sizes are struggling with the growth of information overload often referred to as “Big Data,” some IT developers and database deployers are approaching the challenge with a cloud-based service designed to make accessing mass amounts of data faster.</p>
<p class="p1">In short, they turn to <a href="http://www.vmware.com/products/application-platform/vfabric-gemfire">VMware’s vFabric GemFire</a>.</p>
<p class="p1">GemFire is a distributed in-memory data grid database software product that enables data distribution, data replication and partitioning (sharding), cashing data management at the exact moment the information is needed.</p>
<p class="p1"><strong>(See also <a href="http://readwrite.com/2013/02/20/whats-next-for-taming-big-data" target="_blank">What's Next For Taming Big Data</a>.)</strong></p>
<p class="p1">While the ability to move data from server to server and replicate it to more than one location has proven invaluable over the last 10 years, today's critical challenge is how can companies manage this data properly.</p>
<p class="p1">Over the past decade, <a href="http://www.vmware.com/files/pdf/vmware-vfabric-gemfire-distributed-main-memory-platform-WP-EN.pdf">GemFire</a> has helped companies:</p>
<ul class="ul1">
<li class="li2">Maintain simultaneous data connections over long distances.</li>
<li class="li2">Protect their data from natural and man-made disasters.</li>
<li class="li2">Maintain data reliability and availability, even when server hardware periodically fails.</li>
</ul>
<p class="p1">The software is able to achieve these goals by creating an object-oriented "data fabric" across a server cluster. It accesses copies of data that are stored in various locations as needed. To ensure compatibility with the latest <a href="http://readwrite.com/2012/04/06/8-reasons-why-cloud-computing">cloud configurations</a>, the management platform can spread the data across many virtual machines and GemFire servers to manage application objects.</p>
<p class="p1">But what does that mean in the real world? To find out, it helps to look at how <a href="http://www.vmware.com/go/gemfcomm">vFabric GemFire</a> is already working in key industrial applications, how it can be developed for new projects and how it can be deployed in a business network.</p>
<h2 class="p3">Passing Military Grade</h2>
<p class="p1">Keeping connected across town or around the globe is never more important than when national security is on the line. So when the <a href="file:///Users/fpaul/Documents/Stories/U.S.%20Defense%20Information%20Systems%20Agency">U.S. Defense Information Systems Agency (DISA)</a> needed to deal with up-to-the-minute information and awareness of military actions wherever they occur, the <a href="http://www.vmware.com/files/pdf/solutions/vFabric-GemFire-fo-Defense-and-Government-Agencies.pdf">agency chose vFabric GemFire</a>.</p>
<p class="p1">GemFire provided speed and the ability to easily increase and decrease the size of projects, but also a management tool that orchestrates data delivery from the back-end data stores to the consuming applications.</p>
<p class="p1">Since 2007, DISA has used GemFire for managing massive amounts of data for the various government agencies it supports, including U.S. military commands, joint task forces and the Pentagon. And because GemFire allows for a consistent view of data across all geographies and in different clusters, the military has reliable event notification, continuous querying, parallel execution, high throughput, low latency, high scalability, continuous availability and WAN distribution</p>
<h2 class="p3">South America Calling</h2>
<p class="p1">GemFire’s expertise at the middle data tier delivers reliability and critical data redundancy that keeps the information up to date even if one part of the network goes offline.</p>
<p class="p1">Take the case of a large telecommunications company in South America that sells prepaid phone cards via kiosks. The telecom uses GemFire to enable the sale and provisioning of pre-paid cards even when disconnected from the network. Because the country’s infrastructure is not 100% reliable, sometimes network data is not updated for several hours at a time and customers might not be able to use their cards. To overcome this obstacle, the telecom uses GemFire's distributed databases to maintain up-to-the-minute information.</p>
<p class="p1">vFabric GemFire was the optimal choice for managing a distributed database in this environment because it automatically recognizes systems and moves data around so that it remains accessible even on unreliable networks.</p>
<p class="p1">As VMware product line marketing manager Blake Connell put it, “vFabric GemFire automatically spreads the data over a wide network and accommodates network disruptions.</p>
<p class="p1"><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/VMW_10Q4_DGRM_vFabric_GemFire_Architecture_R4_800x600.jpg" style="" />
			</span>
</p>
<h2 class="p3">Capturing GemFire In The Enterprise</h2>
<p class="p1"><a href="http://www.vmware.com/files/pdf/techpaper/vmw-vfabric-gemFire-best-practices-guide.pdf">vFabric GemFire</a> is best suited for new Big Data projects that require NoSQL - or distributed unstructured data - models.</p>
<p class="p1">GemFire is well-designed for latency-sensitive applications such as virtualized environments that may require interrupt-moderation or interrupt-throttling - industry terms that IT developers and database deployers use when building a system that potentially doesn't take well to lags in data flow or processing.</p>
<p class="p1">Because GemFire is designed for data distribution, data replication, caching and data management, it has special requirements. For example, GemFire suggests enabling hyperthreading and keeping at least 50% of the server’s memory space available.</p>
<p class="p1">Configuring GemFire servers and regions is optimally done with the <a href="http://www.springsource.org/">Spring</a> object-oriented programming framework. This allows developers to centralize application service configuration instead of having to deal with Spring context configuration <em>plus</em> a separate cache.xml file.</p>
<p class="p1">For those working with structured data and who are knowledgable in SQL, VMware offers a related product called&nbsp;<a href="http://vmware.com/go/sqlfire">SQLFire</a>. SQLFire is&nbsp;a distributed SQL data-management platform. SQLFire will look familiar to SQL developers thanks to a similar interface and programming framework, and it allows the management of "not only SQL" databases much the way GemFire does.</p>
<p class="p1">Look for more information on the benefits of SQLFire in an upcoming ReadWrite post.</p>
<p class="p1"><a style="text-decoration: underline;" href="http://www.vmware.com/" target="_blank"><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/vmware_300x60_contributed.jpg" style="" />
			</span>
</a></p>
<p class="p1">&nbsp;</p>
                    ]]></description>
                <link>http://readwrite.com/2013/02/27/cloud-based-gemfire-makes-it-easier-to-work-with-big-data</link>
                <guid>http://readwrite.com/2013/02/27/cloud-based-gemfire-makes-it-easier-to-work-with-big-data</guid>
                <category>Taming Big Data</category>
                <pubDate>Wed, 27 Feb 2013 10:30:00 -0800</pubDate>
                <author></author>
            </item>
                    <item>
                <title><![CDATA[What’s Next For Taming Big Data?]]></title>
                <description><![CDATA[
                                        <img src="http://readwrite.com/files/styles/800_450sc/public/fields/shutterstock_74392294_data_analytics.jpg" />
                                        <p class="p1"><a href="http://www.vmware.com/" target="_blank"><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/vmware_300x60_contributed.jpg" style="" />
			</span>
</a></p>
<p class="p1">Imagine you are walking in London and want to book a trip to New York at the last minute: You hop online to search for a hotel. The smartphone you are holding helps you find an establishment that looks appealing. You tap a few buttons to send your credit card information.</p>
<p class="p1">Unbeknownst to you, however, there is only one available room left at this hotel for the night you need. You and and another traveler, from Brazil perhaps, are vying for the same space. That person is trying to make the reservation from a laptop at his office.</p>
<ul class="ul1">
<li class="li2">Could these two transactions be handled at the exact same time?</li>
<li class="li2">Can the hotel see a consistent view of this globally distributed data?</li>
<li class="li2">Could the transactions be handled in a way that the hotel owner can identify where each request originated from?</li>
<li class="li2">Could the hotel distinguish on which device the purchase was made and any other personal information required?</li>
</ul>
<p class="p1">If the two transactions were being handled by a traditional SQL database with a siloed network configuration and little ability to scale on demand or access data in real time, the short answer would be, “No.”</p>
<p class="p1">As Jerry Chen, <a href="http://www.vmware.com/solutions/cloud-computing/index.html">VP of VMware’s Cloud and Application Services</a> noted recently: “The monolithic database of the past cannot meet the needs of modern applications.” What is needed, he added, are designs that support high-availability low-latency applications.</p>
<h2 class="p3">The Influence Of Big Data</h2>
<p class="p1">Your experience with booking a room is a perfect example of how large amounts of important, fast-moving information - commonly known as <a href="http://readwrite.com/2012/03/05/big-data">Big Data</a> - is creating problems and opportunities for customers and the companies that serve them.</p>
<p class="p1">While modern technologies provide faster access to higher volumes of both&nbsp;<span style="line-height: 1.538em;">structured and unstructured&nbsp;</span><span style="line-height: 1.538em;">information at any time of the day, making sense of the bits and pieces of unrelated data has required a radical shift in the way we organize networks and the Internet. It’s a challenge for even seasoned IT developers and database deployers.</span></p>
<p class="p1">In addition, real-world events such as the <a href="http://readwrite.com/2011/06/15/the_arab_spring_a_status_report">Arab Spring</a> and the <a href="http://readwrite.com/2011/03/14/japan_twitter_resources">earthquake and tsunami in Japan</a> have challenged engineers and developers to break through existing constraints and work wonders in hyperconnectivity using non-traditional methods.</p>
<p class="p1">Historically, companies sorted customer and transactional data in neat rows and columns of databases. These databases were built and managed by <a href="http://en.wikipedia.org/wiki/SQL">Structured Query Language (SQL)</a> and stored in isolated, or siloed, networks.</p>
<p class="p1">But traditional, hard-disk-drive-based database products forged on SQL can no longer handle the deluge of Big Data in a timely fashion. The era of Big Data calls for new strategies and new approaches to queries.</p>
<p class="p1">The limitations of traditional SQL-based databases have prompted a slew of innovative developments in this direction starting with <a href="http://en.wikipedia.org/wiki/NoSQL">NoSQL</a>&nbsp;distributed database models. These types of software platforms have caught the eye of decision-makers in the financial services, e-commerce, government and R&amp;D communities.</p>
<p class="p1">NoSQL has the capacity to answer to their need to process Big Data sets as they happen. However, many NoSQL deployments still fail to cope with scenarios like the one detailed above because they cannot provide consistent speed or scale. Additionally, many NoSQL offerings lack maturity when dealing with data that needs to span multiple datacenters across vast distances.</p>
<h2 class="p3">Other Innovations Add Support</h2>
<p class="p1">To keep up with the true volume and velocity of information from multiple sources, innovations in cloud computing are being paired with high-performance “in-memory” storage on database appliances created to address the obstacles of managing Big Data.</p>
<p class="p1">While cloud-based systems have helped develop online computing and software service delivery systems, IT developers and database deployers have adopted the cloud to take advantage of the ability to scale to globally distributed marketplace. Meanwhile, the addition of solid-state memory drives to storage devices has given companies an extra layer of quickly accessible data. These configurations allow for more rapid database queries and transactions - and thus the ability to deal with much larger data sets in real time.</p>
<p class="p1">But the raw technology is only part of the equation. No matter how powerful the exterior influences, IT developers and database deployers will always need tool sets that help them to design for their current Big Data database needs (such as our hotel booking example) as well as prepare for a future where Big Data needs to be addressed at multiple levels.</p>
<p class="p1">The tools must be customizable and easy to use - no matter the developer’s existing training. These offerings must also be easy to manage and easy to connect with new technology trends.</p>
<p class="p1">VMware’s <a href="http://www.vmware.com/products/application-platform/vfabric-gemfire/overview.html">GemFire</a> and <a href="http://www.vmware.com/products/application-platform/vfabric-sqlfire/overview.html">SQLFire</a> software are designed to address just those needs of developers and deployers.</p>
<p class="p1">GemFire is a database software product that allows for data distribution, data replication, caching and data management at the exact moment the information is needed.</p>
<p class="p1">SQLFire is a distributed data management platform which functions and performs much like GemFire, but has a user interface and programming framework familiar to developers who are more comfortable programming in SQL interface and tools.</p>
<p class="p1">Both GemFire and SQLFire help companies move beyond concerns with speed and scale and tackle Big Data applications head on.</p>
<p class="p1">&nbsp;</p>
<p class="p1"><em>Image courtesy of <a href="http://www.shutterstock.com" target="_blank">Shutterstock</a>.</em></p>
<p class="p1"><a href="http://www.vmware.com/" target="_blank"><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/vmware_300x60_contributed.jpg" style="" />
			</span>
</a></p>
                    ]]></description>
                <link>http://readwrite.com/2013/02/20/whats-next-for-taming-big-data</link>
                <guid>http://readwrite.com/2013/02/20/whats-next-for-taming-big-data</guid>
                <category>Taming Big Data</category>
                <pubDate>Wed, 20 Feb 2013 11:06:00 -0800</pubDate>
                <author></author>
            </item>
                    <item>
                <title><![CDATA[Trickle-Down Web Innovation Breathes New Life Into Enterprise IT]]></title>
                <description><![CDATA[
                                        <img src="http://readwrite.com/files/styles/800_450sc/public/files/files/2012-01-04-studioslackofinnovation-thumb.jpg" />
                                        <p>IBM spends <a href="http://ycharts.com/companies/IBM/r_and_d_expense">over $1.5B every quarter</a> in research and development (R&amp;D) expenses. SAP? Closer to <a href="http://ycharts.com/companies/SAP/r_and_d_expense">$700M</a>. Oracle? Assuming we don't count the tens of billions it spends buying other companies, its <a href="http://ycharts.com/companies/ORCL/r_and_d_expense">actual quarterly R&amp;D budget</a> comes in just over $1B - around <a href="http://ycharts.com/companies/HPQ/r_and_d_expense">$900M</a>. Microsoft, which has printed billions of dollars in profit each quarter for eons, spends more than them all, <a href="http://news.idg.no/cw/art.cfm?id=D9F9E9ED-B31B-91D3-30EB90CEA1D64447">topping $10B each year</a>.</p>
<p>And yet not one of these companies is responsible for the biggest advances in enterprise technology in the past decade. &nbsp;Cloud computing, Big Data, mobile... they're all being invented elsewhere, not by the enterprise behemoths.</p>
<p>Maybe they're doing it wrong?</p>
<h2>Lots Of R, Little D</h2>
<p>Take the cloud, for example. Microsoft claims to invest <a href="http://news.idg.no/cw/art.cfm?id=D9F9E9ED-B31B-91D3-30EB90CEA1D64447">90% of its R&amp;D budget on cloud computing</a>, but it is Amazon, Microsoft's penny-pinching, book-retailing neighbor, that sets the terms for innovation in cloud computing. Amazon launched EC2 back in 2006, when it was <a href="http://ycharts.com/companies/AMZN/r_and_d_expense">spending a measly $132M or so</a> each quarter on R&amp;D.&nbsp;</p>
<p>Even if we dismiss Amazon, where else are we seeing other cool advances in cloud computing? Netflix, for one, which just <a href="http://techblog.netflix.com/2013/01/janitor-monkey-keeping-cloud-tidy-and.html">released Janitor Monkey</a> to help Amazon Web Services (AWS) users dispose of their unused AWS resources, and the video company previously <a href="http://techblog.netflix.com/2012/07/chaos-monkey-released-into-wild.html">released Chaos Monkey</a>, which helps enterprises plan for and architect around cloud failure.</p>
<p>Notice the word I used? "Released." It means these tools were open sourced, not put out for sale. That's how innovation seems to happen in the 21st Century.</p>
<p>In large part innovation comes with an open-source license because it's a by-product of businesses that heavily rely on technology, but don't actually sell technology. &nbsp;It's 'trickle-down innovation' from the web business community.</p>
<p>VMware's <a href="http://bradhedlund.com/2011/08/19/distributed-systems-trickle-down-into-enterprise-it/">Brad Hedlund spotlighted this trend</a> back in 2011, when the enterprise awoke to discover it had problems that the web giants had already solved:</p>
<blockquote>As properties such as Yahoo!, Google, Facebook, Amazon became great successes, their architects and software engineers realized that they had moved mountains...The tremendous problems of efficiently running large scale applications on low cost infrastructure had been solved... At the very same time, enterprise IT begins to encounter some of the very same problems solved by the large web provider, such as scalable data warehousing and analytics (so called “Big Data”). Additionally, the software driven distributed systems that solve problems of infrastructure efficiency and management at very large scale could also be applied to infrastructure at a smaller enterprise IT scale (why not?). And finally, the cost savings of an application infrastructure designed to operate on low cost commodity hardware can be realized at any scale, large web or enterprise IT.</blockquote>
<h2>Filling The R&amp;D Gap</h2>
<p>Companies like Cloudera, DataStax and others stepped into this gap, taking the open source (or, in the case of some of Google's research, open knowledge) projects from the web and applying them to the enterprise in the form of Hadoop, Storm, NoSQL databases, etc. &nbsp;All of it developed at a comparative pittance to enterprise incumbents' R&amp;D budgets. &nbsp;All of it available free of charge on commodity hardware.&nbsp;</p>
<p>As an industry, we're richer for such open source innovation. &nbsp;Ironically, so are the enterprise IT vendors, who are investing tens to hundreds of millions of dollars in Hadoop and other open source data and cloud innovations, even as they continue to sink tens of billions of dollars into their homegrown R&amp;D. Maybe it's time for them to reevaluate how they do R&amp;D. &nbsp;Maybe, <a href="https://github.com/facebook">like Facebook</a>&nbsp;or <a href="http://twitter.github.com/">Twitter</a>, they should release their R&amp;D on GitHub as open-source code. &nbsp;At the least they could, <a href="http://research.google.com/pubs/papers.html">like Google</a>, centralize their research on the web, making it easily available to all.</p>
<p>Maybe, just maybe, they'd realize that it doesn't actually matter how much money a company spends on R&amp;D. &nbsp;What matters is whether it can execute and turn ideas into winning products, as <a href="http://news.yahoo.com/microsoft-huge-r-d-budget-useless-best-ideas-194130737.html">Brad Reed argues</a>, and whether it can help foster community around promising open source efforts. &nbsp;</p>
<p>This is what new enterprise IT - Facebook, Twitter, Google and Yahoo - demonstrates. &nbsp;It remains to be seen if IBM, Microsoft and other traditional IT vendors are paying attention. &nbsp;Unless they do, they'll lose relevance as a new breed of innovative startups emerge to claim the strategic largesse of CIOs' budgets.</p>
                    ]]></description>
                <link>http://readwrite.com/2013/01/07/trickle-down-web-innovation-breathing-new-life-into-enterprise-it</link>
                <guid>http://readwrite.com/2013/01/07/trickle-down-web-innovation-breathing-new-life-into-enterprise-it</guid>
                <category>enterprise IT</category>
                <pubDate>Mon, 07 Jan 2013 08:00:00 -0800</pubDate>
                <author>Matt Asay</author>
            </item>
            </channel>
</rss>

