<?xml version="1.0" encoding="UTF-8" ?>
<rss xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">
        <channel>
        <title>Sam Charrington - ReadWrite</title>
        <link>http://readwrite.com</link>
        <description />
        <language>en</language>
        <copyright>Copyright 2012 SAY Media, Inc.</copyright>
        <managingEditor>readwriteweb@gmail.com</managingEditor>
        <docs>http://blogs.law.harvard.edu/tech/rss</docs> 
        <lastBuildDate>Mon, 27 Feb 2012 05:33:00 -0800</lastBuildDate>
        <atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://rww.superfeedr.com/" />

                    <item>
                <title><![CDATA[Three New Tools Bring Machine Learning Insights to the Masses]]></title>
                <description><![CDATA[
                                        <p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/hack/images/data-scientist_photo_1210.jpeg" style="" />
			</span>
Over the past few years, machine learning has quickly become the "secret sauce" of large-scale web sites. Machine learning systems have historically been hand-crafted by the small armies of computer science and mathematics Ph.D.s in employ at places like Google. With the growing popularity of machine learning and other statistical techniques, the demand for so-called "data scientists" (software developers and analysts with the skill to apply statistical techniques to large data sets) has exploded since 2010. </p>

<p>As a result, these rarefied skills have become extremely difficult to find and expensive to retain, driving up the cost of machine learning systems and making it difficult for enterprises and smaller web firms to apply the technology. In the data scientist talent shortage is opportunity, however, and a new breed of software platform is rising to meet this need. Building upon the low-level big data infrastructure now available, these new platforms seek to democratize machine learning and advanced analytics, making their benefits available to enterprises and firms who either can't afford or can't find enough PhDs and data scientists. The first of this coming wave of machine learning-powered platforms is launching at this week's O'Reilly Strata conference. Here are three companies leading the way. </p>
<h2>Skytree</h2>
<a href="http://www.skytreecorp.com/">Skytree Server</a> is a software product aimed at allowing users to very quickly deploy highly accurate and very fast machine learning systems. The idea behind Skytree Server is to disrupt the typical development cycle of modeling machine learning systems in a high-level tool like R or Matlab, and then coding them up using Python or C for deployment in order to achieve an acceptable level of performance. 
<div class="super-pullquote"><em> Sam Charrington is the principal of <a href="http://cloudpul.se/blog">CloudPulse Strategies</a>, an analyst and consulting firm focusing exclusively on cloud computing, big data and related technologies and markets. He can be followed on Twitter at <a href="http://twitter.com/samcharrington">@samcharrington</a>.</em></div>
Skytree Server is itself a back-end system designed to be put into production and called on by a number of front-end client APIs (command-line, Python, Java and R, currently). Data is passed to Skytree Server typically as CSV files. One of the strengths of the Skytree approach is that the user has direct control over which machine learning algorithms are applied. The server has built-in support for the most popular algorithms, such as support vector machines, nearest neighbor, k-means and more. The implication, however, is that the user must be pretty savvy about statistics and machine learning in order to use Skytree. (Not to mention the need for IT support.) 

<p>This is not to diminish the Skytree value proposition in the least. By analogy, if the skills required to use machine learning are akin to knowing how to drive, and the skills required to build a production machine learning system are akin to knowing how to build a car, the Amazons and Ebays of the world have built their machine learning 'vehicles' from the tires up, while what Skytree does is allow you to drive a Ferrari (a) without knowing how to build one and (b) on a Kia budget. Skytree Server is priced on a subscription basis, starting at $2,999 per year for up to 4 cores. It is also available as a Free Edition, which has the ability to process up to 100,000 rows of data.</p>

<h2>BigML</h2>
<a href="https://bigml.com/">BigML</a> was founded a year ago with the vision of creating "ML for the rest of us." With that in mind, they've created a cloud-based offering targeted at business users that dramatically lowers the barriers to performing machine learning analysis. BigML users typically begin an analysis by uploading a data set in text format. The service offers a wizard-based approach to formatting and cleaning up data, backed by some sophisticated pattern matching, aimed at making sure the system can tolerate real-world (read, "messy") data. One or more columns in the data can be denoted as prediction targets, which the tool will use to train a predictive model. 

<p>Once the model has been generated, additional data can be fed into the system and the model will be used to make predictions about the prediction targets. BigML currently only supports decision tree models for machine learning. While this may be a limitation for true aficionados, the company argues that the decision tree technique is powerful because it can handle a wide variety of data types, is particularly intuitive and lends itself to visual representation, is a great place to start if you don't already know what kind of analysis to apply, and is easy to scale. <br />
<a href="http://www.readwriteweb.com/hack/bigml-screenshot.png"><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/hack/assets_c/2012/02/bigml-screenshot-thumb-597x432-39031.png" style="" />
			</span>
</a><br />
Continuing our driving analogy, BigML offers an easy-to-drive family sedan that appears at your driveway when you need it, takes you where you want to go, and presents helpful guidance on how you got there, ensuring that you're never lost. BigML is priced using a credit-based system at $0.05 per credit, with the number of credits required based on the size of your data, the size of your model, and the number of predictions you need to make.<br />
<h2>Precog</h2><br />
<a href="http://www.precog.io/">Precog</a>, still in stealth at the time of this writing, aims to offer a developer-focused platform for "data-driven, insightful, intelligent applications." Of the three companies profiled here, Precog seems to want most clearly to be a PaaS for machine learning, and takes a very interesting approach with its platform. Precog envisions a usage model in which users "Capture" data by explicitly (via a REST API) or implicitly (via an adapter) sending it to the Precog service, "Enrich" data by mashing it up with public and partner-provided datasets, "Analyze" the data using a variety of machine learning techniques, and "Act" on it in their own applications or by pushing it to third party systems. </p>

<p>At the heart of Precog are a scalable, custom-built analytics database coupled with a high-level analytics API that allows users to perform a variety of analyses by name (e.g. "optimize", "cluster", or "predict"), without getting bogged down by the details of which algorithm is best. In this way, Precog is probably most analogous to the kit cars you could order from the back of Popular Mechanics in the 80s, offering those willing to get their hands a bit dirty a way to create unique and customized high-performance vehicle without needing to engineer the engine and frame from scratch. Oh, and you can rent it, a la SaaS. Precog will become available this week to select alpha users, with a private beta expected shortly thereafter.The product is offered by ReportGrid, a year-old company initially focused on providing sexy embeddable analytics reports for SaaS companies.</p>

<p>The three companies profiled here represent very distinct approaches to making 'machine learning as a service' a reality, and I expect we'll see many more such offerings in the coming months.</p>
                    ]]></description>
                <link>http://readwrite.com/2012/02/27/three-new-tools-bring-machine</link>
                <guid>http://readwrite.com/2012/02/27/three-new-tools-bring-machine</guid>
                <category>Big data</category>
                <pubDate>Mon, 27 Feb 2012 05:33:00 -0800</pubDate>
                <author>Sam Charrington</author>
            </item>
                    <item>
                <title><![CDATA[The Disintegration of PaaS]]></title>
                <description><![CDATA[
                                        <span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/cloud/assets_c/2011/04/CloudFoundry-thumb-150x83-29149.jpg" style="" />
			</span>
In <a href="http://www.readwriteweb.com/cloud/2012/01/paas-makes-progress-in-2011.php">PaaS Makes Progress in 2011</a>, I argued that the previous 12 months had been pivotal to the advancement of platform-as-a-service. As a result of this fast-paced evolution, the PaaS of 2012 is quite a different beast than that of just a couple of years ago. While this second-generation PaaS differs in many ways from initial forays in the field, one of the most important distinctions is that this new PaaS has been <em>disintegrated</em>, or at least made more modular.</p>
<p>Before you run off thinking I'm advocating the destruction of PaaS platforms, please realize that I am not. Rather, I'm referring to the shift away from monolithic, one-size-fits-all PaaS systems towards more open, loosely coupled platforms that makes it easy to consume code and services provided by third parties.</p> 

<p>Early PaaS offerings, circa 2007-2009, were conceived of as all-in-one affairs. In fact, a big part of the value proposition that providers envisioned was its delivery via proprietary services and custom APIs that developers would use in their applications. Examples include App Engine's data store and Memcache services, the Force.com data store, the distributed cache and storage systems we built at Appistry and many more.</p>

<p>The fact is, the early players in the space had little choice but to roll their own. At the time, there were critical gaps in the market that needed to be filled in order for developers on PaaS platforms to be able to deliver rich, scalable applications. Fortunately for PaaS users, this is no longer the case for most application-level services.</p> 

<div class="super-pullquote"><em> Sam Charrington is the principal of <a href="http://cloudpul.se/blog">CloudPulse Strategies</a>, an analyst and consulting firm focusing exclusively on cloud computing, big data and related technologies and markets. He can be followed on Twitter at <a href="http://twitter.com/samcharrington">@samcharrington</a>.</em></div>

<h2>REST Assured, We've Got Git</h2>

<p>In order to build modern, scalable, connected web applications, developers must have access to a wide variety of third-party components and services upon which to build. With the proliferation of open source and SaaS services, these are now readily available on the open market. While both open source and SaaS predated the earliest PaaS offerings, in recent years the advent of GitHub and the popularity of REST-based Web services has played a significant role in expanding the selection of building blocks available to developers.</p> 

<p>GitHub, by dramatically lowering the barriers to collaborating on and sharing open source projects, has become an "App Store" of sorts for developers, and is home to <a href="http://www.readwriteweb.com/archives/github_hits_1_million_users.php">over one million projects</a>. Likewise, the popularization of Roy Fielding's REST model for web service APIs has simplified developer access to the many application- and app-infrastructure-oriented SaaS services now available. It's now possible to store files, query and analyze data, send emails, create maps, subscribe to messages, encode videos, and much more, just by sending simple HTTP-based commands. (If you've never visited the ProgrammableWeb <a href="http://www.programmableweb.com/apis">API Directory</a>, the selection will blow your mind.)</p>

<p>This <a href="http://en.wikipedia.org/wiki/Cambrian_explosion">Cambrian explosion</a> of high quality components and services has made web application development a much more productive affair for developers. And because the  market has removed the burden of providing these low-level building blocks, those PaaS providers ready to embrace openness stand to gain great advantage.</p>

<h2>Modular PaaS is Better for Providers, Too</h2>

<p>While the end-user benefits of an open approach to PaaS, namely increased choice and reduced lock-in are apparent, the advantages of a modular approach to PaaS are two-sided, benefiting providers at least as much as users. This is because, as a PaaS provider, it's simply too hard to deliver both a solid application platform <em>and</em> the services that plug into it. For most businesses, such a thing would spread their development resources too thinly, even if they had the necessary domain expertise, which most don't. In addition, because the open source and SaaS genies have left their bottles, trying to do it all puts the provider at odds with their customers.</p>

<p>By building and offering an open platform able to easily consume third-party components and services, and by cultivating a thriving ecosystem of the tools' providers, second-generation PaaS vendors can improve their own chances of success while creating a better world for their users.</p>



                    ]]></description>
                <link>http://readwrite.com/2012/02/07/the-disintegration-of-paas</link>
                <guid>http://readwrite.com/2012/02/07/the-disintegration-of-paas</guid>
                <category>Cloud Providers</category>
                <pubDate>Tue, 07 Feb 2012 02:02:00 -0800</pubDate>
                <author>Sam Charrington</author>
            </item>
                    <item>
                <title><![CDATA[PaaS Makes Progress in 2011]]></title>
                <description><![CDATA[
                                        <p><span class="embedded-Media-image img-caption-c">
				<img src="http://readwrite.com/files/files/files/cloud/assets_c/2011/04/CloudFoundry-thumb-150x83-29149.jpg" style="" />
			</span>
While Platform-as-a-Service (PaaS) has always had its cheerleaders - yours truly included - the harsh reality is that, commercially speaking, PaaS offerings have <a href="http://www.readwriteweb.com/cloud/2011/11/forrester-vdi-paas-technologie.php">underperformed relative to expectations</a> for several years running. This is particularly the case among enterprises, which have, by and large, turned a blind eye to the technology.<br />
</p>
<p>Past performance notwithstanding, many industry watchers have predicted 2012 to be the breakout year for PaaS in the enterprise. <a href="http://www.cmswire.com/cms/information-management/gartner-looks-to-the-cloud-with-application-architecture-summit-gartneraadi-013664.php">Gartner, for example, reportedly communicated at its November Application Architecture</a> Development & Integration conference its belief that 2012 marks the beginning of a rise in PaaS adoption from almost zero (3% of enterprises) to nearly half of all enterprises (43%) in 2015.  </p>

<p>While it remains to be seen whether 2012 goes down in the history books as the year PaaS makes good, much of the groundwork for PaaS' predicted success was laid in 2011. Here are some trends from the past year:</p>

<h2>Heroku Beyond Ruby</h2>

<p><a href="http://www.readwriteweb.com/enterprise/2010/12/salesforcecom-to-buy-heroku-fo.php">Salesforce.com's acquisition of Heroku for $250M</a>, an estimated 50-100x revenue, while announced in December 2010, set the stage for a year of brisk investment in PaaS. In January, on the heels of the acquisition, we saw a flurry of Heroku investments and product launches including PHPFog (PHP), Gondor.io (Python/Django), Nodejitsu (Node.js) and CloudBees (Java). With this activity, the reach of PaaS was significantly broadened.</p>

<h2>The Rise of Polyglot Platforms</h2>

<p>The early flood of single-language PaaS platforms gave way to a move towards multi-language platforms later in the year, perhaps precipitated by <a href="http://www.readwriteweb.com/cloud/2011/06/dotcloud-comes-out-of-beta.php">DotCloud's entrance in the market</a> with a vision of "One Platform, Any Stack." Established providers like Red Hat OpenShift and Heroku broadly expanded platform support, while the aforementioned PHPFog relaunched as AppFog with a new multi-language platform.</p>

<div class="super-pullquote"><em> Sam Charrington is the principal of <a href="http://cloudpul.se/blog">CloudPulse Strategies</a>, an analyst and consulting firm focusing exclusively on cloud computing, big data and related technologies and markets. He can be followed on Twitter at @samcharrington.</em></div>

<p>The rise of so-called polyglot PaaS platforms, <a href="http://www.saasblogs.com/general-technology/a-language-a-day-keeps-value-away-getting-paas-right/">while derided by some as stifling innovation</a>, is significant in that it marks a departure from early "one size fits few" approaches to PaaS, towards something more flexible, familiar, accommodating, and with a bit less lock-in... Just what the enterprise user is looking for.</p>

<h2>Enter Cloud Foundry</h2>

<p>VMware's <a href="http://www.readwriteweb.com/cloud/tag/cloud+foundry">Cloud Foundry, which launched in April of 2011 and covered extensively in ReadWriteCloud</a>, was not the first multi-language PaaS. Nor was it the first open-source PaaS, or the first PaaS backed by a major player in enterprise IT. It wasn't the first PaaS to be readily deployable in both public and private cloud environments. Nor was it the first PaaS to embrace the power of an extended ecosystem of developers and partners. </p>

<p>What made Cloud Foundry a game-changer in 2011 is the fact that it was the first PaaS to offer all of those things--on your laptop, in your data center, or in the cloud.</p>

<h2>PaaS Ecosystems Flourish</h2>

<p>Last but not least, the developer services ecosystems that have formed around the major offerings were expanded greatly in 2011. Users of these platforms can now easily add a wide variety of services such as caching, messaging and databases (SQL and NoSQL) to their applications. These ecosystems are powerful in that they simultaneously have made PaaS platforms more productive for developers and more profitable for providers, while they have reduced the threat of lock-in for users. </p>

<p>The role of these ecosystems is key, and I'm planning to explore this topic further in a future article. </p>

<p>In the meantime, I'll continue rooting for the success of PaaS customers and providers, with full knowledge that they are building upon solid foundations laid throughout the last year.</p>
                    ]]></description>
                <link>http://readwrite.com/2012/01/23/paas-makes-progress-in-2011</link>
                <guid>http://readwrite.com/2012/01/23/paas-makes-progress-in-2011</guid>
                <category>Analysis</category>
                <pubDate>Mon, 23 Jan 2012 04:00:00 -0800</pubDate>
                <author>Sam Charrington</author>
            </item>
            </channel>
</rss>

