Home COPS: A New Consistency Model for Wide-Area Storage

COPS: A New Consistency Model for Wide-Area Storage

Here’s a little Thanksgiving reading for you while you’re digesting that fourth helping of pie: a paper on a new approach to geo-replicated, distributed data stores (PDF). The paper, authored by researchers from Princeton University, Intel Labs, and Carnegie Mellon University, proposes a new consistency model for wide-area storage.

The idea is to provide an infrastructure that can support major Internet services. That might entail tens of data centers, each one having a set of application servers and data stores for those systems.

As the paper notes “a distributed storage system has multiple, sometimes competing, goals: availability, low latency, and partition tolerance to provide an ‘always on’ user experience; scalability to adapt to increasing load and storage demands; and a supportciently strong consistency model to simplify programming and provide users with the system behavior that they expect.”

The system that is being proposed, is called Clusters of Order-Preserving Servers (COPS). According to the post:

Our system, COPS, is a key-value store that delivers this consistency model across the wide-area. A key contribution of COPS is its scalability, which can enforce causal dependencies between keys stored across an entire cluster, rather than a single server like previous systems. Scaling the size of each cluster brings causal consistency to a new set of applications, such as social networks like Facebook and Twitter, whose data is far too large and query rates far too high to be handled by single machine clusters.

The central approach in COPS involves explicitly tracking and enforcing causal dependencies between updates. For instance, if you upload a photo and add it to an album, the album update “depends on” the photo addition, and should only be applied after it. Writes in COPS are accepted by a local datacenter that then propagates them to other, remote, datacenters. These remote datacenters check that all dependencies are satisfied by querying other nodes in the cluster before applying writes. This approach differs from traditional causal systems that exchange update logs between replicas. In particular, the COPS approach avoids any single serialization point to collect, transmit, merge, or apply logs. Avoiding single serialization points is a major factor in enabling COPS to scale to large cluster sizes.

The paper is 16 pages, and full of detail. The High Scalability blog has an interesting summary of the paper. At first glance I’d say that we’re going to be hearing a lot more about ALPS and COPS.

About ReadWrite’s Editorial Process

The ReadWrite Editorial policy involves closely monitoring the tech industry for major developments, new product launches, AI breakthroughs, video game releases and other newsworthy events. Editors assign relevant stories to staff writers or freelance contributors with expertise in each particular topic area. Before publication, articles go through a rigorous round of editing for accuracy, clarity, and to ensure adherence to ReadWrite's style guidelines.

Get the biggest tech headlines of the day delivered to your inbox

    By signing up, you agree to our Terms and Privacy Policy. Unsubscribe anytime.

    Tech News

    Explore the latest in tech with our Tech News. We cut through the noise for concise, relevant updates, keeping you informed about the rapidly evolving tech landscape with curated content that separates signal from noise.

    In-Depth Tech Stories

    Explore tech impact in In-Depth Stories. Narrative data journalism offers comprehensive analyses, revealing stories behind data. Understand industry trends for a deeper perspective on tech's intricate relationships with society.

    Expert Reviews

    Empower decisions with Expert Reviews, merging industry expertise and insightful analysis. Delve into tech intricacies, get the best deals, and stay ahead with our trustworthy guide to navigating the ever-changing tech market.