Data management is one of the key issues in cloud computing. This dissertation proposes a peer-to-peer file system, called Peeraid, for clouds structured as a collection of peer nodes provided by different participants around the world. Peeraid is highly scalable because data object and node lookups are carried out by a distributed hash table (DHT). User files are stored as erasure-coded shares to achieve high data availability at a lower storage cost. Peeraid also supports distributed group management in a fully decentralized manner, and exploits group locality and network locality to improve access performance. In addition, write operations are efficiently logged to support fast updates.
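As an illustration of how a DHT-style lookup can map data objects to nodes, the following minimal sketch uses consistent hashing on a single identifier ring. It is not Peeraid's actual lookup protocol: the names `ring_id` and `Ring` and the node labels are hypothetical, SHA-1 is an assumed hash function, and the whole node list is held in one process, whereas a real DHT distributes this routing state across the peers.

```python
import bisect
import hashlib


def ring_id(name: str) -> int:
    """Map a name to a position on a 2**160 identifier ring using SHA-1."""
    return int.from_bytes(hashlib.sha1(name.encode("utf-8")).digest(), "big")


class Ring:
    """Toy identifier ring: each key is owned by its successor node."""

    def __init__(self, node_names):
        if not node_names:
            raise ValueError("the ring needs at least one node")
        # Sort nodes by ring position so a lookup is a binary search.
        self._entries = sorted((ring_id(n), n) for n in node_names)
        self._ids = [node_id for node_id, _ in self._entries]

    def lookup(self, key: str) -> str:
        """Return the node whose identifier is the first at or after the key's."""
        idx = bisect.bisect_left(self._ids, ring_id(key))
        # Wrap around to the first node when the key lies past the last one.
        return self._entries[idx % len(self._entries)][1]


if __name__ == "__main__":
    ring = Ring(["node-a", "node-b", "node-c"])
    print(ring.lookup("video.mp4"))
```

Because a key is owned by the first node at or after its ring position, removing a node reassigns only the keys that node owned; all other keys keep their owner.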
Cloud computing is a new style of computing in which dynamically scalable and reconfigurable resources are provided as a service over the Internet. Developers no longer need large capital outlays in hardware to deploy their services, or the human expense to operate them. They can simply lease the hardware and system software in the datacenters from cloud providers in a pay-as-you-go manner. [1]
Although cloud computing simplifies developers' efforts to provide new services, managing huge volumes of data efficiently in the datacenters becomes one of the most fundamental issues for cloud providers. In constructing a cloud datacenter, service availability, data confidentiality, and storage scalability have been identified as three major concerns. [2]
Moreover, most cloud providers have defined their own computation model, storage model, networking model, and even proprietary APIs. This proprietary cloud computing model is not always suitable for developers who want to develop portable services or to experiment with new services that have specific resource requirements. [3]
To motivate more developers and small service providers to develop innovative applications and services, we believe an open cloud computing model is needed to construct a collaborative platform for experimentation and deployment. This open computing model can be structured as a collection of peer nodes supplied by different organizations or participants around the world. In such open clouds, each participant contributes some nodes and takes advantage of the others' nodes to develop and deploy their services in a share-as-you-go manner. [4]
Figure 1-1 An open cloud model.
In this dissertation, we propose a peer-to-peer file system, Peeraid, to tackle the storage and data management problems of open cloud systems. We make the following assumptions about the file system:
• The system is aimed at storing large files (from megabytes to gigabytes), such as long documents or media files (photos, images, videos, etc.).
• The contents of files are rarely modified.
• Users do not change file paths arbitrarily. [5]
1.1 Cloud computing
Cloud computing has recently become one of the most popular research topics in computer science. It combines several important concepts, such as software as a service (SaaS), utility computing, and pay-as-you-go pricing, and provides “anytime, anywhere” services for end users.