Overview of the Cloud Computing for Big Data
Abstract
In the last few years, there has been a growing interest in Big Data technologies with cloud Infrastructure. Hadoop software tool is an open source platform that allows for the distributed processing of huge data sets across different clusters, which handles high volumes of the structured data, semi-structured data and unstructured data from various sources. Hadoop environment is scalable, and Hadoop adds new nodes without changing data formats or the application. The main objective of this research lies in summarizing the cloud computing and big data technologies, providing details of the most common infrastructures that have been developed, discussing several big data processing technologies and the presenting key problems of big data processing and the cloud computing platform. Finally, the open issues and challenges are introduced and research directions in the future on big data processing are explored in cloud computing environments. Therefore cloud computing can be considered as an attractive technology platform for developing and deploying big data, and it has a good future.
Collections
- Engineering [45]