Source document: Scientific papers of the University of Pardubice. Series D, Faculty of Economics and Administration. 33/2015
ISSN:1211-555X (Print)
Abstract:
Public sector institutions nowadays maintain large amounts of data from various domains. This data is a potential resource that businesses and citizens can use to enrich their own datasets or to develop new products and public services. Open data supports the emergence and realization of the potential of big data. While it increases the volume and velocity of available data, its main impact is on the variety of data sources. This paper deals with the idea of deploying Virtual Hadoop for processing open big data in the public sector. The first part of the paper is based on a literature review of cloud computing, distributed data processing, big / open / linked data, and their sources on the web. The primary aim of the Virtual Hadoop deployment is to test performance efficiency using open big data in order to set the direction of future research. The last part presents the most important findings and recommendations.