期刊名称:International Journal of Innovative Research in Computer and Communication Engineering
印刷版ISSN:2320-9798
电子版ISSN:2320-9801
出版年度:2015
卷号:3
期号:2
DOI:10.15680/ijircce.2015.0302032
出版社:S&S Publications
摘要:Flow analysis of the internet traffic elucidates the sequence and pattern of the traffic in the network.This helps the network administrator to monitor the operations going on in the network, to understand the networkusage and to examine the behaviour of the user using the network. Analysis of the internet traffic can avoid a hugeamount of problems. Flow analysis helps in fault tolerance, traffic engineering, resource allocation and networkcapacity planning. Due to the fast growing network, the volume of the traffic is getting very big day by day. So it isvery difficult to collect, store and analyse this huge data on a single machine. Hadoop is a leading framework which isdesigned to execute tremendous datasets that can be of hundreds of terabytes and even petabytes of data. Hadoopperforms brute force scan for multiple traces of input data and produces the output for traffic flow identification, flowclustering. In this paper a Hadoop based traffic analysis of the internet traffic is done. Here the system accepts a largeamount of packets coming from various networks, the input is appended to the Hadoop Distributed File System(HDFS) and finally processing is done through an approach called MapReduce. Once the output is obtained it isgraphically shown to the network operators and a detailed analysis is done on the internet traffic.