A Study on the Big Data Log Analysis: Goals, Challenges, Issues, and Tools
Demands for security, availability, and performance are becoming more frequent and more sophisticated. Traditional solutions are unable to protect an organization’s assets or keep its services running smoothly, and they need to focus more on customer needs and satisfaction. Organizations need to perform real-time analysis on huge amounts of data of various types to discover anomalous fragments within a reasonable response time. By doing so, businesses can widen the scale of processed data, accelerate threat detection, keep their services up and running by monitoring server status, predict failures before they happen, and increase customer satisfaction by providing efficient service in a timely manner. Processing huge volumes of system log files with relational database technology has hit a bottleneck. Analyzing such large data sets requires a parallel processing system and a reliable data storage mechanism. Big Data technologies address these issues. The main purpose of this paper is to highlight the characteristics of Big Data and to present a review of log file analysis in a Big Data environment, as a first step towards getting the maximum benefit from Big Data in log analytics.
Keywords - Big Data, Hadoop, MapReduce, HDFS, Log Analysis, Log Files.