Showing posts with label opensource. Show all posts
Showing posts with label opensource. Show all posts

Saturday, September 6, 2014

What is Big data?

Big data is omnipresent in this era . We don't realize  the amount of data we generate on every single day. We all possess a smartphone. We are all connected to different social networking sites,blogs,video portal. We share,like,comment . Every text,video,image which we share , what contributes to big data. You will feel overwhelmed , if you will start to go through the facts and figures  around big data. Unless it gets analyzed , it actually does not make any sense.  And when it gets analyzed , organisations any industry get benefits out of it like never before.To analyze this amount of data the  existing hardware and software are not good enough to handle  this vast amount of data which get generated with high speed in so much variety. To process, store, analyze and manage big data with the current traditional data tools is like overburdening and exhausting the current system. As those tools were not developed , having such scale of data in mind. So we need fresh thoughts, fresh ideas what we will help us to have a smooth transition into this era of big data.

Sources of Big data:

Social networking site : Facebook, LinkedIn, Yahoo, Google, and specific-interest social or travel sites
Machine log data : web site tracking information, application logs, and sensor data
Public Web : Government,weather,traffic, Bank



Definition from different sources:

  IBM:
       Big Data spans 3 dimensions : variety, velocity and volume,
  Teradata:
      Big Data means different analytics, data structure and diversity,
  EMC:
      Big data is more than just data volume; it includes data velocity, data variety and data   complexity.
  Wikipedia:
      Big data is an all-encompassing term for any collection of data sets so large and complex that it becomes difficult to process using on-hand data management tools or traditional data processing applications.
  McKinsey:
      Datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze
      
      
     


More updates are coming ...........