Big data refers to data sets that are so large and complex that traditional applications and technologies cannot process them.

Today, big data is generated every day from many sources: social networks, organizations, IoT devices, scientific studies, online transactions, and more.

Big data is measured in terabytes, petabytes, exabytes, and zettabytes.
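To give a feel for these units, here is a minimal sketch in Python. It assumes decimal (SI) units, where each unit is 1000 times the previous one; the helper names bytes_in and humanize are made up for this illustration.

```python
# Decimal (SI) storage units: each one is 1000 times the previous one.
# Assumption: SI units, not the binary (1024-based) variants.
UNITS = [
    "bytes", "kilobytes", "megabytes", "gigabytes",
    "terabytes", "petabytes", "exabytes", "zettabytes",
]


def bytes_in(unit: str) -> int:
    """Return how many bytes make up one of the given unit."""
    return 1000 ** UNITS.index(unit)


def humanize(num_bytes: int) -> str:
    """Express a byte count in the largest unit that keeps the value below 1000."""
    value = float(num_bytes)
    for unit in UNITS:
        if value < 1000 or unit == UNITS[-1]:
            return f"{value:.1f} {unit}"
        value /= 1000


print(bytes_in("terabytes"))   # number of bytes in one terabyte
print(humanize(5 * 10**15))    # a 5-petabyte data set
```

One zettabyte is a billion terabytes, which is why data at this scale cannot be handled on a single machine.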

Analyzing this data generates new data in turn, which also becomes part of big data. The results of the analysis help in making real-time decisions in areas such as consumer research, real-time traffic statistics, crime prevention, fraud detection, and real-time map generation.

Examples of big data: Walmart processes 10 million transactions per hour, and Facebook generates 500 terabytes of data per day while processing and analyzing 17 petabytes of data each day.
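The figures above are easier to picture as per-second rates. The sketch below only does the arithmetic on the numbers quoted in this article; it assumes decimal terabytes (10^12 bytes) and an even load across the day, and the variable names are made up for this illustration.

```python
SECONDS_PER_HOUR = 60 * 60
SECONDS_PER_DAY = 24 * SECONDS_PER_HOUR
TERABYTE = 10**12  # assumption: decimal terabyte
GIGABYTE = 10**9


def per_second(total: float, period_seconds: int) -> float:
    """Average rate per second, assuming the load is spread evenly."""
    return total / period_seconds


# 10 million transactions per hour (figure quoted above)
walmart_tps = per_second(10_000_000, SECONDS_PER_HOUR)

# 500 terabytes generated per day (figure quoted above)
facebook_gb_per_s = per_second(500 * TERABYTE, SECONDS_PER_DAY) / GIGABYTE

print(f"Walmart: about {walmart_tps:.0f} transactions per second")
print(f"Facebook: about {facebook_gb_per_s:.2f} GB generated per second")
```

Real traffic is bursty rather than even, so peak rates would be higher than these averages.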

Who generates big data?

Users

Users generate very large amounts of data from smartphones and the internet, through social networks, online transactions performed every second, and uploads of their own data.

Sensors and devices

Sensors are everywhere: in IoT devices, smart cities, and RFID tags. They generate continuous streams of data, which add up to big data.

Organizations

Organizations such as businesses, scientific institutions, and universities generate big data.

Challenges of big data

Big data is generated every day through billing systems, websites, customer relationship management (CRM) systems, RFID tags, sensors, IoT devices, social networks, crowdsourcing, and so on.

These sources raise many challenges: how the data is captured, how it is stored, how it is analyzed and searched, how it is visualized, and how it is cleaned for further processing.

Types of data in big data

There are three categories of big data:

Structured data

Structured data is data that is represented and stored in a defined order or format, such as rows and columns.

Examples of structured data include data in databases, sensor data, web logs, and other machine-generated data. Human-generated structured data includes names, ages, dates of birth, and so on. Today, about 20% of big data is structured.
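As a small illustration of rows and columns, the sketch below stores a few records in an in-memory SQLite table using Python's standard sqlite3 module. The table name, column names, and sample people are all made up for this example.

```python
import sqlite3

# In-memory database: nothing is written to disk.
conn = sqlite3.connect(":memory:")

# Every row must fit the same fixed set of columns. That is the "structure".
conn.execute(
    "CREATE TABLE people (name TEXT, age INTEGER, date_of_birth TEXT)"
)
conn.executemany(
    "INSERT INTO people VALUES (?, ?, ?)",
    [
        ("Asha", 34, "1990-02-11"),  # hypothetical sample rows
        ("Ben", 27, "1997-08-30"),
    ],
)

# Because the layout is known in advance, querying by column is simple.
older = conn.execute(
    "SELECT name FROM people WHERE age > ? ORDER BY name", (30,)
).fetchall()
print(older)

conn.close()
```

The fixed schema is what makes structured data easy to search and analyze with traditional tools.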

Unstructured data

Unstructured data is data that is not stored or represented in a defined order, as it would be in a database. Today, approximately 80% of big data is unstructured. Types of unstructured data include text, images, videos, and PDF files. It is either machine generated, for example by satellites, or human generated, for example through social networks, ATMs, and so on.
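Because unstructured data has no predefined fields, any structure has to be extracted from it first. The sketch below shows one simple way this might look for a piece of free text, using only Python's standard library; the sample post is invented for this example.

```python
import re
from collections import Counter

# A hypothetical social network post: free text with no rows or columns.
post = "Loving the new phone! Battery life is great, camera is great too. #newphone"

# Split the text into lowercase words and count how often each one occurs.
words = re.findall(r"[a-z']+", post.lower())
counts = Counter(words)

# Pull out hashtags, one of the few regular patterns in the text.
hashtags = re.findall(r"#(\w+)", post)

print(counts.most_common(3))
print(hashtags)
```

Images and videos need far heavier processing than this, which is part of why unstructured data is harder to analyze.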

Semi-structured data

Semi-structured data is data that is partly structured and partly unstructured. Examples of semi-structured data are spreadsheet files, XML files, JSON, and NoSQL databases.
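JSON is a convenient way to see what "partly structured" means in practice. In the sketch below, each record has named keys, but the records do not all share the same keys. The sample records are made up for this example, and the parsing uses Python's standard json module.

```python
import json

# Two hypothetical records: both have a "name", but otherwise differ.
raw = """
[
  {"name": "Asha", "age": 34, "tags": ["admin"]},
  {"name": "Ben", "address": {"city": "Pune"}}
]
"""

records = json.loads(raw)

# The keys give some structure, but there is no fixed schema:
# collect every key that appears in at least one record.
all_keys = sorted({key for record in records for key in record})

# A missing field has to be handled explicitly, here with a default of None.
ages = [record.get("age") for record in records]

print(all_keys)
print(ages)
```

Code that reads semi-structured data has to cope with missing or nested fields, unlike a table where every row has the same columns.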