Store Data and Analyze Data

Laveena Jethani
3 min readSep 16, 2020

--

How Facebook, Google, LinkedIn, Instagram, Medium etc. are storing data .And providing good features to world.

Social Media is one of the platform for sharing knowledge,idea,thoughts .We share our thoughts and ideas using many forms like article,image,word file,excel file,videos etc…

We mostly use Google,Facebook,Instagram,whatsapp,Medium,LinkedIn Netflix etc.For showing our data to public.Subscriber of all this companies are increased.

Social Media Apps

We can take an example of student.A student in this current pandemic situation share everyday approx 3 assignments on google classroom.Every day giving attendance using comments.Everyday posting one blogs on Medium.Sharing Instagram post . Sharing snaps to their friends by Snapchat .Using Facebook for daily news update etc..And doing many activities depend on data sharing.Data is increasing day by day as social media used by every people.

We can say that data is back bone for running business.

So data is producing by peoples is not small it is huge data and this data is big problem for all industries.And this huge data is called Big Data. All social media platform is storing data using networking . All data we store it stores on server very securely . Security is most important factor while storing data.

PROBLEM →BIG DATA

This data I collected from some article →

# 500 million tweets are sent

# 294 billion emails are sent

# 4 petabytes of data are created on Facebook

# 4 terabytes of data are created from each connected car

# 65 billion messages are sent on WhatsApp

# 5 billion searches are made

And many more.Per day we are producing huge data.

WHAT ABOUT STORE THIS DATA AND PROCESS THIS DATA??

Take an example I want to store 1GB data it takes 1min .If I want to store 1TB data it takes 1000min to store data.But it is long time to store data.And long time for retrieving also needed.Here role of big data comes.

There are four feature of BIGDATA →

Volume

There are many form of data generate:

  • Generated from hospitals keeping record all patients,doctors,nurses ,medical staff etc.
  • By social media .
  • By google drive ,drop box.
  • By organization.etc..

The data is generated in KBs,MBs,GBs,TBs,PBs etc.. Facebook per day genrate 4PBs of data.

Velocity

Input and Output of data.

Example - If I make one post on LinkedIn.And how fast it stored and how fast it is processed and retrieve by other.

Speed of data.

Variety

Data is in many format.Data type like cvs,excel,video , song ,text ,pdf,photos etc..This is called variety of data.

SOLUTION → DISTRIBUTED STORAGE

Storing data not only on one single PC. storing data on multiple PCs.

Example- If I have 1GB file and I want to store this file not in traditional format.We want to store this using distributed storage. So We distribute file size and store this distributed data on multiple PCs.And if we want to retrieve data we can retrieve fast because retrieve of data not from one PC it is from different PCs.Storing and retrieve of data is more faster than traditional way because in traditional way we store data continuous at one place .

Thank You

--

--

Laveena Jethani
Laveena Jethani

Written by Laveena Jethani

Technical Blog Writer | Research & Review different technologies | ARTH learner

No responses yet