Big data has no single, fixed format; much of it is unstructured. To make your big data available for analytics, you need to store it in a suitable database. Otherwise, it is not possible to carry out proper analytics on the data.
Data held in an RDBMS is structured, and this is how traditional projects store their data. Because big data arrives in many different forms, you need a different kind of database, one that can deal with unstructured data.
Data can come in any medium: text, video, audio, images, or a combination of these.
Top 4 big data databases
HDFS serves as the intermediate façade for traditional data warehouse (DW) systems.
HBase is indexed using a unique row key together with only one column family and only one column.
Traditional RDBMS systems are replaced by NoSQL alternatives to facilitate faster access and querying of big data.
Multiple types of storage mechanisms—like RDBMS, file storage, CMS, OODBMS, NoSQL and HDFS—co-exist in an enterprise to solve the big data problem.
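The HBase point in the list above, where a value is reached through a unique row key plus a column family and column, can be illustrated with a small sketch. This is a toy in-memory model, not the HBase client API: the class `ToyWideColumnTable`, its methods, and the sample row keys are all hypothetical names chosen for illustration.

```python
from bisect import bisect_left, insort


class ToyWideColumnTable:
    """Toy in-memory stand-in for a row-key-indexed table (NOT the HBase API)."""

    def __init__(self):
        self._rows = {}          # row_key -> {(family, column): value}
        self._sorted_keys = []   # row keys kept sorted so range scans are cheap

    def put(self, row_key, family, column, value):
        """Store one cell under (row_key, family, column)."""
        if row_key not in self._rows:
            self._rows[row_key] = {}
            insort(self._sorted_keys, row_key)
        self._rows[row_key][(family, column)] = value

    def get(self, row_key, family, column):
        """Return the cell value, or None if the row or cell is absent."""
        return self._rows.get(row_key, {}).get((family, column))

    def scan(self, start_key, stop_key):
        """Yield (row_key, cells) for start_key <= row_key < stop_key."""
        lo = bisect_left(self._sorted_keys, start_key)
        hi = bisect_left(self._sorted_keys, stop_key)
        for key in self._sorted_keys[lo:hi]:
            yield key, self._rows[key]


# Hypothetical sample data: the row key is the only index into the table.
table = ToyWideColumnTable()
table.put("user#001", "profile", "name", "Asha")
table.put("user#002", "profile", "name", "Ravi")
table.put("video#001", "meta", "title", "Intro")

name = table.get("user#002", "profile", "name")
user_keys = [key for key, _ in table.scan("user#", "user$")]
```

Because the row key is the only index in this model, any query that does not start from a row key (or a row-key range) would have to walk every row, which is why row-key design matters so much in stores of this kind.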
Typical storage in an analytics platform…
Let us take a look at the Hadoop analytics platform…
Data coming from an RDBMS or NoSQL store first lands in HDFS, the storage layer that the Hadoop framework understands.
In the image above, data first flows from the RDBMS/NoSQL store into HDFS. The Hadoop platform then uses this data for analysis.
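The flow described above, from a relational store into a landing area that the analytics platform reads, can be sketched roughly as follows. This is a minimal illustration under stated assumptions: an in-memory SQLite database stands in for the RDBMS, a local temporary directory stands in for an HDFS directory, and the function `export_table_to_landing_dir` and the `orders` table are hypothetical. A real cluster would use Hadoop's own ingestion tooling rather than this code.

```python
import csv
import sqlite3
import tempfile
from pathlib import Path


def export_table_to_landing_dir(conn, table_name, landing_dir):
    """Dump one table to a CSV file in landing_dir and return the file path.

    table_name is interpolated into the SQL text, so it must come from
    trusted code, never from user input.
    """
    cursor = conn.execute(f"SELECT * FROM {table_name}")
    header = [col[0] for col in cursor.description]
    out_path = Path(landing_dir) / f"{table_name}.csv"
    with open(out_path, "w", newline="") as handle:
        writer = csv.writer(handle)
        writer.writerow(header)   # column names first
        writer.writerows(cursor)  # then every row from the query
    return out_path


# Assumption: SQLite in memory plays the role of the source RDBMS.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, amount REAL)")
conn.executemany("INSERT INTO orders VALUES (?, ?)", [(1, 9.5), (2, 20.0)])

# Assumption: a local temp directory plays the role of an HDFS directory.
landing_dir = tempfile.mkdtemp()
exported = export_table_to_landing_dir(conn, "orders", landing_dir)
lines = exported.read_text().splitlines()
```

The structured rows end up as a flat, delimited file, which is the kind of format a file-based analytics platform can then read in bulk.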
Let us look at an example, HP Vertica, which has the capabilities needed to integrate data from multiple data sources for analytics.
| Hadoop Software Distribution | Cloudera, Hortonworks or MapR distribution |
| Storage | HP Vertica – RAID compliant columnar database |
| Infrastructure | HP ProLiant servers |