As you might know, MySQL introduces Document Store (via JSON field) from version 5.7 and version 8.0. Since currently, it is in an early stage, the performance is not really good compared to its direct competitor: MongoDB. However, this is really a good move for MySQL after their introduction of MySQL […]
Big Data
NameNode (NN) NN holds the metadata of the files in HDFS, maintains the entire metadata in RAM It is important to run NN from a machine that has lots of RAM at its disposal. The higher the number of files in HDFS, the higher the consumption of RAM In case the […]
This will add a quick note on how to install necessary Hadoop ecosystem by Installing Apache BigTop for Hadoop Development. Remember that we need a 64-bit OS to be able to run Big Data project. First, add a new repo. The latest BigTop repo (0.8.0) is located at http://www.apache.org/dist/bigtop/bigtop-0.8.0/repos/ and we just […]
The first part of this article is copied from http://www.if-not-true-then-false.com/2010/install-mongodb-on-fedora-centos-red-hat-rhel/ since we will not reinvent the wheels 🙂 What is MongoDB? MongoDB (from “humongous”) is a scalable, high-performance, open source, schema-free, document-oriented database. Written in C++. MongoDB bridges the gap between key-value stores (which are fast and highly scalable) and traditional RDBMS […]
This article describe an example of using MySQL as a NoSQL in which the application can exceed 750,000 qps on a commodity server. This is originated from http://yoshinorimatsunobu.blogspot.com/2010/10/using-mysql-as-nosql-story-for.html UPDATE: Oracle officially released memcached daemon plugin that talks with InnoDB. I’m glad to see that NoSQL+MySQL has become an official solution. It’s still preview release […]
NoSQL databases are often compared by various non-functional criteria, such as scalability, performance, and consistency. This aspect of NoSQL is well-studied both in practice and theory because specific non-functional properties are often the main justification for NoSQL usage and fundamental results on distributed systems like the CAP theorem apply well to NoSQL systems. […]
In 2010, when the world became enchanted by the capabilities of cloud systems and new databases designed to serve them, a group of researchers from Yahoo decided to look intoNoSQL. They developed the YCSB framework to assess the performance of new tools and find the best cases for their use. […]
Whenever analysts or journalists assemble lists of the top trends for this year, “big data” is almost certain to be on the list. While the catchphrase is fairly new, in one sense, big data isn’t really a new concept. Computers have always worked with large and growing sets of data, […]
1. Quick Comparison While SQL databases are insanely useful tools, their monopoly in the last decades is coming to an end. And it’s just time: I can’t even count the things that were forced into relational databases, but never really fitted them. (That being said, relational databases will always be […]