Prerequisite – Introduction to Hadoop, Apache Hive. The major components of Hive and their interaction with Hadoop are demonstrated in the figure below and al
With the phenomenal growth in digital data, particularly data generated from multimedia and other enterprise applications, the need for high-performance storage solut
ODBMS, an abbreviation for object-oriented database management system, is based on a data model in which data is stored in the form of objects, which are insta
Prerequisites – Introduction to Hadoop, Computing Platforms and Technologies Apache Hive is a data warehouse and an ETL tool which provides an SQL-like interf
Prerequisites – Introduction to Hadoop, Apache HBase HBase architecture has 3 main components: HMaster, Region Server, Zookeeper. Figure – Architecture of H
Prerequisite – Introduction to Hadoop. HBase is a data model that is similar to Google’s Bigtable. It is an open source, distributed database developed by A
Hive: Hive is a data warehousing package built on top of Hadoop. It is mainly used for data analysis. It is generally targeted at users already comfortable w
A system in which each server is an autonomous, centralized DBMS that has its own local users. The term Federated Database System, or FDS for short, is basically u
Relational Database Management System (RDBMS) – RDBMS is the basis for SQL and for all modern database systems like MS SQL Server, IBM DB2, Oracle, MySQL, and Microsof
Given the vast increase in the volume and speed of threats to databases and many information assets, research efforts need to be devoted to the following issues s
A distributed database is basically a database that is not limited to one system; it is spread over different sites, i.e., on multiple computers or over a networ
Inverted Index: It is a data structure that stores a mapping from words to documents or sets of documents, i.e., it directs you from a word to a document. Steps to build Inv
Following questions have been asked in GATE 2009 CS exam. 1) Consider two transactions T1 and T2, and four schedules S1, S2, S3, S4 of T1 and T2 as given below:
Following questions have been asked in GATE 2005 CS exam. 1) Which one of the following statements about normal forms is FALSE? (a) BCNF is stricter than 3NF (b
There are many characteristics of biological data. All these characteristics make the management of biological information a particularly challenging problem. H
Distributed databases bring the advantages of distributed computing to the database management domain. Basically, we can define a Distributed dat
Web 1.0 – Web 1.0 refers to the first stage of the World Wide Web evolution. Earlier, there were only a few content creators in Web 1.0, with the huge majority o
In the past ten years, there has been a rapid increase in the development of the GIS field. Due to this growing interest, new applications will continue to present new ch
Semantic heterogeneity occurs when a schema or data set for the same domain is developed by independent parties, which leads to differences in meaning, inter
Data management technology that can support easy data access from and to mobile devices is among the main concerns in mobile information systems. Mobile computi
Following Questions have been asked in GATE 2012 exam. 1) Which of the following statements are TRUE about an SQL query? P: An SQL query can contain a HAVING cl
Big Data includes huge volume, high velocity, and an extensible variety of data. There are 3 types: Structured data, Semi-structured data, and Unstructured data. S
Addressing Modes – The term addressing modes refers to the way in which the operand of an instruction is specified. The addressing mode specifies a rule for in
Following Questions have been asked in GATE 2011 exam. 1. Consider a relational table with a single record for each registered student with the following attrib
Following questions have been asked in GATE 2008 CS exam. 1) Let R and S be two relations with the following schema R (P,Q,R1,R2,R3) S (P,Q,S1,S2) Where {P, Q}