Download Original PDF

Get the official Barkatullah University print version scanned document.

Download/Print

🤝 Help Your Juniors!

Have previous year question papers that aren't on our website? Help the next batch of students by sending them to us! With your consent, we will proudly feature your name as a Top Contributor on our platform.

Submit Papers 📩
Roll No. ............................
(a) Map Reduce
Total No. of Questions : 11
[Total No. of Printed Pages : 8
(b) Reduce
(c) YARN
(d) All of these

M.Sc. IVth Semester (New/ATKT)

Examination, 2022

Computer Science

Paper - MSCS-401

Big Data Analytics

Time : 3 Hours]
[Maximum Marks : 85

Note- Attempt all the questions.

SECTION - 'A'

Objective Type Questions

1×15=15

1.
Choose the correct answer :

(i) What are the main components of Big Data ?

  1. Map Reduce
  2. Reduce
  3. YARN
  4. All of these

(ii) Hadoop achieves readability by replicating the data across multiple hosts and lence does not require........storage on hosts :

  1. RAID
  2. ZFS
  3. Operating System
  4. None of these

(iii) ............has the worlds largest Hadoop cluster :

  1. Apple
  2. Datamatics
  3. Facebook
  4. None of these

(iv) Hive also support custom extensions writter is..........:

  1. C#
  2. Java

(v) ............is genes at purpose computing model and sustime system for distributed data analytics :

  1. MapReduce
  2. Drill
  3. Oozie
  4. None of these

(vi) ............is the slave/wark es node and hoeds the user data in the form of Data Blocks :

  1. Data Node
  2. Name Node
  3. Data Block
  4. Replication

(vii) HDFS is implemented in ............programming language :

  1. C++
  2. Java
  3. Scala
  4. None of these

(viii) ............is the default partitioner for partioning key space :

  1. Hash pax
  2. Partitioner
  3. Hash Partitioner
  4. None of these

(ix) ............maps input key/value pains to a set inter mediate key/value points :

  1. Mapper
  2. Reducer
  3. Both (a) and (b)
  4. None of these

(x) ............is an online NOSQL developed by cloudera :

  1. Hcatalog
  2. Hbase
  3. Imphala
  4. Oozic

(xi) What are the different features of Big Data Analytics ?

  1. Open Source
  2. Scalability
  3. Data Recovery
  4. All of above

(xii) All of the following accurately describe Hadoop, except :

  1. Oper Source
  2. Real time
  3. Java based
  4. Distributed computing approach

(xiii) ............hides the limitation of java behind a powerful and concise clojure API for cascading :

  1. Scalding
  2. Cascalog
  3. Hcatalog
  4. H calding

(xiv) Which package contains most fundamental functions to seen R ?

  1. Root
  2. Child
  3. Base
  4. Pasent

(xv) Advanced usecss can write.........code to manipulate R objects directly :

  1. C, C++
  2. C++ java
  3. Java, C
  4. Java

SECTION - 'B'

Short Answer Type Questions

5×5=25

2.

Describe any five characteristics of Big Data.

OR

Discuss any three application of Big data.

3.

What are the configuration parameters is mapreduce program ?

OR

How are bigdata and Hadoop related to each other ?

4.

Describe the components of HDFS.

OR

Explain secondary Name Node.

5.

What is NoSQL ?

OR

Compare and contrast SQL and No SQL.

SECTION - 'C'

Long Answer Type Questions

9×5=45

6.

Explain Supervised learning ?

OR

What is collaborative filtering ?

7.

Define Big data. Enlist its importance over traditional data-base system ?

OR

Why is finding similar items important in Big data ? Illustrate using two example applications.

8.

Explain different configuration fills in Hadoop ?

OR

Explain concept of Map Reduce using an example.

9.

Write short notes on Hive components with a neat diagram.

OR

Explain PIG and Zookeeper.

10.

Describe the characteristics of a xbSQL database ?

OR

Explain SPARK is data analysis ?

11.

Explain the concept of Unsupervised learning ?

OR

Explain Big Data Analysis with Big R.