Intelligent Apache Spark Test

10 Questions | Total Attempts: 1523

Spark is a registered trademark of the Apache Software Foundation and one of the most widely known frameworks for cluster computing. Let's see how much you know about Apache Spark.


Questions and Answers
  • 1. 
    Which of these languages is NOT supported by Spark for developing big data applications?
    • A. 

      Python

    • B. 

      Java

    • C. 

      Scala

    • D. 

      Groovy

  • 2. 
    How can you use Spark to access and analyze data stored in Cassandra databases?
    • A. 

      By using Spark Special Keys

    • B. 

      By using Scala

    • C. 

      By using Sparse Vector

    • D. 

      By using Spark Cassandra Connector

  • 3. 
    What does RDD stand for?
    • A. 

      Resilient Distinctive Datasets

    • B. 

      Resilient Diagonal databases

    • C. 

      Resilient Distributed Datasets

    • D. 

      Responsive Distributed Databases

  • 4. 
    Which of these describes RDDs?
    • A. 

      Mutable

    • B. 

      Immutable

    • C. 

      Positive

    • D. 

      Negative

  • 5. 
    How many cluster managers does Spark support?
    • A. 

      1

    • B. 

      2

    • C. 

      3

    • D. 

      4

  • 6. 
    Which of the following is not a Spark cluster manager?
    • A. 

      YARN

    • B. 

      Standalone deployment

    • C. 

      Groovy

    • D. 

      Apache Mesos

  • 7. 
    To connect Spark with Mesos, what must the location of the Spark binary package be with respect to Mesos?
    • A. 

      Close

    • B. 

      Far

    • C. 

      Accessible

    • D. 

      Inaccessible

  • 8. 
    What is the representation of dependencies between RDDs called?
    • A. 

      Graph

    • B. 

      Quadratic graph

    • C. 

      Quadratic graph

    • D. 

      Lineage graph

  • 9. 
    What do you trigger by setting the ‘spark.cleaner.ttl’ parameter?
    • A. 

      Automatic delete

    • B. 

      Automatic cleanup

    • C. 

      Automatic recovery

    • D. 

      Automatic recycling

  • 10. 
    Which of these is described as a sequence of Resilient Distributed Datasets that represents a stream of data?
    • A. 

      DStream

    • B. 

      YARN

    • C. 

      HDFS

    • D. 

      BlinkDB
