Lazily
Eagerly
SparkContext (sc)
SparkSession (spark)
Both a and b
None of the above
To get the current status of Spark Application
To set the configuration
To Access various services
Entry point to Spark SQL
4
3
In-memory computing
Development in Scala
Batch Processing
Stream Processing
Graph Processing
Interactive Processing
All of the above
RDD is a database
RDD is a distributed data structure
RDD is a programming paradigm
None
Reduce
Map
Filter
None
df.printSchema()
df.show()
df.take
Printschema
The DataFrame API has provision for compile-time type safety.
DataFrames provide a more user-friendly API than RDDs.
Both a and b
None of the above
Error
44
40
Eagerly
Lazily
More than one
Only one
Not specific
None of the above
RDD
Datasets
Java
Scala
Python
All of the above
Supports in-memory computation
Fault-tolerance
It is cost efficient
Compatible with other file storage systems
The data type of the variable has to be mentioned explicitly
The type of the variable is determined by looking at its value.
Process the data as per the specified logic
Compilation error
Won't be executed
None
Python
Java
Red
Green
Error
10.01
9.99
Error
Hello
Pretty
Error
Map
Filter
Join
All of the above
All the transformations and actions are lazily evaluated
Execution starts with the call of Action
Execution starts with the call of Transformation
Scheduling
Distributing data across cluster
Monitoring data across cluster
All of the above