NPTEL Cloud Computing and Distributed Systems Week 8 Assignment Answers 2025

1. Identify the correct Kafka commands for the following:

P: It is the distributed, durable equivalent of Unix pipes. Use it to connect and compose your large-scale data applications.

Q: These are the commands of your Unix pipelines. Use it to transform data stored in Kafka.

R: It is the I/O redirection in your Unix pipelines. Use it to get your data into and out of Kafka.

P: Kafka Streams, Q: Kafka Connect, R: Kafka Core
P: Kafka Core, Q: Kafka Connect, R: Kafka Stream
P: Kafka Streams, Q: Kafka Core, R: Kafka Connect
P: Kafka Core, Q: Kafka Streams, R: Kafka Connect

Answer :- For Answers Click Here

2. Kafka is run as a cluster comprised of one or more servers each of which is called ___________

cTakes
Chunks
Broker
None of the mentioned

Answer :-

3. Kafka maintains feeds of messages in categories called______________

Chunks
Domains
Messages
Topics

Answer :-

4. Each Kafka partition has one server which acts as the ____________

Leader
Followers
Stater
None of the mentioned

Answer :-

5. Which type of processing Apache Spark can handle ?

Stream Processing
Batch Processing
Graph Processing
All of the Mentioned

Answer :-

6. Which is not a component on the top of Spark Core ?

Spark Streaming
Spark RDD
MLlib
None of the mentioned

Answer :- For Answers Click Here

7. In Spark, a ______________________is a read-only collection of objects partitioned across a set of machines that can be rebuilt if a partition is lost.

Spark Streaming
Resilient Distributed Dataset (RDD)
FlatMap
Driver

Answer :-

8. ______________is a distributed machine learning framework on top of Spark. Its goal is to make practical machine learning scalable and easy.

MLlib
Spark Streaming
GraphX
RDDs

Answer :-

9. Which of the following is true about Apache Kafka?

Kafka is a message queuing system that stores messages in queues
Kafka uses a distributed commit log to enable high throughput and fault tolerance.
Kafka is a real-time data processing framework used for complex transformations.
Kafka does not support message retention.

Answer :-

10. In Apache Spark, which of the following is true about Resilient Distributed Datasets (RDDs)?

RDDs are immutable and cannot be modified after creation.
RDDs are automatically partitioned across nodes without any fault tolerance.
RDDs require explicit management of memory and storage.
RDDs can only perform operations in a batch processing mode.

Answer :- For Answers Click Here

NPTEL Cloud Computing and Distributed Systems Week 8 Assignment Answers 2025

NPTEL Cloud Computing and Distributed Systems Week 8 Assignment Answers 2025

Related Posts