NPTEL Cloud Computing and Distributed Systems Week 8 Assignment Answers 2025

NPTEL Cloud Computing and Distributed Systems Week 8 Assignment Answers 2025

1. Identify the correct Kafka commands for the following:

P: It is the distributed, durable equivalent of Unix pipes. Use it to connect and compose your large-scale data applications.

Q: These are the commands of your Unix pipelines. Use it to transform data stored in Kafka.

R: It is the I/O redirection in your Unix pipelines. Use it to get your data into and out of Kafka.

  • P: Kafka Streams, Q: Kafka Connect, R: Kafka Core
  • P: Kafka Core, Q: Kafka Connect, R: Kafka Stream
  • P: Kafka Streams, Q: Kafka Core, R: Kafka Connect
  • P: Kafka Core, Q: Kafka Streams, R: Kafka Connect
Answer :- For Answers Click Here 

2. Kafka is run as a cluster comprised of one or more servers each of which is called ___________

  • cTakes
  • Chunks
  • Broker
  • None of the mentioned
Answer :- 

3. Kafka maintains feeds of messages in categories called______________

  • Chunks
  • Domains
  • Messages
  • Topics
Answer :- 

4. Each Kafka partition has one server which acts as the ____________

  • Leader
  • Followers
  • Stater
  • None of the mentioned
Answer :- 

5. Which type of processing Apache Spark can handle ?

  • Stream Processing
  • Batch Processing
  • Graph Processing
  • All of the Mentioned
Answer :- 

6. Which is not a component on the top of Spark Core ?

  • Spark Streaming
  • Spark RDD
  • MLlib
  • None of the mentioned
Answer :- For Answers Click Here 

7. In Spark, a ______________________is a read-only collection of objects partitioned across a set of machines that can be rebuilt if a partition is lost.

  • Spark Streaming
  • Resilient Distributed Dataset (RDD)
  • FlatMap
  • Driver
Answer :- 

8. ______________is a distributed machine learning framework on top of Spark. Its goal is to make practical machine learning scalable and easy.

  • MLlib
  • Spark Streaming
  • GraphX
  • RDDs
Answer :- 

9. Which of the following is true about Apache Kafka?

  • Kafka is a message queuing system that stores messages in queues
  • Kafka uses a distributed commit log to enable high throughput and fault tolerance.
  • Kafka is a real-time data processing framework used for complex transformations.
  • Kafka does not support message retention.
Answer :- 

10. In Apache Spark, which of the following is true about Resilient Distributed Datasets (RDDs)?

  • RDDs are immutable and cannot be modified after creation.
  • RDDs are automatically partitioned across nodes without any fault tolerance.
  • RDDs require explicit management of memory and storage.
  • RDDs can only perform operations in a batch processing mode.
Answer :- For Answers Click Here 
Scroll to Top