NPTEL Cloud Computing and Distributed Systems Week 8 Assignment Answers 2025
1. Identify the correct Kafka components for the following descriptions:
P: It is the distributed, durable equivalent of Unix pipes. Use it to connect and compose your large-scale data applications.
Q: It plays the role of the commands in your Unix pipelines. Use it to transform data stored in Kafka.
R: It is the I/O redirection in your Unix pipelines. Use it to get your data into and out of Kafka.
- P: Kafka Streams, Q: Kafka Connect, R: Kafka Core
- P: Kafka Core, Q: Kafka Connect, R: Kafka Streams
- P: Kafka Streams, Q: Kafka Core, R: Kafka Connect
- P: Kafka Core, Q: Kafka Streams, R: Kafka Connect
Answer :-
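The Unix-pipeline analogy in P, Q and R can be made concrete with a toy model. The sketch below is plain Python, not the Kafka API: `topics`, `redirect_in`, `pipeline_command` and `redirect_out` are hypothetical names chosen only to mirror the three roles (the durable pipe, the command that transforms data, and the I/O redirection that moves data in and out).

```python
from collections import defaultdict

# Toy stand-in for the durable "pipe": named topics holding ordered records.
# (Hypothetical in-memory model; a real cluster persists and replicates these.)
topics = defaultdict(list)


def redirect_in(lines, topic):
    """I/O redirection, inbound: copy external data into a topic."""
    topics[topic].extend(lines)


def pipeline_command(src, dst, fn):
    """Pipeline command: read one topic, transform each record, write another."""
    topics[dst].extend(fn(record) for record in topics[src])


def redirect_out(topic):
    """I/O redirection, outbound: copy a topic's records to an external system."""
    return list(topics[topic])


redirect_in(["alpha", "beta"], "raw")
pipeline_command("raw", "upper", str.upper)
result = redirect_out("upper")
print(result)  # ['ALPHA', 'BETA']
```

Note that the transform step leaves the `raw` topic untouched, so other pipelines could still read the original records.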
2. Kafka is run as a cluster comprised of one or more servers each of which is called ___________
- cTakes
- Chunks
- Broker
- None of the mentioned
Answer :-
3. Kafka maintains feeds of messages in categories called ______________
- Chunks
- Domains
- Messages
- Topics
Answer :-
4. Each Kafka partition has one server which acts as the ____________
- Leader
- Followers
- Stater
- None of the mentioned
Answer :-
5. Which type of processing can Apache Spark handle?
- Stream Processing
- Batch Processing
- Graph Processing
- All of the mentioned
Answer :-
6. Which is not a component on top of Spark Core?
- Spark Streaming
- Spark RDD
- MLlib
- None of the mentioned
Answer :-
7. In Spark, a ______________________ is a read-only collection of objects partitioned across a set of machines that can be rebuilt if a partition is lost.
- Spark Streaming
- Resilient Distributed Dataset (RDD)
- FlatMap
- Driver
Answer :-
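The idea of a read-only, partitioned collection that can be rebuilt after a partition is lost can be illustrated with a small model. This is a minimal sketch in plain Python, not Spark's implementation or API: `ToyCollection` and its methods are hypothetical, and "losing" a partition is simulated by dropping its cached data so that it is recomputed from the recorded recipe.

```python
class ToyCollection:
    """Read-only, partitioned collection that remembers how to rebuild itself.

    Hypothetical model: each instance stores a recipe (a function from
    partition index to records) rather than relying only on stored data.
    """

    def __init__(self, compute_partition, num_partitions):
        self._compute = compute_partition  # recipe used to (re)build a partition
        self.num_partitions = num_partitions
        self._cache = {}  # materialized partitions; may be lost at any time

    @classmethod
    def from_list(cls, data, num_partitions):
        frozen = tuple(data)  # freeze the source so it cannot change later
        return cls(lambda i: list(frozen[i::num_partitions]), num_partitions)

    def map(self, fn):
        # A transformation returns a NEW collection whose recipe refers to its parent.
        parent = self
        return ToyCollection(
            lambda i: [fn(x) for x in parent.partition(i)], self.num_partitions
        )

    def partition(self, i):
        if i not in self._cache:  # missing or lost: rebuild from the recipe
            self._cache[i] = self._compute(i)
        return list(self._cache[i])  # hand out a copy to keep the data read-only

    def lose_partition(self, i):
        """Simulate a machine failure by discarding one materialized partition."""
        self._cache.pop(i, None)

    def collect(self):
        return [x for i in range(self.num_partitions) for x in self.partition(i)]


numbers = ToyCollection.from_list(range(6), num_partitions=2)
squares = numbers.map(lambda x: x * x)

before = squares.collect()
squares.lose_partition(1)  # partition 1 is gone...
after = squares.collect()  # ...and is recomputed on demand
print(before == after)  # True
```

The design choice worth noticing is that recovery needs no replica of the lost data, only the recipe and the parent collection.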
8. ______________ is a distributed machine learning framework on top of Spark. Its goal is to make practical machine learning scalable and easy.
- MLlib
- Spark Streaming
- GraphX
- RDDs
Answer :-
9. Which of the following is true about Apache Kafka?
- Kafka is a message queuing system that stores messages in queues.
- Kafka uses a distributed commit log to enable high throughput and fault tolerance.
- Kafka is a real-time data processing framework used for complex transformations.
- Kafka does not support message retention.
Answer :-
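To make the term "commit log" in the options above concrete, here is a minimal single-process sketch of an append-only log read by independent consumers. It is plain Python and does not use any Kafka client library: `CommitLog`, `offsets` and `poll` are hypothetical names, and a real system would add partitioning, replication and a retention policy.

```python
class CommitLog:
    """Append-only sequence of records, addressed by integer offset."""

    def __init__(self):
        self._records = []

    def append(self, record):
        self._records.append(record)
        return len(self._records) - 1  # offset assigned to the new record

    def read(self, offset, max_records=10):
        # Reading never removes anything; it only returns a slice of the log.
        return self._records[offset:offset + max_records]


log = CommitLog()
for event in ["a", "b", "c"]:
    log.append(event)

# Each consumer tracks its own position, so consumers do not affect each other.
offsets = {"billing": 0, "audit": 0}


def poll(consumer):
    batch = log.read(offsets[consumer])
    offsets[consumer] += len(batch)
    return batch


first = poll("billing")   # billing reads everything written so far
log.append("d")
second = poll("billing")  # billing sees only the new record
replay = poll("audit")    # audit starts from offset 0 and sees the full history
print(first, second, replay)
```

Because records stay in the log after being read, a late or restarted consumer can replay history, which a queue that deletes messages on delivery cannot offer.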
10. In Apache Spark, which of the following is true about Resilient Distributed Datasets (RDDs)?
- RDDs are immutable and cannot be modified after creation.
- RDDs are automatically partitioned across nodes without any fault tolerance.
- RDDs require explicit management of memory and storage.
- RDDs can only perform operations in a batch processing mode.
Answer :-