Exercises

  1. Go through the docs and quickstart for AWS Kinesis. This is a managed streaming platform. In particular, read this case study on streaming social media data.

    kinesis

  2. Go through the docs and quickstart for Google PubSub. This is a managed streaming platform.

  3. Go through the docs and quickstart for Apache Flume. This is an open-source streaming solution.

    flume

  4. Repeat the spark streaming recommendation example using MLlib instead of scikit-surprise.