Go through the docs and quickstart for Amazon Kinesis, a managed streaming platform on AWS. In particular, read this case study on streaming social media data.

Go through the docs and quickstart for Google Cloud Pub/Sub, a managed messaging service commonly used for streaming ingestion.
Go through the docs and quickstart for Apache Flume, an open-source service for collecting, aggregating, and moving streaming log data.

Repeat the Spark Streaming recommendation example using Spark MLlib instead of scikit-surprise.
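
As a possible starting point for the MLlib exercise, here is a minimal sketch. The original recommendation example is not reproduced here, so several things are assumptions: records are taken to arrive as "user,item,rating" text lines, and the names `Rating`, `parse_rating`, and `train_als` are hypothetical helpers introduced only for illustration. The one MLlib piece used is the `ALS` estimator from `pyspark.ml.recommendation`, which takes the place of the scikit-surprise model.

```python
from typing import List, NamedTuple, Optional


class Rating(NamedTuple):
    """One explicit-feedback record (hypothetical layout: user, item, rating)."""
    user_id: int
    item_id: int
    rating: float


def parse_rating(line: str, sep: str = ",") -> Optional[Rating]:
    """Parse one 'user,item,rating' line; return None if it is malformed."""
    parts = line.strip().split(sep)
    if len(parts) < 3:
        return None
    try:
        return Rating(int(parts[0]), int(parts[1]), float(parts[2]))
    except ValueError:
        # Non-numeric field: skip the record instead of failing the stream.
        return None


def train_als(spark, ratings: List[Rating], rank: int = 10,
              max_iter: int = 10, reg_param: float = 0.1):
    """Fit an MLlib ALS model on a list of parsed ratings.

    `spark` is an existing SparkSession. The hyperparameter defaults are
    placeholders for illustration, not tuned values.
    """
    # Imported lazily so parse_rating can be used without Spark installed.
    from pyspark.ml.recommendation import ALS

    df = spark.createDataFrame(ratings, schema=["user_id", "item_id", "rating"])
    als = ALS(
        userCol="user_id",
        itemCol="item_id",
        ratingCol="rating",
        rank=rank,
        maxIter=max_iter,
        regParam=reg_param,
        coldStartStrategy="drop",  # drop predictions for unseen users/items
    )
    return als.fit(df)
```

One way to use this, assuming a running SparkSession named `spark`: parse a historical batch with `parse_rating`, drop the `None` results, call `train_als(spark, ratings)`, and then apply `model.transform(batch_df)` or `model.recommendForAllUsers(5)` to each incoming micro-batch. Training offline and only scoring inside the stream is a design choice made here for simplicity; the original example may be structured differently.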