Apache Spark Streaming with Kafka and Cassandra Apache Spark 1.2 with PySpark (Spark Python API) Wordcount using CDH5 Apache Spark 1.2 Streaming Apache Drill with ZooKeeper install on Ubuntu 16.04 - Embedded & Distributed Apache Drill - Query File System, JSON, and Parquet Apache Drill -...
Welcome to Tempo: timeseries manipulation for Spark. This project builds upon the capabilities ofPySparkto provide a suite of abstractions and functions that make operations on timeseries data easier and highly scalable. NOTEthat the Scala version of Tempo is now deprecated and no longer in develop...