Page 1 of 1

Integrating with Apache Spark

Posted: Mon Oct 02, 2017 6:23 pm
by JimKnicely
The Vertica Connector for Apache Spark is a fast parallel connector that allows you to use Apache Spark for pre-processing data. Apache Spark is an open-source, general purpose, cluster-computing framework. The Spark framework is based on Resilient Distributed Datasets (RDDs), which are logical collections of data partitioned across machines.

Continue reading here:
https://my.vertica.com/blog/integrating-apache-spark/