Using Hadoop and Big Data with VoltDB
VoltDB serves as a real-time application database used in conjunction with Hadoop and analytical results derived from Hadoop and big data in applications including real-time scoring, policy enforcement, and customer interaction. VoltDB provides the ability to ingest data as fast as it arrives; perform real-time analytics in-memory; make automated decisions in real time; and continuously pass, or export, processed data into Hadoop.
A Hadoop Pipeline with VoltDB is shown below:
VoltDB provides support for high-velocity export of processed data via a built-in, transactional extract feature. VoltDB Export feeds processed data to HDFS/Hadoop. Application developers can automate the export process by specifying tables in the schema as sources for export. At runtime, any data written to the specified tables is sent to an export connector, whose job it is to move these tuples to the export target safely and with the lowest possible latency. VoltDB provides connectors for export to files (CSV); via WebHDFS to Hadoop; via data serialization and exchange services such as Avro; and for export to other relational databases via JDBC.