What is Apache Apex?
Apache Apex is a YARN-native platform that unifies stream and batch processing. It processes big data-in-motion in a way that is scalable, performant, fault-tolerant, stateful, secure, distributed, and easily operable. Apache Apex was named a top-level project by The Apache Software Foundation on April 25, 2016. Apache Apex is developed under the Apache License 2.0.
There are two parts of Apache Apex: Apex Core and Apex Malhar. Apex Core is the platform or framework for building distributed applications on Hadoop. The core Apex platform is supplemented by Malhar, a library of connector and logic functions, enabling rapid application development. These input and output operators provide templates to sources and sinks such as HDFS, S3, NFS, FTP, Kafka, ActiveMQ, RabbitMQ, JMS, Cassandra, MongoDB, Redis, HBase, CouchDB, generic JDBC, and other database connectors.
Want to know how to get started?
Video: Introduction to Apache Apex
Find content tagged apex
- Get notified of new Apache Apex content with this tag by creating a custom tag stream and adding the tag to the stream. Learn how: Follow a tag (topic) by creating a custom news stream in the community
For more information on Apache Apex, visit: Apache Apex Open Source Stream & Batch Processing Platform