Apache Apex Malhar (Incubating) was recently updated to 3.3.0!
Hadoop Examples is a set of simple example scripts to illustrate Hadoop ecosystem tools like Hive and Pig.
The public repo for the Oozie Editor Plugin.
A web interface for time series and bucket storage in HBase.
Let’s deploy Apache Fuseki servers in a CDH cluster via Cloudera Manager.
A tool to verify the compatibility of Avro schemas.
Etosha aims to build a bridge between the Big Data and Linked Data domains.
The Apache Accumulo™ sorted, distributed key/value store is a robust, scalable, high performance data storage and retrieval system.
Apache Apex is a unified platform for big data stream and batch processing.
Hadoop.TS.Next.Generation … more than just mappers and reducers.
RDF store on a cloud-based architecture
Sarama is a Go library for Apache Kafka 0.8 and 0.9
Syslog Collector written in Go, streams to Kafka 0.8
Apache Apex Malhar
The Malhar repository contains an open source operator and codec library that can be used with the Apache Apex (incubating) platform to build real-time streaming applications.
The fast way of building time series processing pipelines for Hadoop using Crunch.
A library for time series analysis on Apache Spark
Let's lay out large graphs using GraphX and Spark.