Let’s get started February’s 2nd week with a weekly digest!
Apache Spark and Apache Kafka integration example
Spark example code demonstrating RDD, DataFrame and DataSet APIs.
HBase RDD example project
Examples of HBase observers implementations.
Tweet Analysis with Spark
Falcon is a feed processing and feed management system aimed at making it easier for end consumers to onboard their feed processing and feed management on hadoop clusters.
SnappyData: OLTP + OLAP Database built on Apache Spark
SnappyStore is a row oriented, transactional, main-memory distributed data store that is designed for applications that have demanding scalability and availability requirements.
Hive storage handler implementation to query databases over JDBC.
Hadoop framework for metagenomic processing
Apache Spark with SnappyData extensions.
Easy & Flexible Alerting With ElasticSearch
Spark RDD to read and write from HBase
A lightweight pure-Java Reactive Streams-compliant connector for Kafka.
Solr Redis Extensions
Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces
The h-rider is a UI application that provides an easier way to view or manipulate the data saved in the HBase™.
HTrace is a tracing framework for use with distributed systems.
Data-Driven Spark allows quick data exploration based on Apache Spark.
Hadoop library to read packet capture (PCAP) files.
An interactive and explorative visualization tool in D3 based on hive plots built for complex networks.
Several diagrams describing Apache Hadoop internals 2.3.0 or later.
Subscribe for upcoming posts!
Join the channel!