Weekly Roundup – Feb. 7, 2016

Let’s start February’s second week with a weekly digest!

Examples

Example-spark-kafka
Apache Spark and Apache Kafka integration example

Spark-RDD-dataframe-dataset
Spark example code demonstrating the RDD, DataFrame and Dataset APIs.

Spark-Terasort
Spark Terasort.

HBase-RDD-examples
HBase RDD example project

HBase-observer-examples
Examples of HBase observer implementations.

TweetAnalysisWithSpark
Tweet Analysis with Spark

 

Frameworks

Apache Falcon
Falcon is a feed processing and feed management system aimed at making it easier for end consumers to onboard their feed processing and feed management on Hadoop clusters.

SnappyData
SnappyData: OLTP + OLAP Database built on Apache Spark

Snappy-Store
SnappyStore is a row-oriented, transactional, main-memory distributed data store designed for applications with demanding scalability and availability requirements.

 

Libraries

Hive-jdbc-storage-handler
Hive storage handler implementation to query databases over JDBC.

Koonkie
Hadoop framework for metagenomic processing

Snappy-Spark
Apache Spark with SnappyData extensions.

Elastalert
Easy & Flexible Alerting With Elasticsearch

HBase-RDD
Spark RDD to read and write from HBase

Kafka-Reactive-Streams
A lightweight pure-Java Reactive Streams-compliant connector for Kafka.

Solr-Redis
Solr Redis Extensions

Spark-SQL-on-HBase
Native, optimized access to HBase data through Spark SQL/DataFrame interfaces

 

Toolsets

h-rider
h-rider is a UI application that provides an easier way to view or manipulate the data stored in HBase™.

Apache-HTrace (incubator)
HTrace is a tracing framework for use with distributed systems.

Spawncamping-dds
Data-Driven Spark allows quick data exploration based on Apache Spark.

Hadoop-pcap
Hadoop library to read packet capture (PCAP) files.

HivePanelExplorer
An interactive and explorative visualization tool in D3 based on hive plots built for complex networks.

 

Misc

HadoopInternals
Several diagrams describing the internals of Apache Hadoop 2.3.0 or later.

 


You can find many more tools, frameworks and libraries at PocketCluster Index. Go check it out! Looking to add your repo? Tweet to @stkim1!
