Weekly Roundup – Jan. 1, 2016

This year’s first weekly roundup! Check out all the new additions to the PocketCluster Index!

Examples

Stock Inference on Spark

Stock inference engine using Spring XD, Apache Geode / GemFire, and Spark MLlib.

Stock Prediction demo

Stock Prediction demo on Spark.

Spark Drools Example

This project shows a simple example of how to integrate Drools into an Apache Spark job.

Nifi-sandbox

Sandbox for Apache NiFi.

romainr / Hadoop Tutorials Examples

Source, data and tutorials of the Hue video series, the Web UI for Apache Hadoop.

Sprue

An example of running Drools with Spark.

Libraries

SpectralLDA-TensorSpark

This code implements a spectral (third-order tensor decomposition) method for learning LDA topic models on Spark.

Sparkling-water

Sparkling Water integrates H2O’s fast scalable machine learning engine with Spark.

spring-hadoop

The Spring for Apache Hadoop project provides extensions to Spring, Spring Batch, and Spring Integration to build manageable and robust pipeline solutions around Hadoop.

Spark Druid Package

The Spark-Druid package enables Logical Plans written against a raw event dataset to be rewritten to take advantage of a Druid Index of the event data.

Toolsets

Spark kernel

The main goal of the Spark Kernel is to provide the foundation for interactive applications to connect to and use Apache Spark.

Spear

Spear is a SparkListener that maintains info about Spark jobs, stages, tasks, executors, and RDDs in MongoDB.

Grafana-spark-dashboards

Scripts for generating Grafana dashboards for monitoring Spark jobs.

Yarn-logs-helpers

Scripts for parsing and making sense of YARN logs.

Guacamole

Guacamole is a framework for variant calling, i.e. identifying DNA mutations from Next Generation Sequencing data. It currently includes a toy germline (non-cancer) variant caller as well as a somatic variant caller for finding cancer mutations. Most development effort has gone into the somatic caller so far.

Spark-ext

Spark ML transformers, estimators, Spark SQL aggregations, etc. that are missing from Apache Spark.


You can find many more tools, frameworks, and libraries at the PocketCluster Index. Go check it out! Want to add your repo? Tweet to @stkim1!
