Weekly BigData & ML Roundup – Sep. 12, 2016

Last two weeks went by quite fast. Here comes another roundup for this week!

Examples

Clickbait Cluster
Code + Jupyter Notebooks for Visualizing Clusters of Clickbait Headlines Using Spark, Word2vec, and Plotly

Supervised Learning
Predictions based on a set of training data. Credits for some of the algorithms used go to Andrew Ng.

Avro-SparkStreaming-Kafka
Code for processing AVRO data in Spark Streaming + Kafka (DirectKafka approach with custom offset management in zookeeper)

Libraries

PyKafka
Apache Kafka client for Python; high-level & low-level consumer/producer, with great performance.

Infinispan-Spark Connector
Infinispan and Spark Connector

Readonly REST Elasticsearch Plugin
Safely expose Elasticsearch REST API directly to the public

Models

DeepMask
Torch implementation of DeepMask and SharpMask

DeepOSM
Train a deep learning net with OpenStreetMap features and satellite imagery.

DQN-tensorflow
Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning


You can find a lot more tools, frameworks and libraries at PocketCluster Index. Go check it out! Looking into adding your repo? tweet to @stkim1!

E-mail Subscribtion
Subscribe for upcoming posts!
Join Slack
Join the channel!

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s