Weekly BigData & ML Roundup – Sep. 12, 2016

The last two weeks went by quite fast. Here comes another roundup for this week!


Clickbait Cluster
Code + Jupyter Notebooks for Visualizing Clusters of Clickbait Headlines Using Spark, Word2vec, and Plotly

Supervised Learning
Algorithms that make predictions based on a set of training data. Credit for some of the algorithms used goes to Andrew Ng.

Code for processing Avro data in Spark Streaming + Kafka (DirectKafka approach with custom offset management in ZooKeeper)


Apache Kafka client for Python; high-level & low-level consumer/producer, with great performance.

Infinispan-Spark Connector
Infinispan and Spark Connector

Readonly REST Elasticsearch Plugin
Safely expose the Elasticsearch REST API directly to the public


Torch implementation of DeepMask and SharpMask

Train a deep learning net with OpenStreetMap features and satellite imagery.

TensorFlow implementation of Human-Level Control through Deep Reinforcement Learning

You can find many more tools, frameworks, and libraries at the PocketCluster Index. Go check it out! Want to add your repo? Tweet to @stkim1!

