Weekly BigData & ML Roundup – Dec. 29, 2016

Apache Edgent, an analytics framework for edge devices, has recently reached the 1.0.0 milestone. Also, Facebook has recently released Beringei, a new storage engine built specifically for time-series data.

Since this is the last round-up of the year, I’d like to take this opportunity to thank you for visiting, subscribing, and reaching out to me with suggestions and encouragement. The weekly round-up will continue next year. Stay tuned!

Examples

NBA Player Movement
Visualization and analysis of NBA player tracking data

Google YouTube-8M
Starter code for working with the YouTube-8M dataset

Toolsets

Sergeant
Tools to transform and query data with Apache Drill from R, via its REST API, a JDBC interface, and dplyr and DBI interfaces

Content Data Store
A system that provides storage for massive data sets in the form of images, PDFs, documents, and scanned documents

Sematext Solr-Researcher
Solr SearchComponent for altering and re-executing queries that produce poor results

Naniar
Tools for numerical and visual summaries of missing values (NAs) in R

Models

ByteNet
A TensorFlow implementation of French-to-English machine translation using DeepMind’s ByteNet

Neural Painter
Paint artistic patterns using a random neural network

VAE-Clustering
Unsupervised clustering with (Gaussian mixture) VAEs

Libraries

Fregata
A lightweight, super-fast, large-scale machine learning library on Apache Spark

Intel pWord2Vec
Parallelizing word2vec in shared and distributed memory

Cortex
Machine learning in Clojure

Tulip Indicators
A library of functions for technical analysis of financial data

Genann
Simple neural network library in ANSI C

Frameworks

Facebook Beringei (incubating)
A high performance, in-memory storage engine for time series data.

Apache Edgent (incubating)
An open source stream processing programming model and lightweight micro-kernel style runtime for edge devices that enables you to analyze data and events at the device


*https://pocketcluster.wordpress.com will move to https://blog.pocketcluster.io on Jan 1, 2017. If you have subscribed to this blog, please make sure to change the feed address.

Looking to add your repo? Any suggestions or comments? Send your feedback to stkim1@pocketcluster.io or tweet @stkim1!

Looking for more BigData or Machine Learning repositories? You can find a lot more tools, frameworks and libraries at PocketCluster Index.

