Weekly BigData & ML Roundup – Aug. 31, 2017

Examples

Awesome Time Series in Python
This curated list contains python packages for time series analysis

Awesome Core ML Models
Largest list of Apple Core ML models

The Nuts and Bolts of Deep RL Research
Hacks for training RL systems from John Schulman’s lecture at Deep RL Bootcamp

Deep Learning Frameworks
Demo of running NNs across different frameworks

Random Forest Explainer
A set of tools to understand what is happening inside a Random Forest

Oracle
Scratch Implementations of Major Machine Learning Algorithms

NeuroBlast
A classic arcade space shooter with ML-powered AI

 

Toolsets

Torch2CoreML
Torch7 -> CoreML

Cruise Control for Apache Kafka
A fully automate the dynamic workload rebalancer and self-healing of a kafka cluster

Doctor Kafka
A service for Kafka cluster auto healing and workload balancing

KSQL
A Streaming SQL Engine for Apache Kafka

Fashion-MNIST
A MNIST-like fashion product database for benchmark.

 

Models

Tensorflow Generative Model Collections
Collection of generative models in Tensorflow

ARC PyTorch
PyTorch implementation of Attentive Recurrent Comparators

ResNet 1K Layers
Deep Residual Networks with 1K Layers

Libraries

Kafka Node
Node.js client for Apache Kafka 0.8 and later.

Pytorch-C++
Pytorch C++ Library

ANNetGPGPU
A GPU (CUDA) based Artificial Neural Network library


1000+ tools, frameworks and libraries indexed at PocketCluster Index!
Looking into adding your repo? tweet to @stkim1!

E-mail Subscribtion
Subscribe for upcoming posts!
Join Slack
Join the channel!

Weekly BigData & ML Roundup – Aug. 24, 2017

Examples

GOT Book #6
For those tired of waiting for the next GOT book to come out, RNN trained on the first five GOT books

Pytorch Tutorial
Quick PyTorch introduction and tutorial. Targets computer vision, graphics and machine learning researchers eager to try a new framework.

Machine Learning Mindmap / Cheatsheet
A mindmap summarising Machine Learning concepts, from Data Analysis to Deep Learning.

ReactionRNN
Python module + R package to predict the reactions to a given text using a pretrained recurrent neural network.

Machine Learning Flappy Bird
Machine Learning for Flappy Bird using Neural Network and Genetic Algorithm

Toolset

nivo
nivo provides a rich set of dataviz components, built on top of the awesome d3 and Reactjs libraries

Model

SMASH
An experimental One-Shot Model Architecture Search through HyperNetworks

Libraries

Frugal
An extension of Apache Thrift which provides support for request headers, request multiplexing, thread safety, and code-generated pub/sub APIs

Synaptic
Architecture-free neural network library for node.js and the browser

SimpleDNN
A lightweight library written in Kotlin to support the development of feed-forward and recurrent Artificial Neural Networks.

TVM
End to end Tensor IR/DSL stack for deploying deep learning workloads to hardwares

Framework

Apache Bookkeeper
A scalable, fault tolerant and low latency storage service optimized for append-only workloads.


1000+ tools, frameworks and libraries indexed at PocketCluster Index!
Looking into adding your repo? tweet to @stkim1!

E-mail Subscribtion
Subscribe for upcoming posts!
Join Slack
Join the channel!

Weekly BigData & ML Roundup – Aug. 10, 2017

Examples

Machine Learning Links and Lessons Learned
List of all the lessons learned, best practices, and links from the author’s time studying machine learning

DeepMind PySC2
StarCraft II Learning Environment

FaceNet
FaceNet is for easy face recognition, verification, and clustering in JavaScript/Node.js.

textgenrnn
Python module to easily generate text using a pretrained character-based recurrent neural network.

Toolsets

stick-bug-ml
A framework to organize the process of designing supervised machine learning systems

Keras Weight Animator
Save keras weight matrices as short animated videos during training

Jupyter + Angular2
The jupyter angular2 stack for creating wire-frame UX.

Models

RL-Teacher
Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback

EAST
A tensorflow implemention of EAST text detector

Segmenty
Training convnets to segment visual patterns without annotated data

Libraries

CommAI-env
A platform for developing AI systems as described in A Roadmap towards Machine Intelligence

TiSpark
A thin layer built for running Apache Spark on top of TiDB/TiKV

 


1000+ tools, frameworks and libraries indexed at PocketCluster Index!
Looking into adding your repo? tweet to @stkim1!

E-mail Subscribtion
Subscribe for upcoming posts!
Join Slack
Join the channel!

Weekly BigData & ML Roundup – Aug. 3, 2017

Examples

Jpeg Defense
A simple jpeg defense for the OpenAI attack

Unsupervised
Applying unsupervised learning using K-means clustering

Awesome Data Science
An open source Data Science repository to learn and apply towards solving real world problems

Deep Learning Models for QA
Keras DL models to answer 8th grade science multiple choice questions (Kaggle AllenAI competition)

Models

Go Perceptron
A single level perceptron classifier with weights estimated from sonar training data set using stochastic gradient descent

Tensorflow XNOR BinaryNets
BinaryNets in TensorFlow with XNOR GEMM op

Libraries

Text Classification
All kinds of text classification models and more with deep learning

Gonum
A set of numeric libraries for the Go programming language. It contains libraries for matrices, statistics, optimization, and more

Elastic4s
Elasticsearch Scala Client – Non Blocking, Type Safe, HTTP, TCP

Pandas Redshift
Load data from redshift into a pandas DataFrame and vice versa

Netflix Vectorflow
A minimalist neural network library optimized for sparse data and single machine environments


1000+ tools, frameworks and libraries indexed at PocketCluster Index!
Looking into adding your repo? tweet to @stkim1!

E-mail Subscribtion
Subscribe for upcoming posts!
Join Slack
Join the channel!

Weekly BigData & ML Roundup – July 27, 2017

Examples

Numerical Linear Algebra for Coders
This course contains the notebooks for the Numerical Linear Algebra elective in USF’s MSAN program, summer 2017

Deep Learning – The Straight Dope
An interactive book on deep learning, in concept and in MXNet

Detect Language from Text
This project focuses on Text Classification, and creates a model capable of classifying the language of the input text

Awesome Data Science with Ruby
Practical Data Science with Ruby based tools

Toolsets

Jupyter Notify
A Jupyter Notebook magic for browser notifications of cell completion

Tensor Tab
Little bit of Tensorflow in every new tab

Zookeeper Leader Election
Leader election using the Curator recipes with Zookeeper

Models

Spotlight
Deep recommender models using PyTorch

SimGAN Captcha
Solve captcha without manually labeling a training set

Keras GAN
Keras implementations of Generative Adversarial Networks

Facebook DrQA
Reading Wikipedia to Answer Open-Domain Questions

Library

Tecent Ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform


1000+ tools, frameworks and libraries indexed at PocketCluster Index!
Looking into adding your repo? tweet to @stkim1!

E-mail Subscribtion
Subscribe for upcoming posts!
Join Slack
Join the channel!

Weekly BigData & ML Roundup – July 20, 2017

Examples

Twitch Streamer
Spark Streaming library for reading chat messages from Twitch.tv

Deep Learning Project
An in-depth machine learning tutorial introducing readers to a whole machine learning pipeline from scratch.

Depression Detect
Predicting depression from acoustic features of speech using a Convolutional Neural Network.

Neural Machine Translation (seq2seq) Tutorial
This tutorial gives readers a full understanding of seq2seq models and shows how to build a competitive seq2seq model from scratch.

Iris: A Conversational Agent for Data Science
A extensible conversational agent for data science tasks

Toolsets

Guild AI
Guild AI supplements your TensorFlow™ operations by collecting a wide range of information about your model’s performance

TF Stage
A fast and canonical project setup for TensorFlow models. The most difficult part of getting started with TensorFlow isn’t deep learning, it’s putting together hundreds of API calls into a cohesive model.

Hugo Jupyter
Publish Jupyter notebooks with Hugo

Models

Foolbox
Python toolbox to create adversarial examples that fool neural networks

Facebook SparseConvNet
This is the Torch/PyTorch library for training Submanifold Sparse Convolutional Networks

Lip Reading
Cross Audio-Visual Recognition using 3D Architectures

Libraries

Catboost
CatBoost is an open-source gradient boosting on decision trees library with categorical features support out of the box for Python, R

Tensorforce
A TensorFlow library for applied reinforcement learning

Intel OAP
Optimized Analytics Package for Spark Platform

Apache Olingo
A Java library and extensions around the Open Data specification.

Finch
Many Machine Learning Models based on TensorFlow / PyTorch (Keep Updating)

Dense AI
A library for dense inference and training of Convolutional Neural Networks (CNNs) on Images for Segmentation and Detection in PyTorch

Tensorflow Scala
TensorFlow API for the Scala Programming Language

Framework

Apache Hawq (incubating)
A Hadoop native SQL query engine that combines the key technological advantages of MPP database with the scalability and convenience of Hadoop


1000+ tools, frameworks and libraries indexed at PocketCluster Index!
Looking into adding your repo? tweet to @stkim1!

E-mail Subscribtion
Subscribe for upcoming posts!
Join Slack
Join the channel!

Weekly BigData & ML Roundup – July 13, 2017

Examples

Awesome Machine Learning On Source Code
Interesting links & research papers related to Machine Learning applied to source code

TensorFlow World Resources
The purpose of this project is to introduce a shortcut to developers and researcher for finding useful resources about TensorFlow.

Deep Learning with Cats
Deep learning with cats (^._.^)

Image Captioning
Image Captioning using InceptionV3 and beam search

Aerial Crack Detection with Caffe
Detect and recognize the aerial pavament crack using Caffe.

Toolsets

Jupyshare
JupyShare lets you release your notebook to the cloud and gives you a public endpoint for it through ngrok

Facets
Visualizations for machine learning datasets

Models

RON
Reverse Connection with Objectness Prior Networks for Object Detection, CVPR 2017

InferSent
Sentence embeddings (InferSent) and training code for NLI.

Scatteract
Project which implements extraction of data from scatter plots

Reco
Fast Weighted Alternating Least Squares for collaborative filtering and topic modeling

Libraries

Certigrad
A proof-of-concept for Bug-free machine learning on stochastic computation graphs

AMD MIOpen
AMD’s Machine Intelligence Library


1000+ tools, frameworks and libraries indexed at PocketCluster Index!
Looking into adding your repo? tweet to @stkim1!

E-mail Subscribtion
Subscribe for upcoming posts!
Join Slack
Join the channel!