Weekly Machine Learning Opensource Roundup – Sep. 27, 2018

Examples

Coursera Machine Learning MOOC by Andrew Ng
Python assignments for Andrew Ng’s Machine Learning class on Coursera, with full submission-for-grading capability and rewritten instructions.

Just Enough Scala for Spark
A tutorial on the most important features and idioms of Scala that you need to use Spark’s Scala APIs.

Lit2Vec
Representing Books as vectors using the Word2Vec algorithm

deep learning object detection
A paper list of object detection using deep learning

Toolsets

Transfer Learning Suite
Transfer Learning Suite in Keras. Perform transfer learning using any built-in Keras image classification model easily!

jiant
The jiant sentence representation learning toolkit is an extensible platform meant to make it easy to run experiments that involve multitask and transfer learning across sentence-level NLP tasks.

Semantic Segmentation Suite
Semantic Segmentation Suite in TensorFlow. Implement, train, and test new Semantic Segmentation models easily!

Models

generative-models
Annotated, understandable, and visually interpretable PyTorch implementations of: VAE, BIRVAE, NSGAN, MMGAN, WGAN, WGANGP, LSGAN, DRAGAN, BEGAN, RaGAN, InfoGAN, fGAN, FisherGAN

Optical Flow Prediction with TensorFlow
Implements “PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume,” by Deqing Sun et al. (CVPR 2018)

PyTorch Image Dehazing
PyTorch implementation of some single image dehazing networks.

Libraries

mia
A library for running membership inference attacks (MIA) against machine learning models

apricot
apricot implements submodular selection for the purpose of selecting subsets of massive data sets to train machine learning models quickly.
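
To give a feel for what submodular selection means, here is a minimal pure-Python sketch of greedy selection under a facility location objective, one common submodular function. It is not apricot's API: the function name `greedy_facility_location` and the toy data are hypothetical, and the sketch assumes non-negative similarities.

```python
import math

def greedy_facility_location(similarity, k):
    """Greedily pick k indices that maximize sum_i max_{j in S} similarity[i][j].

    Assumes similarity is a square matrix of non-negative values.
    """
    n = len(similarity)
    selected = []
    best = [0.0] * n  # best[i]: similarity of item i to its closest selected item
    for _ in range(k):
        best_gain, best_j = -1.0, None
        for j in range(n):
            if j in selected:
                continue
            # Marginal gain of adding j: how much each item's coverage improves.
            gain = sum(max(similarity[i][j] - best[i], 0.0) for i in range(n))
            if gain > best_gain:
                best_gain, best_j = gain, j
        selected.append(best_j)
        best = [max(best[i], similarity[i][best_j]) for i in range(n)]
    return selected

# Toy 1-D data set with two tight clusters (hypothetical values).
points = [0.0, 0.1, 0.2, 5.0, 5.1, 5.2]
similarity = [[math.exp(-(a - b) ** 2) for b in points] for a in points]

subset = greedy_facility_location(similarity, k=2)
print(sorted(subset))
```

With two items to pick, the greedy procedure ends up with the central point of each cluster, which is the sense in which a small subset can stand in for the full data set.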

Weekly Machine Learning Opensource Roundup – Sep. 20, 2018

Examples

The Machine Learning cheatsheet
A 5-page Machine Learning cheatsheet focusing on how the most popular algorithms work under the hood

Research2Vec
Upcoming project for representing research papers as vectors / latent representations. More coming soon

Toolsets

PBO
Probability of Backtest Overfitting

Merging Models for TensorFlow Serving
This tool can merge several TensorFlow frozen models (.pb files) into a single model

LogDevice
Distributed storage for sequential data

Models

question-answering
This repository contains curated PyTorch implementations of several question answering systems evaluated on SQuAD

Multi-View Network in Keras
Keras implementation of Guo et al.’s Multi-View Network (2017).

machine-translation
Sequence to sequence models for machine translation in PyTorch

neural-processes
PyTorch implementation of Neural Processes

NeuralProcesses
Neural Processes implementation for 1D regression

fastTSNE
A visualization of 160,796 single-cell transcriptomes from the mouse nervous system

Policy Gradient (PG) Algorithms
Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)

Libraries

ONNX Scala
Numerically generic typeful Scala ONNX API

TonY
TensorFlow on YARN (TonY) is a framework to natively run TensorFlow on Apache Hadoop.

Deequ – Unit Tests for Data
Deequ is a library built on top of Apache Spark for defining “unit tests for data”, which measure data quality in large datasets.

Weekly Machine Learning Opensource Roundup – Sep. 13, 2018

Examples

GAN Decks
AI Generated Skateboard Decks

Graffiti Net
A way to document all unique street art in the world and create a genealogy of all publicly visible media using machine learning.

FAST
End-to-end earthquake detection pipeline via efficient time series similarity search

Papers with Code
Papers with code. Sorted by stars. Updated weekly.

Seminars DeepBayes Summer School 2018
Seminar materials from the Deep|Bayes summer school in Moscow on how Bayesian methods can be combined with deep learning to get better results in machine learning applications.

US Building Footprints
Computer-generated building footprints for the United States

Donkey RL
Train Donkey Car in Unity Simulator with Reinforcement Learning

Toolset

jupytext
Jupyter notebooks as Markdown documents, Julia, Python or R scripts

Models

Adaptive Feeding
Achieving Fast and Accurate Detections by Adaptively Combining Object Detectors

Keras-MMoE
Keras implementation of Multi-gate Mixture-of-Experts

ProSR
A Fully Progressive Approach to Single-Image Super-Resolution

DecoupleLearning
Implementation code for the ECCV 2018 paper “Decouple Learning for Parameterized Image Operators”

Libraries

Fluent
A fully managed, data-first computation framework under development at the U.C. Berkeley RISE Lab.

go-tsne
t-Distributed Stochastic Neighbor Embedding (t-SNE) in Go

NLP.js
An NLP library built in Node.js on top of Natural, with entity extraction, sentiment analysis, automatic language identification, and more

Weekly Machine Learning Opensource Roundup – Sep. 6, 2018

Examples

connect4
Solving board games like Connect4 using Reinforcement Learning Algorithms

60 Days RL Challenge
Learn Deep Reinforcement Learning in depth in 60 days

Toolsets

deepzoo
The goal of this repo is to provide a place where trained models can be shared.

MagNet
MagNet, wrapped around PyTorch, is developed with the aim of reducing boilerplate code and writing Deep Learning architectures with more grace.

lazydata
A minimalist, scalable library for including data dependencies into Python projects

Subgraphs
A Deep Learning IDE

GibsonEnv
Gibson reflects real-world semantic complexity by virtualizing real spaces

Mantra
A high-level, rapid development framework for machine learning projects

AgentMaps
Make social simulations on interactive maps with JavaScript! Agent-based modeling for the web.

GAN Lab
An Interactive, Visual Experimentation Tool for Generative Adversarial Networks

Models

ESRGAN
Enhanced Super-Resolution Generative Adversarial Networks

Temperature Scaling
A simple way to calibrate your neural network.
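
As a rough illustration of the idea (not the code from the linked repository), temperature scaling divides a trained network's logits by a single scalar T chosen on held-out data, which softens or sharpens the predicted probabilities without changing the predicted class. The sketch below is pure Python; the helper names, the grid search used in place of a gradient-based optimizer, and the validation logits are all assumptions made for the example.

```python
import math

def softmax(logits, temperature=1.0):
    """Softmax of logits / temperature, computed in a numerically stable way."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def nll(logit_rows, labels, temperature):
    """Mean negative log-likelihood of the true labels at a given temperature."""
    losses = [-math.log(softmax(row, temperature)[y])
              for row, y in zip(logit_rows, labels)]
    return sum(losses) / len(losses)

def fit_temperature(logit_rows, labels, candidates=None):
    """Pick the candidate temperature with the lowest validation NLL (grid search)."""
    if candidates is None:
        candidates = [0.5 + 0.05 * i for i in range(91)]  # roughly 0.5 to 5.0
    return min(candidates, key=lambda t: nll(logit_rows, labels, t))

# Hypothetical validation logits from an overconfident two-class model:
# every margin is large, but the last prediction is wrong.
val_logits = [[8.0, 0.0], [7.0, 0.0], [0.0, 9.0], [6.0, 0.0]]
val_labels = [0, 0, 1, 1]

T = fit_temperature(val_logits, val_labels)
print("fitted temperature:", T)
print("before:", softmax(val_logits[3]))
print("after: ", softmax(val_logits[3], T))
```

Because the model here is overconfident, the fitted temperature comes out above 1, so the calibrated probabilities sit closer to uniform while the argmax stays the same.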

keras-seq-2-seq-signal-prediction
An implementation of a sequence to sequence neural network using an encoder-decoder (Predicting Time Series with Neural Networks)

Library

Tefla
Tefla is built on top of TensorFlow for fast prototyping of deep learning algorithms

Weekly Machine Learning Opensource Roundup – Aug. 30, 2018

Examples

Statistical Modeling Examples
Basic statistical modelling & machine learning examples.

faced
Near-real-time CPU face detection using deep learning

Toolsets

Dopamine
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Papermill
Parameterize, execute, and analyze notebooks

iodide
A frictionlessly portable interface for literate scientific computing in the browser

Databot
High-performance, data-driven Python programming framework for web crawling, ETL, and data pipeline work

doccano
Open-source text annotation tool for machine learning practitioners.

Models

Simple Baselines for Human Pose Estimation and Tracking
The official implementation of the Microsoft ECCV 2018 paper “Simple Baselines for Human Pose Estimation and Tracking”

ScalphaGoZero
An independent implementation of DeepMind’s AlphaGoZero in Scala, using Deeplearning4J (DL4J)

DRIT-Tensorflow
Simple TensorFlow implementation of Diverse Image-to-Image Translation via Disentangled Representations (ECCV 2018 Oral)

Noise2Noise
An unofficial and partial Keras implementation of “Noise2Noise: Learning Image Restoration without Clean Data”

Libraries

flair
A very simple framework for state-of-the-art NLP

HyperLearn
A rewritten Sklearn and Statsmodels combo that advertises 50%+ faster execution, 50%+ less RAM usage, GPU support, and new algorithms.

Hyperparameter Hunter
Easy hyperparameter optimization and automatic result saving across machine learning algorithms and libraries

Weekly Machine Learning Opensource Roundup – Aug. 23, 2018

Examples

Food Recipe CNN
Deep Learning food image recognition system for cooking recipe retrieval

Retrain MobileNet for the web
Retrain a MobileNet V1 or V2 model and use it in the browser with TensorFlow.js

ENV400 Assignment
Assignment for ENV-400 (Air Pollution and Climate Change) Masters course at EPFL

Deep Learning Coursera
Projects from the Deep Learning Specialization from deeplearning.ai provided by Coursera

Models

vid2vid
PyTorch implementation of the authors’ method for high-resolution (e.g. 2048×1024) photorealistic video-to-video translation.

PolygonRNN++
PyTorch training/tool code for Polygon-RNN++ (CVPR 2018)

relational-rnn-pytorch
An implementation of DeepMind’s Relational Recurrent Neural Networks in PyTorch.

MSG-GAN
Multi-Scale Gradients GAN (architecture inspired by ProGAN, but without layer-wise pretraining)

Libraries

DoWhy
A Python library based on a unified language for causal inference that makes it easy to estimate causal effects. DoWhy combines causal graphical models and potential outcomes frameworks.

TransmogrifAI
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Spark with minimal hand tuning

LocustDB
Massively parallel, high performance analytics database that will rapidly devour all of your data

Petastorm
A library enabling the use of Parquet storage from TensorFlow, PyTorch and other Python-based ML training frameworks

libartificial
A small pure C shared library with ML/nonparametric algorithms for researchers and developers

Weekly Machine Learning Opensource Roundup – Aug. 16, 2018

Examples

Deep Learning World
Organized Resources for Deep Learning Researchers and Developers

DanceNet
Dance generator using Autoencoder, LSTM and Mixture Density Network. (Keras)

Nowcasting
Nowcasting implements the framework described in “Macroeconomic Nowcasting and Forecasting with Big Data”, sponsored by the Federal Reserve Bank of New York

Toolsets

Artificial Adversary
Tool to generate adversarial text examples and test machine learning models against them

Evolute
A simple tool for quick experimentation with evolutionary algorithms for numerical optimization.

GraphPipe
A protocol and collection of software designed to simplify machine learning model deployment and decouple it from framework-specific model implementations.

Models

UnsupervisedMT
Phrase-Based & Neural Unsupervised Machine Translation

RelativisticGAN
Code for replication of the paper “The relativistic discriminator: a key element missing from standard GAN”

DeepSpectralClustering
PyTorch implementation of the paper “Deep Spectral Clustering Learning”

Libraries

tf-nlp-blocks
Some frequently used NLP blocks

Sparser
Raw Filtering for Faster Analytics over Raw Data

Quantiles
Optimal Quantile Approximation in Streams