Weekly Machine Learning Opensource Roundup – Oct. 25, 2018

Examples

Knet-the-Julia-dope
This interactive book on deep learning is a Julia translation of mxnet-the-straight-dope. The project grew out of the MIT course 6.338 Modern Numerical Computing with Julia, taught by Professor Alan Edelman.

aleph_star
Reinforcement learning with A* and a deep heuristic

Toolsets

Hail
An open-source, general-purpose, Python-based scalable data analysis tool with additional data types and methods for working with genomic data

SKLL
SciKit-Learn Laboratory (SKLL) makes it easy to run machine learning experiments.

Atari
Persistent advantage learning dueling double DQN for the Arcade Learning Environment

Faster R-CNN and Mask R-CNN
This project aims at providing the necessary building blocks for easily creating detection and segmentation models using PyTorch 1.0.

Models

Phrase2vec
An extension of word2vec to learn phrase embeddings

Latent Dirichlet allocation
This is a C implementation of variational EM for latent Dirichlet allocation (LDA), a topic model for text or other discrete data.

Pose-Guided-Image-Generation
Implementation of the NIPS 2017 paper “Pose Guided Person Image Generation” in PyTorch

Libraries

Graph Nets library
Build Graph Nets in TensorFlow

modAL
An active learning framework for Python 3, designed with modularity, flexibility and extensibility in mind.

rl3stdlib
A collection of modules accessible to an RL3 program, intended to build a community-driven library of patterns for general NLP and for unstructured and semi-structured text that can empower personal, research and educational projects

GeneticAlgorithmsRepo
An assortment of genetic algorithms – all written from scratch, for Python 3.5.

Weekly Machine Learning Opensource Roundup – Oct. 18, 2018

Examples

Deep Play-by-Play
Labelling NBA action using deep learning

PhotoPrism
A server-based application for browsing, organizing and sharing your personal photo collection, powered by Go and Google TensorFlow.

Toolsets

Tensorflow-input-pipeline
A simpler way of reading data into TensorFlow

Camelot
PDF Table Extraction for Humans

Models

DME
Dynamic Meta-Embeddings for Improved Sentence Representations

HDLTex
Hierarchical Deep Learning for Text Classification

PyTorch-NEAT
PyTorch NEAT builds upon NEAT-Python by providing some functions which can turn a NEAT-Python genome into either a recurrent PyTorch network or a PyTorch CPPN for use in HyperNEAT or Adaptive HyperNEAT.

image-captioning
This repository contains PyTorch implementations of “Show and Tell: A Neural Image Caption Generator” and “Show, Attend and Tell: Neural Image Caption Generation with Visual Attention”.

RMDL
Random Multi-model Deep Learning for Classification

DeepQ-Decoding
Decoders for fault tolerant quantum computation via deepQ reinforcement learning

DeepMimic
Motion imitation with deep reinforcement learning. The framework uses reinforcement learning to train a simulated humanoid to imitate a variety of motion skills from mocap data.

3DDFA
An improved PyTorch re-implementation of the TPAMI 2017 paper “Face Alignment in Full Pose Range: A 3D Total Solution”.

Libraries

TRFL
A library built on top of TensorFlow that exposes several useful building blocks for implementing Reinforcement Learning agents.

kafkabridge
The Apache Kafka Client SDK

Scalar DB
A library that provides a storage abstraction and client-coordinated distributed transactions on top of Cassandra

Weekly Machine Learning Opensource Roundup – Oct. 11, 2018

Examples

pandas-tutorial
Beginner's guide to data wrangling with pandas, plus advanced concepts on scaling pandas for large datasets

stocksight
Stock analyzer and predictor using Elasticsearch, Twitter, News headlines and Python natural language processing and sentiment analysis

Toolsets

Talos
Hyperparameter Optimization for Keras Models

ArviZ
Python package to plot and analyse samples from probabilistic models

T4
Dropbox for data science, built on S3

Holodeck
High Fidelity Simulator for Reinforcement Learning and Robotics Research.

Models

Vel
Bring velocity to deep-learning research by providing a large, tried-and-tested pool of prebuilt components that are known to work well together.

CNNVocoder
A fast CNN-based vocoder. This work is inspired by the m-cnn model described in “Fast Spectrogram Inversion using Multi-head Convolutional Neural Networks”.

Progressive InfoGAN
Progressive training of GANs with Mutual Information Penalty

Libraries

Infer.NET
Infer.NET is a framework for running Bayesian inference in graphical models

garage
A framework for reproducible reinforcement learning research

PyCM
A multi-class confusion matrix library written in Python that accepts either input data vectors or a matrix directly, and a tool for post-classification model evaluation that supports most class-level and overall statistics.
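
For readers new to the idea, here is a minimal plain-Python sketch of what a multi-class confusion matrix and a few derived statistics look like when built from two label vectors. It is not PyCM's API; the function names (`confusion_matrix`, `per_class_stats`, `overall_accuracy`) and the toy labels are hypothetical and chosen only for illustration.

```python
from collections import Counter


def confusion_matrix(actual, predicted):
    """Count (actual, predicted) label pairs into a nested dict: matrix[actual][predicted]."""
    labels = sorted(set(actual) | set(predicted))
    counts = Counter(zip(actual, predicted))
    return {a: {p: counts[(a, p)] for p in labels} for a in labels}


def per_class_stats(matrix):
    """Precision and recall for each class, treating it as one-vs-rest."""
    stats = {}
    for label in matrix:
        tp = matrix[label][label]
        fn = sum(matrix[label].values()) - tp          # rest of the row
        fp = sum(matrix[a][label] for a in matrix) - tp  # rest of the column
        stats[label] = {
            "precision": tp / (tp + fp) if tp + fp else 0.0,
            "recall": tp / (tp + fn) if tp + fn else 0.0,
        }
    return stats


def overall_accuracy(matrix):
    """Fraction of samples on the diagonal."""
    total = sum(sum(row.values()) for row in matrix.values())
    correct = sum(matrix[label][label] for label in matrix)
    return correct / total if total else 0.0


# Toy data, invented for this example.
actual = ["cat", "dog", "cat", "bird", "dog", "cat"]
predicted = ["cat", "cat", "cat", "bird", "dog", "dog"]

matrix = confusion_matrix(actual, predicted)
stats = per_class_stats(matrix)
accuracy = overall_accuracy(matrix)
```

Rows are the actual labels and columns the predicted ones, so class-level statistics come from a row and a column, while overall statistics come from the whole table.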

Weekly Machine Learning Opensource Roundup – Oct. 4, 2018

Examples

TensorFlow Course
Simple and ready-to-use tutorials for TensorFlow

Java Machine Learning
Simple machine learning library for Java, with fully connected, convolutional, and recurrent layers. The library is mainly for educational purposes, and it is way too slow to be used on actual projects.

Awesome Human Pose Estimation
A collection of awesome resources on human pose estimation.

WordGenerator
This is the code for a blog post on Generating Words from Embeddings. It uses a character level decoder RNN to convert a word embedding (which represents a meaning) into a word by sampling one character at a time.

Toolsets

blaze
A blazing fast exporter for your Elasticsearch data.

HBaseToHive
This project transfers data from an HBase table to different targets, such as an HDFS file, a Hive table, or another HBase table.

Model

deeplabv3
PyTorch implementation of “Rethinking Atrous Convolution for Semantic Image Segmentation” (DeepLabV3), trained on the Cityscapes dataset.

Libraries

Stellar Graph
A Python library for machine learning on graph-structured (or equivalently, network-structured) data.

keras-loves-torchtext
Make Torchtext work with Keras.

Priority Kafka Client
A Kafka client that allows records to be produced to and consumed from Kafka on configured priority levels

Weekly Machine Learning Opensource Roundup – Sep. 27, 2018

Examples

Coursera Machine Learning MOOC by Andrew Ng
Python assignments for Andrew Ng's machine learning class on Coursera, with full submission-for-grading capability and rewritten instructions.

Just Enough Scala for Spark
A tutorial on the most important features and idioms of Scala that you need to use Spark’s Scala APIs.

Lit2Vec
Representing Books as vectors using the Word2Vec algorithm

deep learning object detection
A paper list of object detection using deep learning

Toolsets

Transfer Learning Suite
Transfer Learning Suite in Keras. Perform transfer learning using any built-in Keras image classification model easily!

jiant
The jiant sentence representation learning toolkit is an extensible platform meant to make it easy to run experiments that involve multitask and transfer learning across sentence-level NLP tasks.

Semantic Segmentation Suite
Semantic Segmentation Suite in TensorFlow. Implement, train, and test new Semantic Segmentation models easily!

Models

generative-models
Annotated, understandable, and visually interpretable PyTorch implementations of: VAE, BIRVAE, NSGAN, MMGAN, WGAN, WGANGP, LSGAN, DRAGAN, BEGAN, RaGAN, InfoGAN, fGAN, FisherGAN

Optical Flow Prediction with TensorFlow
Implements “PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume,” by Deqing Sun et al. (CVPR 2018)

PyTorch Image Dehazing
PyTorch implementation of some single image dehazing networks.

Libraries

mia
A library for running membership inference attacks (MIA) against machine learning models

apricot
apricot implements submodular selection for the purpose of selecting subsets of massive data sets to train machine learning models quickly.
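
To give a feel for what submodular selection means, here is a minimal plain-Python sketch of greedy selection under a facility-location objective, one common submodular function. It is not apricot's API or implementation; the function names, the similarity measure, and the toy points are all assumptions made for illustration.

```python
def similarity(a, b):
    """Similarity that decays with squared Euclidean distance (an illustrative choice)."""
    return 1.0 / (1.0 + sum((x - y) ** 2 for x, y in zip(a, b)))


def greedy_facility_location(points, k):
    """Greedily pick k indices to maximize sum_i max_{j in selected} similarity(i, j)."""
    n = len(points)
    sim = [[similarity(points[i], points[j]) for j in range(n)] for i in range(n)]
    best = [0.0] * n  # best similarity of each point to the selected set so far
    selected = []
    for _ in range(min(k, n)):
        # Marginal gain of adding candidate j: how much it improves coverage of every point.
        gains = {
            j: sum(max(sim[i][j] - best[i], 0.0) for i in range(n))
            for j in range(n)
            if j not in selected
        }
        choice = max(gains, key=gains.get)
        selected.append(choice)
        best = [max(best[i], sim[i][choice]) for i in range(n)]
    return selected


# Toy data, invented for this example: two tight clusters and one outlier.
points = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (5.0, 5.0), (5.1, 5.0), (10.0, 0.0)]
subset = greedy_facility_location(points, 3)
```

Because each pick is scored by its marginal gain, a point close to something already selected adds little, so the chosen subset tends to spread across the data rather than pile up in the densest cluster.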

Weekly Machine Learning Opensource Roundup – Sep. 20, 2018

Examples

The Machine Learning cheatsheet
A five-page machine learning cheatsheet covering what the most popular algorithms do under the hood

Research2Vec
Upcoming project for representing research papers as vectors / latent representations. More coming soon

Toolsets

PBO
Probability of Backtest Overfitting

Merging Models for TensorFlow Serving
This tool can merge TensorFlow frozen models (.pb files) into a single model

LogDevice
Distributed storage for sequential data

Models

question-answering
This repository contains curated PyTorch implementations of several question answering systems evaluated on SQuAD

Multi-View Network in Keras
Keras implementation of Guo et al.’s Multi-View Network (2017).

machine-translation
Sequence to sequence models for machine translation in PyTorch

neural-processes
PyTorch implementation of Neural Processes

NeuralProcesses
Neural Processes implementation for 1D regression

fastTSNE
A visualization of 160,796 single cell transcriptomes from the mouse nervous system

Policy Gradient (PG) Algorithms
Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)

Libraries

ONNX Scala
Numerically generic typeful Scala ONNX API

TonY
TensorFlow on YARN (TonY) is a framework to natively run TensorFlow on Apache Hadoop.

Deequ – Unit Tests for Data
Deequ is a library built on top of Apache Spark for defining “unit tests for data”, which measure data quality in large datasets.

Weekly Machine Learning Opensource Roundup – Sep. 13, 2018

Examples

GAN Decks
AI Generated Skateboard Decks

Graffiti Net
A way to document all unique street art in the world and create a genealogy of all publicly visual media using machine learning.

FAST
End-to-end earthquake detection pipeline via efficient time series similarity search

Paper with Code
Papers with code. Sorted by stars. Updated weekly.

Seminars DeepBayes Summer School 2018
Discussion materials regarding how Bayesian Methods can be combined with Deep Learning and lead to better results in machine learning applications at Deep|Bayes summer school Moscow.

US Building Footprints
Computer generated building footprints for the United States

Donkey RL
Train Donkey Car in Unity Simulator with Reinforcement Learning

Toolset

jupytext
Jupyter notebooks as Markdown documents, Julia, Python or R scripts

Models

Adaptive Feeding
Achieving Fast and Accurate Detections by Adaptively Combining Object Detectors

Keras-MMoE
Keras implementation of Multi-gate Mixture-of-Experts

ProSR
A Fully Progressive Approach to Single-Image Super-Resolution

DecoupleLearning
Implementation code for the ECCV 2018 paper “Decouple Learning for Parameterized Image Operators”

Libraries

Fluent
A fully managed, data-first computation framework under development at the U.C. Berkeley RISE Lab.

go-tsne
t-Distributed Stochastic Neighbor Embedding (t-SNE) in Go

NLP.js
An NLP library built in Node over Natural, with entity extraction, sentiment analysis, automatic language identification, and more