Apache has announced Arrow as a Top-Level Project! Check out this week’s roundup!
This repository includes standard models and examples for running on the ImageNet dataset.
This example contains code for running DL4J on a standalone Spark cluster as well as on a regular Spark deployment.
Arrow is a set of technologies that enable big-data systems to process and move data fast.
Airflow is a system to programmatically author, schedule and monitor data pipelines.
Apache Tika(TM) is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
Hierarchical Temporal Memory implementation in Java – an official Community-Driven Java port of the Numenta Platform for Intelligent Computing (NuPIC).
dl4j-spark-ml is a Spark Package for the deeplearning4j library.
Fast, Scientific Computing for the JVM (NDArrays)
A platform-agnostic evaluation tool for machine learning algorithms
An external module for the deeplearning4j NLP modules, providing language-specific add-ons
Weka DL4J. Original code by Mark Hall
Big Bench Workload Development
Performance Analysis Tool
HiBench is a big data benchmark suite.