Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spar…
#
spark
Repositories 3,419
Learn and understand Docker technologies, with real DevOps practice!
Deeplearning4j, ND4J, DataVec and more - deep learning & linear algebra for Java/Scala with GPUs + Spark
汇总java生态圈常用技术框架、开源中间件,系统架构、项目管理、经典架构案例、数据库、常用三方库、线上运维等知识
Updated Mar 9, 2019
Kubernetes中文指南/云原生应用架构实践手册 - https://jimmysong.io/kubernetes-handbook
macOS development environment setup: Easy-to-understand instructions with automated setup scripts for developer tools…
A Flexible and Powerful Parameter Server for large-scale machine learning
Alluxio, formerly Tachyon, Unify Data at Memory Speed
alluxio
distributed-storage
big-data
memory-speed
hadoop
spark
virtual-file-system
presto
tensorflow
Java
Updated Mar 22, 2019
Open Source Fast Scalable Machine Learning Platform For Smarter Applications (Deep Learning, Gradient Boosting, Rando…
h2o
machine-learning
data-science
deep-learning
big-data
ensemble-learning
gbm
random-forest
naive-bayes
pca
opensource
distributed
multi-threading
java
python
r
hadoop
spark
gpu
automatic
Java
Updated Mar 22, 2019
List of Data Science Cheatsheets to rule the world
Updated Mar 15, 2019
PipelineAI: Real-Time Enterprise AI Platform
machine-learning
artificial-intelligence
tensorflow
kubernetes
elasticsearch
cassandra
spark
kafka
netflixoss
presto
airflow
pipeline
docker
redis
neural-network
gpu
microservices
nifi
scikit
prediction
Java
Updated Mar 19, 2019
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
Python
Updated Mar 13, 2019
BigDL: Distributed Deep Learning Library for Apache Spark
Scala
Updated Mar 22, 2019
Open-source IoT Platform - Device management, data collection, processing and visualization.
Interactive and Reactive Data Science using Scala and Spark.
Python clone of Spark, a MapReduce alike framework in Python
Python
Updated Jan 23, 2019
REST job server for Apache Spark
酷玩 Spark: Spark 源代码解析、Spark 类库等
Scala
Updated Feb 6, 2019
A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Pyt…
machine-learning
data-science
r
python
gradient-boosting-machine
random-forest
deep-learning
xgboost
h2o
spark
R
Updated Sep 15, 2018
Fast, Scientific and Numerical Computing for the JVM (NDArrays)
java
gpu
scientific
nd4j
jvm
dl4j
backend
scala-notebook
spark
artificial-intelligence
scientific-computing
numerical-calculations
Java
Updated Jun 16, 2018
DataStax Spark Cassandra Connector
Scala
Updated Mar 15, 2019
A large-scale entity and relation database supporting aggregation of properties
Compile-time Language Integrated Queries for Scala
Scala
Updated Mar 22, 2019
spark ml 算法原理剖析以及具体的源码实现分析
Updated Feb 12, 2018
Machine Learning Platform and Recommendation Engine built on Kubernetes
machine-learning
deep-learning
deployment
kubernetes
docker
microservices
spark
kafka
kafka-streams
tensorflow
python
java
cloud
aws
gcp
azure
seldon
recommender-system
recommendation-engine
prediction
Java
Updated Jul 28, 2018
Microsoft Machine Learning for Apache Spark
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machin…
The Hunting ELK
A better compressed bitset in Java
Distributed Deep learning with Keras & Spark
Python
Updated Mar 20, 2019