#
spark
Here are 4,884 public repositories matching this topic...
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
python
aws
data-science
machine-learning
caffe
theano
big-data
spark
deep-learning
hadoop
tensorflow
numpy
scikit-learn
keras
pandas
kaggle
scipy
matplotlib
mapreduce
-
Updated
Jun 28, 2020 - Python
Learn and understand Docker technologies, with real DevOps practice!
-
Updated
Jul 14, 2020 - Go
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
visualization
javascript
mysql
python
bigquery
bi
spark
dashboard
athena
analytics
postgresql
business-intelligence
redash
redshift
databricks
spark-sql
-
Updated
Jul 17, 2020 - JavaScript
Eclipse Deeplearning4j, ND4J, DataVec and more - deep learning & linear algebra for Java/Scala with GPUs + Spark
python
java
clojure
scala
spark
hadoop
gpu
intellij
linear-algebra
artificial-intelligence
deeplearning
neural-nets
dl4j
matrix-library
deeplearning4j
-
Updated
Jul 6, 2020 - Java
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
machine-learning
spark
deep-learning
uber
mxnet
tensorflow
mpi
keras
pytorch
machinelearning
baidu
deeplearning
-
Updated
Jul 18, 2020 - Python
nodejs
javascript
mysql
chart
spark
presto
hive
microservice
serverless
athena
analytics
postgresql
cube
-
Updated
Jul 18, 2020 - JavaScript
List of Data Science Cheatsheets to rule the world
-
Updated
Oct 31, 2019
flink learning blog. http://www.54tianzhisheng.cn 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》
-
Updated
Jul 4, 2020 - Java
Open-source IoT Platform - Device management, data collection, processing and visualization.
visualization
platform
mqtt
iot
coap
middleware
kafka
akka
spark
dashboard
netty
websockets
grpc
widgets
iot-platform
smart-farm
fleet-tracking
thingsboard
iot-analytics
-
Updated
Jul 17, 2020 - Java
A Flexible and Powerful Parameter Server for large-scale machine learning
machine-learning
scala
spark
model
spark-streaming
online-learning
parameter-server
high-dimensional
-
Updated
Jul 18, 2020 - Java
macOS development environment setup: Easy-to-understand instructions with automated setup scripts for developer tools like Vim, Sublime Text, Bash, iTerm, Python data analysis, Spark, Hadoop MapReduce, AWS, Heroku, JavaScript web development, Android development, common data stores, and dev-based OS X defaults.
nodejs
mysql
python
git
vim
macos
linux
bash
redis
cli
mac
aws
elasticsearch
cloud
spark
mongodb
iterm2
sublime-text
postgresql
android-development
-
Updated
Jun 20, 2020 - Python
Open Source Fast Scalable Machine Learning Platform For Smarter Applications: Deep Learning, Gradient Boosting & XGBoost, Random Forest, Generalized Linear Modeling (Logistic Regression, Elastic Net), K-Means, PCA, Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
python
java
data-science
machine-learning
multi-threading
opensource
r
big-data
spark
deep-learning
hadoop
random-forest
gpu
naive-bayes
h2o
distributed
pca
gbm
ensemble-learning
automl
-
Updated
Jul 18, 2020 - Jupyter Notebook
Alluxio, data orchestration for analytics and machine learning in the cloud
spark
presto
hadoop
tensorflow
data-analysis
alluxio
memory-speed
data-orchestration
virtual-distributed-filesystem
-
Updated
Jul 19, 2020 - Java
PipelineAI Kubeflow Distribution
docker
kubernetes
redis
machine-learning
airflow
kafka
spark
cassandra
neural-network
tensorflow
gpu
scikit-learn
keras
pytorch
artificial-intelligence
kubeflow
tfx
pipelineai
-
Updated
Apr 24, 2020 - Jsonnet
BigDL: Distributed Deep Learning Framework for Apache Spark
-
Updated
Jul 18, 2020 - Scala
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
-
Updated
Jun 22, 2020 - Python
酷玩 Spark: Spark 源代码解析、Spark 类库等
-
Updated
May 26, 2019 - Scala
Interactive and Reactive Data Science using Scala and Spark.
-
Updated
Jun 2, 2020 - JavaScript
The Hunting ELK
docker
elasticsearch
kibana
logstash
spark
jupyter-notebook
elk
threat-hunting
dockerhub
elastic
hunting
elk-stack
hunting-platforms
-
Updated
Jul 12, 2020 - Jupyter Notebook
Microsoft Machine Learning for Apache Spark
microsoft
http
machine-learning
scala
ai
spark
deep-learning
cntk
azure
ml
pyspark
lightgbm
cognitive-services
databricks
model-deployment
microsoft-machine-learning
-
Updated
Jul 17, 2020 - Scala
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
machine-learning
scala
ai
spark
dsl
transformations
ml
transformers
estimators
sparkml
pipelines
salesforce
structured-data
feature-engineering
features
einstein
automl
automated-machine-learning
transmogrification
transmogrify
-
Updated
Jul 17, 2020 - Scala
Improve this page
Add a description, image, and links to the spark topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the spark topic, visit your repo's landing page and select "manage topics."