#
big-data
Here are 2,130 public repositories matching this topic...
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
computer-science
lists
devops
distributed-systems
machine-learning
awesome
web-development
programming
big-data
system
backend
architecture
scalability
resources
design-patterns
interview
awesome-list
interview-practice
interview-questions
system-design
-
Updated
Jul 22, 2020
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
python
aws
data-science
machine-learning
caffe
theano
big-data
spark
deep-learning
hadoop
tensorflow
numpy
scikit-learn
keras
pandas
kaggle
scipy
matplotlib
mapreduce
-
Updated
Jul 24, 2020 - Python
python
nlp
data-science
machine-learning
natural-language-processing
big-data
ai
deep-learning
neural-network
cython
artificial-intelligence
spacy
neural-networks
nlp-library
-
Updated
Jul 29, 2020 - Python
PredictionIO, a machine learning server for developers and ML engineers.
-
Updated
May 7, 2020 - Scala
An open source cybersecurity protocol for syncing decentralized graph data.
iot
machine-learning
cryptography
crypto
encryption
database
big-data
graph
offline-first
protocol
end-to-end
peer-to-peer
dapp
decentralized
blockchain
realtime
p2p
artificial-intelligence
crdt
dweb
-
Updated
Jul 24, 2020 - JavaScript
ClickHouse is a free analytics DBMS for big data
-
Updated
Jul 29, 2020 - C++
CMAK is a tool for managing Apache Kafka clusters
-
Updated
Jul 12, 2020 - Scala
The most widely used Python to C compiler
-
Updated
Jul 29, 2020 - Python
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
python
data-science
machine-learning
data-mining
tutorial
r
big-data
gpu
cuda
kaggle
gbdt
gbm
gpu-computing
decision-trees
gradient-boosting
coreml
catboost
categorical-features
-
Updated
Jul 29, 2020 - C++
Open Source Fast Scalable Machine Learning Platform For Smarter Applications: Deep Learning, Gradient Boosting & XGBoost, Random Forest, Generalized Linear Modeling (Logistic Regression, Elastic Net), K-Means, PCA, Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
python
java
data-science
machine-learning
multi-threading
opensource
r
big-data
spark
deep-learning
hadoop
random-forest
gpu
naive-bayes
h2o
distributed
pca
gbm
ensemble-learning
automl
-
Updated
Jul 29, 2020 - Jupyter Notebook
Apache CouchDB
javascript
couchdb
content
http
cloud
erlang
database
big-data
cplusplus
network-server
network-client
-
Updated
Jul 29, 2020 - Erlang
Reproducible Data Science at Scale!
go
docker
kubernetes
distributed-systems
data-science
big-data
analytics
containers
data-analysis
pachyderm
-
Updated
Jul 29, 2020 - Go
Stream Framework is a Python library, which allows you to build news feed, activity streams and notification systems using Cassandra and/or Redis. The authors of Stream-Framework also provide a cloud service for feed technology:
-
Updated
Jul 15, 2020 - Python
Moloch is an open source, large scale, full packet capturing, indexing, and database system.
-
Updated
Jul 28, 2020 - C
Open Source In-Memory Data Grid
-
Updated
Jul 29, 2020 - Java
BigDL: Distributed Deep Learning Framework for Apache Spark
-
Updated
Jul 27, 2020 - Scala
Apache Ignite
iot
cloud
sql
database
big-data
hadoop
cache
osgi
ignite
network-server
in-memory-database
data-management-platform
network-client
distributed-sql-database
in-memory-computing
-
Updated
Jul 29, 2020 - Java
Vespa is an engine for low-latency computation over large data sets.
java
search-engine
machine-learning
big-data
ai
server
cpp
tensorflow
vespa
serving
serving-recommendation
-
Updated
Jul 29, 2020 - Java
An easy to use, self-service open BI reporting and BI dashboard platform.
-
Updated
Jun 16, 2020 - TSQL
Bare bone examples of machine learning in TensorFlow
big-data
simple
tensorflow
linear-regression
distributed-computing
tensorflow-tutorials
tensorflow-exercises
tensorflow-examples
-
Updated
Mar 14, 2017 - Python
Improve this page
Add a description, image, and links to the big-data topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the big-data topic, visit your repo's landing page and select "manage topics."