-
Updated
Nov 4, 2021 - Python
#
hadoop
Here are 2,719 public repositories matching this topic...
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
python
aws
data-science
machine-learning
caffe
theano
big-data
spark
deep-learning
hadoop
tensorflow
numpy
scikit-learn
keras
pandas
kaggle
scipy
matplotlib
mapreduce
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
-
Updated
Feb 4, 2022 - Python
mysql
hive
hadoop
clickhouse
vuejs2
postgresql
oracle
mysql-server
ios-swift
springboot
db2
postgresql-database
sqlserver
tidb
fastjson
typescript2
-
Updated
Feb 6, 2022 - Java
编程电子书,电子书,编程书籍,包括C,C#,Docker,Elasticsearch,Git,Hadoop,HeadFirst,Java,Javascript,jvm,Kafka,Linux,Maven,MongoDB,MyBatis,MySQL,Netty,Nginx,Python,RabbitMQ,Redis,Scala,Solr,Spark,Spring,SpringBoot,SpringCloud,TCPIP,Tomcat,Zookeeper,人工智能,大数据类,并发编程,数据库类,数据挖掘,新面试题,架构设计,算法系列,计算机类,设计模式,软件测试,重构优化,等更多分类
mysql
python
linux
docker
redis
elasticsearch
spark
spring
hadoop
rabbitmq
solr
jvm
netty
springboot
mybatis
springcloud
-
Updated
Dec 30, 2021
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
python
java
data-science
machine-learning
multi-threading
opensource
r
big-data
spark
deep-learning
hadoop
random-forest
gpu
naive-bayes
h2o
distributed
pca
gbm
ensemble-learning
automl
-
Updated
Feb 9, 2022 - Jupyter Notebook
Alluxio, data orchestration for analytics and machine learning in the cloud
spark
presto
hadoop
tensorflow
data-analysis
alluxio
memory-speed
data-orchestration
virtual-distributed-filesystem
-
Updated
Feb 9, 2022 - Java
At LinkedIn, we are using this curriculum for onboarding our entry-level talents into the SRE role.
-
Updated
Jan 25, 2022 - HTML
ebyhr
commented
Jan 31, 2022
The current test TestCassandraConnectorTest
has some issues:
- Unrelated types in a table
- It reduces maintainability when we want to add test cases to a type
- The definition and data are managed in the different class
- We need to look for the different class when we want to check the expected values
- Less coverage in a type
- min, max, null
- DST, julian->gregorian
It woul
Apache Ignite
iot
cloud
sql
database
big-data
hadoop
cache
osgi
ignite
network-server
in-memory-database
data-management-platform
network-client
distributed-sql-database
in-memory-computing
-
Updated
Feb 9, 2022 - Java
基于Spark的电影推荐系统,包含爬虫项目、web网站、后台管理系统以及spark推荐系统
-
Updated
Apr 1, 2019 - Java
zhanghaicheng1
commented
Dec 9, 2019
问题描述:关于工程中某个工作流节点单独执行不报错,保存工作流时提示:NotClassDefFoundError:Cloud not initialize class dispatch.Http$
问题原因:linkis-publish微服务关于netty-3.6.2.Final.jar升级包缺失
问题解决:将升级包上传重启linkis-publish微服务即可。
AI on Hadoop
-
Updated
Jul 22, 2021 - Java
emilyreff7
commented
Nov 3, 2021
See #3097 which introduces support for Pandas, Dask, and Pyspark.
Open
Refactor existing hashCode() to include class for methods using the Apache Commons Lang 3 style
3
GCHQDev404
commented
Nov 3, 2020
Append class
to all HashCodeBuilders
in Gaffer for the below issue to minimise hash collisions.
@Test
void name() {
Foo foo = new Foo();
Bar bar = new Bar();
assertFalse(foo.equals(bar));
assertNotEquals(foo.hashCode(), bar.hashCode()); //fails
}
class Bar {
int a = 3;
@Override
public int hashCode() {
Apache Hadoop docker image
-
Updated
Jan 2, 2022 - Shell
深圳地铁大数据客流分析系统🚇 🚄 🌟
mysql
docker
redis
elasticsearch
phoenix
kibana
scala
kafka
spark
mongodb
hive
hadoop
clickhouse
hbase
zookeeper
springboot
flink
kylin
cdh6
szt-bigdata
-
Updated
May 9, 2021 - Scala
High performance data store solution
-
Updated
Feb 3, 2022 - Scala
MooseFS – Open Source, Petabyte, Fault-Tolerant, Highly Performing, Scalable Network Distributed File System (Software-Defined Storage)
fuse
big-data
hadoop
storage
clustering
filesystem
scalability
distributed-computing
snapshot
posix
file-system
distributed-storage
high-availability
software-defined-storage
distributed-file-system
storage-tiering
moosefs
posix-compliant
petabyte
ditributed-systems
-
Updated
Dec 7, 2021 - C
Improve this page
Add a description, image, and links to the hadoop topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the hadoop topic, visit your repo's landing page and select "manage topics."
At this moment relu_layer op doesn't allow threshold configuration, and legacy RELU op allows that.
We should add configuration option to relu_layer.