Skip to content
#

hdfs

Here are 676 public repositories matching this topic...

seaweedfs

SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, local tiering, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, Hadoop, WebDAV, encryption, Erasure Coding.

  • Updated Aug 28, 2021
  • Go
ppwwyyxx
ppwwyyxx commented Sep 17, 2020

Problem description

Be able to read public GCS files without providing credentials.

Steps/code to reproduce the problem

path = "gs://tensorflow-nightly/prod/tensorflow/release/ubuntu_16/gpu_py37_full/nightly_release/18/20190813-010608/github/tensorflow/pip_pkg/tf_nightly_gpu-1.15.0.dev20190813-cp37-cp37m-linux_x86_64.whl"

import smart_open
try:
    f = smart_open.s
datapythonista
datapythonista commented Aug 23, 2021

In #2909 the method .set_database() has been standardized, but this does not include making it always available in all backends. There are several backends, such as SQLAlchemy backends, File backends or clickhouse, that don't implement it, and don't let changing the backend.

The reason in most cases seems to be that the underlying connection object doesn't support changing the database. But i

80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.

  • Updated Jul 23, 2021
  • Python

ElasticCTR,即飞桨弹性计算推荐系统,是基于Kubernetes的企业级推荐系统开源解决方案。该方案融合了百度业务场景下持续打磨的高精度CTR模型、飞桨开源框架的大规模分布式训练能力、工业级稀疏参数弹性调度服务,帮助用户在Kubernetes环境中一键完成推荐系统部署,具备高性能、工业级部署、端到端体验的特点,并且作为开源套件,满足二次深度开发的需求。

  • Updated Jul 11, 2020
  • Python

Improve this page

Add a description, image, and links to the hdfs topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the hdfs topic, visit your repo's landing page and select "manage topics."

Learn more