hdfs

Problem description

I am getting the following error when reading a file from an S3 bucket:

Invalid bucket name "xxxx:yyyy@bucket": Bucket name must match the regex "^[a-zA-Z0-9.\-_]{1,255}$" or be an ARN matching the regex "^arn:(aws).*:s3:[a-z\-0-9]+:[0-9]{12}:accesspoint[/:][a-zA-Z0-9\-]{1,63}$|^arn:(aws).*:s3-outposts:[a-z\-0-9]+:[0-9]{12}:outpost[/:][a-zA-Z0-9\-]{1,63}[/:]acce
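
For illustration only, here is a minimal sketch of why that string is rejected. It checks the name against the first (non-ARN) pattern quoted in the message, then shows how the standard library separates the credential-style prefix from the bucket. The helper names and the `s3://xxxx:yyyy@bucket/key.txt` URL are hypothetical, and this is not the library's actual validation code.

```python
import re
from urllib.parse import urlsplit

# First pattern quoted in the error message (plain bucket names, not ARNs).
BUCKET_NAME = re.compile(r"^[a-zA-Z0-9.\-_]{1,255}$")


def is_plain_bucket_name(name: str) -> bool:
    """Return True if `name` matches the plain bucket-name pattern."""
    return BUCKET_NAME.match(name) is not None


def split_s3_url(url: str):
    """Split a URL into (username, password, bucket, key) with urlsplit."""
    parts = urlsplit(url)
    return parts.username, parts.password, parts.hostname, parts.path.lstrip("/")


# ":" and "@" are not in the allowed character class, so the match fails.
print(is_plain_bucket_name("xxxx:yyyy@bucket"))  # False
print(is_plain_bucket_name("bucket"))            # True

# Hypothetical URL: the part after "@" is the bucket name on its own.
print(split_s3_url("s3://xxxx:yyyy@bucket/key.txt"))
```

The sketch suggests the whole "xxxx:yyyy@bucket" segment was passed along as the bucket name instead of only the part after the "@".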

See #3097, which introduces support for Pandas, Dask, and PySpark.

Similar to how Unix ls works, the parameter could be -t.

Given the new key-value store event stream, it'd be nice to have something like:

$ skein kv events <application id> [options...]

where the process blocks and logs the event stream to the console until interrupted. This would be useful for debugging as well as for demos.
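
A rough sketch of the requested behaviour, assuming the event stream can be consumed as an ordinary Python iterable. `log_events` is a hypothetical helper and does not use or represent skein's actual API; it only shows the block-and-log-until-interrupted loop.

```python
import sys
from typing import Iterable, TextIO


def log_events(events: Iterable[object], out: TextIO = sys.stdout) -> int:
    """Write each event to `out` as it arrives; stop cleanly on Ctrl-C.

    Returns the number of events written.
    """
    count = 0
    try:
        # Iteration blocks whenever the underlying source has nothing ready.
        for event in events:
            out.write(f"{event}\n")
            out.flush()  # show each event immediately, useful in demos
            count += 1
    except KeyboardInterrupt:
        # Treat an interrupt as the normal way to end the stream.
        pass
    return count
```

Flushing after every event keeps the console output live, which matters more for debugging and demos than throughput does.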

Here are 729 public repositories matching this topic...

chrislusf / seaweedfs

heibaiying / BigData-Notes

wangzhiwubigdata / God-Of-BigData

juicedata / juicefs

RaRe-Technologies / smart_open

ibis-project / ibis

TileDB-Inc / TileDB

CheckChe0803 / BigData-Interview

colinmarc / hdfs

spotify / snakebite

sunnyandgood / BigData

HariSekhon / DevOps-Python-tools

Stratio / sparta

collabH / repository

lensesio / kafka-connect-ui

confluentinc / kafka-connect-hdfs

uber / storagetapper

divolte / divolte-collector

mtth / hdfs

fabiogjardim / bigdata_docker

tirthajyoti / Spark-with-Python

Eugene-Mark / bigdata-file-viewer

wradlib / wradlib

RumbleDB / rumble

mesosphere / dcos-commons

PaddlePaddle / ElasticCTR

mullerhai / HsunTzu

avast / hdfs-shell

TileDB-Inc / TileDB-Py

jcrist / skein
