big-data

Is your feature request related to a problem? Please describe.
Many static type checkers have issues finding Cython's stubs.
Here is from running mypy on my current project:

error: Skipping analyzing "cython": found module but no type hints or library stubs

The same issue can be seen when using import Cython as cython:

error: Skipping analyzing "Cython": found module but

It would be great to have FBeta, F2, or F0.5 metrics to be implemented without the need for a custom metric class defined by user.

catboost version: 0.26

Remove the initial shortcut sync code in viewer/db.js for v3.1. Note that to upgrade to v3.1+ you must upgrade to v3.0 first.

There is no technical difficulty to support includeValue option, looks like we are just missing it on the API level.

See SO question

With Hive connector

trino:default> CREATE TABLE one (a varchar);
            -> CREATE VIEW two AS SELECT * FROM one;
CREATE TABLE
CREATE VIEW

DROP TABLE is rejected on a view:

trino:default> DROP TABLE two;
Query 20210906_150832_00015_id3y3 failed: line 1:1: Table 'hive.default.two' does not exist, but a view with that name exists. Did you mean DROP VIEW hive.default.t

Could we clarify that delta-log files are JSON line-delimited files in https://github.com/delta-io/delta/blob/master/PROTOCOL.md#delta-log-entries ?

In the PROTOCOL.md file it is not clear what is the format of JSON. Every delta-log entry file is "new-line delimited json file", but this is not specified in this file. Protocol do not explicitly specify that every action is stored as a single-lin

... to make it easier to read Vespa documentation on an e-reader / offline

Vespa documentation is generated using Jekyll from .md and .html files, look into options for generating the artifact as part of site generation (there might be plugins we can use here)

Use case:

1.) A user may want to backup all tables but no metadata like users, privileges, etc. without explicitly defining each table inside the CREATE SNAPSHOT statement.

2.) A user may want to transfer users & privileges, custom analyzers or user-defined-functions from one cluster to another without backing up a complete cluster including all data (tables).

*Feature description

big-data

Here are 2,614 public repositories matching this topic...

binhnguyennus / awesome-scalability

apache / spark

donnemartin / data-science-ipython-notebooks

ClickHouse / ClickHouse

apache / flink

amark / gun

apache / predictionio

prestodb / presto

heibaiying / BigData-Notes

yahoo / CMAK

andkret / Cookbook

cython / cython

apache / storm

catboost / catboost

h2oai / h2o-3

apache / zeppelin

pachyderm / pachyderm

apache / couchdb

apache / beam

arkime / arkime

tschellenbach / Stream-Framework

hazelcast / hazelcast

trinodb / trino

apache / ignite

apache / hive

intel-analytics / BigDL

delta-io / delta

vespa-engine / vespa

linkedin / datahub

crate / crate

Improve this page

Add this topic to your repo