big-data

After applying the patch that fixes #4209, I found that Sphinx emits some warnings.

+ /usr/bin/python3 setup.py build_sphinx -b man --build-dir build/sphinx
Unable to find pgen, not compiling formal grammar.
running build_sphinx
Running Sphinx v4.0.2
making output directory... done
loading intersphinx inventory from https://docs.python.org/3/objects.inv...
building [mo]: targets for 0 po

Problem: the approximate method can still be slow for many trees
catboost version: master
Operating System: ubuntu 18.04
CPU: i9
GPU: RTX2080

It would be good to be able to specify how many trees to use for the Shapley value computation. The model.predict and prediction_type versions allow this, as do LightGBM and XGBoost.
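As a rough illustration of the request, the sketch below computes per-feature contributions from only the first `n_trees` of a toy additive ensemble. Everything in it is hypothetical (the `Stump` type, the `shap_values_first_n` function, and its `n_trees` parameter); it is not CatBoost's API. It also assumes every tree is a single-feature stump, in which case a feature's Shapley value reduces to the tree's output minus its mean output over a background set.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence


@dataclass
class Stump:
    """A depth-1 tree that splits on a single feature (hypothetical toy type)."""
    feature: int
    threshold: float
    left_value: float   # output when x[feature] <= threshold
    right_value: float  # output when x[feature] > threshold

    def predict(self, x: Sequence[float]) -> float:
        return self.left_value if x[self.feature] <= self.threshold else self.right_value


def shap_values_first_n(
    trees: List[Stump],
    x: Sequence[float],
    background: List[Sequence[float]],
    n_trees: Optional[int] = None,
) -> List[float]:
    """Per-feature contributions for x, using only the first n_trees stumps.

    Assumption: the model is a sum of single-feature stumps, so each stump
    contributes (its output at x) - (its mean output over the background rows)
    to the feature it splits on. n_trees=None means "use every tree".
    """
    used = trees if n_trees is None else trees[:n_trees]
    contributions = [0.0] * len(x)
    for tree in used:
        baseline = sum(tree.predict(row) for row in background) / len(background)
        contributions[tree.feature] += tree.predict(x) - baseline
    return contributions


# Toy data: three stumps over two features, two background rows.
trees = [
    Stump(feature=0, threshold=0.5, left_value=-1.0, right_value=1.0),
    Stump(feature=1, threshold=2.0, left_value=0.0, right_value=4.0),
    Stump(feature=0, threshold=1.5, left_value=0.0, right_value=2.0),
]
background = [[0.0, 1.0], [1.0, 3.0]]
x = [2.0, 3.0]

full = shap_values_first_n(trees, x, background)
partial = shap_values_first_n(trees, x, background, n_trees=2)
```

Limiting the tree count skips the work for every later tree, so the cost grows with `n_trees` rather than with the full ensemble size. Real tree ensembles have deeper trees with feature interactions, where this shortcut does not hold.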

Remove the initial shortcut sync code in viewer/db.js for v3.1. Note that to upgrade to v3.1+ you must upgrade to v3.0 first.

There is no technical difficulty in supporting the includeValue option; it looks like we are just missing it at the API level.

See SO question

In the past, converting through the Number "interface" has been shown to mask bugs in the code.
We should remove these conversions from io.trino.spi.type.TypeUtils#writeNativeValue.
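The method in question is Java, so the Python sketch below is only an analogy for how a permissive numeric conversion can hide a caller's mistake. Both function names are hypothetical and are not part of Trino.

```python
def write_long_lenient(value) -> int:
    """Lenient path: coerces anything int() accepts, so a float is silently truncated."""
    return int(value)


def write_long_strict(value) -> int:
    """Strict path: accept only a genuine int (bool excluded) and fail loudly otherwise."""
    if isinstance(value, bool) or not isinstance(value, int):
        raise TypeError(f"expected int, got {type(value).__name__}")
    return value
```

With the lenient version, a caller that mistakenly passes `3.9` gets `3` back and the bug goes unnoticed; the strict version raises `TypeError` at the point of the mistake.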

... to make it easier to read Vespa documentation on an e-reader / offline

Vespa documentation is generated using Jekyll from .md and .html files. Look into options for generating the artifact as part of site generation (there might be plugins we can use here).

Delta Lake 1.0.0
Spark 3.1.2
Scala 2.12
AdoptOpenJDK-11.0.11+9 (build 11.0.11+9)

The following code throws a NullPointerException. It targets a directory-based Delta table that does not yet exist and uses a generated column.

import io.delta.tables.DeltaTable
DeltaTable.create
  .addColumn(
    DeltaTable.columnBuilder("value")
      .generatedAlwaysAs("true")
      .nullab

Use case:

1.) A user may want to back up all tables but no metadata (users, privileges, etc.) without explicitly listing each table in the CREATE SNAPSHOT statement.

2.) A user may want to transfer users and privileges, custom analyzers, or user-defined functions from one cluster to another without backing up the complete cluster, including all data (tables).

Feature description

Here are 2,533 public repositories matching this topic...

binhnguyennus / awesome-scalability

apache / spark

donnemartin / data-science-ipython-notebooks

ClickHouse / ClickHouse

apache / flink

amark / gun

apache / predictionio

prestodb / presto

yahoo / CMAK

heibaiying / BigData-Notes

andkret / Cookbook

apache / storm

cython / cython

catboost / catboost

h2oai / h2o-3

apache / zeppelin

pachyderm / pachyderm

apache / couchdb

apache / beam

arkime / arkime

tschellenbach / Stream-Framework

hazelcast / hazelcast

apache / ignite

apache / hive

intel-analytics / BigDL

trinodb / trino

vespa-engine / vespa

delta-io / delta

linkedin / datahub

crate / crate
