Apache Spark

Apache Spark is an open source distributed general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
Here are 6,411 public repositories matching this topic...
-
Updated
Nov 4, 2021 - Python
-
Updated
Mar 14, 2022 - Python
-
Updated
Mar 16, 2022 - Go
-
Updated
Mar 5, 2022
At this moment relu_layer op doesn't allow threshold configuration, and legacy RELU op allows that.
We should add configuration option to relu_layer.
-
Updated
Mar 12, 2022 - Python
-
Updated
Feb 26, 2022 - Java
-
Updated
Oct 31, 2019
-
Updated
Feb 9, 2022 - Java
-
Updated
Mar 17, 2022 - Jupyter Notebook
-
Updated
Feb 8, 2022 - Python
-
Updated
Mar 17, 2022 - Java
Refactor GCSLogStore to Java for the new delta-storage artifact.
This also includes moving the tests from GCSLogStoreSuite
to class PublicGCSLogStoreSuite
inside of LogStoreSuite
. See the contributing section in the project issue below.
Also, as the referenced contributing sectio
-
Updated
Apr 24, 2020 - Jsonnet
-
Updated
Mar 17, 2022 - Jupyter Notebook
-
Updated
Jan 20, 2022 - Python
-
Updated
May 26, 2019 - Scala
I have a simple regression task (using a LightGBMRegressor) where I want to penalize negative predictions more than positive ones. Is there a way to achieve this with the default regression LightGBM objectives (see https://lightgbm.readthedocs.io/en/latest/Parameters.html)? If not, is it somehow possible to define (many example for default LightGBM model) and pass a custom regression objective?
-
Updated
May 12, 2021 - Jupyter Notebook
-
Updated
Oct 19, 2021 - JavaScript
Used Spark version
Spark Version: 2.4.4
Used Spark Job Server version
SJS version: v0.11.1
Deployed mode
client on Spark Standalone
Actual (wrong) behavior
I can't get config, when post a job with 'sync=true'. I got it:
http://localhost:8090/jobs/ff99479b-e59c-4215-b17d-4058f8d97d25/config
{"status":"ERROR","result":"No such job ID ff99479b-e59c-4215-b17d-4058f8d97d25"
Created by Matei Zaharia
Released May 26, 2014
- Repository
- apache/spark
- Website
- spark.apache.org
- Wikipedia
- Wikipedia
Describe the bug
Using a time dimension on a runningTotal measure on Snowflake mixes quoted and unquoted columns in the query. This fails the query, because Snowflake has specific rules about quoted columns. Specifically:
So "date_from" <> date_from
To Reproduce
Steps to reproduce