Skip to content
#

data-processing

Here are 628 public repositories matching this topic...

AbhinavTuli
AbhinavTuli commented Mar 22, 2022

🚨🚨 Feature Request

  • A new implementation (Improvement, Extension)

Is your feature request related to a problem?

Currently, if a user tries to access an index that is larger than the dataset length or tensor length, an internal error is thrown which is not easy to understand.

Description of the possible solution

We can catch the error and throw a more descriptive e

enhancement good first issue
jgirault-qs
jgirault-qs commented Jul 23, 2021

Describe the bug
pa.errors.SchemaErrors.failure_cases only returns the first 10 failure_cases

  • I have checked that this issue has not already been reported.
  • I have confirmed this bug exists on the latest version of pandera. 0.6.5
  • (optional) I have confirmed this bug exists on the master branch of pandera.

Note: Please read [this guide](https://matthewrocklin.c

bug help wanted good first issue
pysparkling
svaningelgem
svaningelgem commented Jan 27, 2021

The exception in subject is thrown by the following code:

from datetime import date
from pysparkling.sql.session import SparkSession
from pysparkling.sql.functions import collect_set

spark = SparkSession.Builder().getOrCreate()

dataset_usage = [
    ('steven', 'UUID1', date(2019, 7, 22)),
]
dataset_usage_schema = 'id: string, datauid: string, access_date: date'

df = spa

Improve this page

Add a description, image, and links to the data-processing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-processing topic, visit your repo's landing page and select "manage topics."

Learn more