Skip to content
#

dataframes

Here are 195 public repositories matching this topic...

jgirault-qs
jgirault-qs commented Jul 23, 2021

Describe the bug
pa.errors.SchemaErrors.failure_cases only returns the first 10 failure_cases

  • I have checked that this issue has not already been reported.
  • I have confirmed this bug exists on the latest version of pandera. 0.6.5
  • (optional) I have confirmed this bug exists on the master branch of pandera.

Note: Please read [this guide](https://matthewrocklin.c

bug help wanted good first issue
pdpipe
yarkhinephyo
yarkhinephyo commented Nov 28, 2021

For pipeline stages provided by the pdpipe.basic_stages, supplying conditions to the prec and post keyword arguments may not return the correct error messages.

Example Code

import pandas as pd; import pdpipe as pdp;
df = pd.DataFrame([[1,4],[4,5],[1,11]], [1,2,3], ['a','b'])
pline = pdp.PdPipeline([
  pdp.FreqDrop(2, 'a', prec=pdp.cond.HasAllColumns(['x']))
])
pline.apply(
eland
jack-pappas
jack-pappas commented Dec 7, 2021

riptable currently only supports changing settings (e.g. number of threads to use for calculations and I/O) by calling functions of the library or setting class-level attributes.

It'd be helpful if the default values for these settings -- at least the most important ones -- could be overridden using environment variables, e.g. how numba supports changing the cache path or number of threads to b

enhancement good first issue
woodwork
tamargrey
tamargrey commented Aug 30, 2021
  • As a user, I wish I could access a table's column schema with a column_schemas attribute that is a dictionary of column schemas.
df.ww.column_schemas

This could be useful for helping users understand that they can df.ww.column_schemas[col] instead of df.ww[col].schema better than the columns attribute does.

We should not remove the columns attribute so we don't

good first issue new feature
ocramz
ocramz commented Oct 7, 2020

Add a few useful date/time types from time (https://hackage.haskell.org/package/time) , e.g.

  • POSIXTime
  • Date
    etc.

A checklist for where to add things :

enhancement 🚀 help wanted good first issue

Improve this page

Add a description, image, and links to the dataframes topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the dataframes topic, visit your repo's landing page and select "manage topics."

Learn more