pandas

Currently, we use flake8-rst for running flake8 in code snippets in rst files:

https://github.com/pandas-dev/pandas/blob/2e56a838cf5ed3058df16c11e5ebae862520bab7/.pre-commit-config.yaml#L95-L102

However, flake8-rst isn't maintained, and is currently run in its own environment with a different flake8 version because of incompatibilities with flake8 v4

Task here is:

search around to see

Describe the bug

Streaming Datasets can't be pickled, so any interaction between them and multiprocessing results in a crash.

Steps to reproduce the bug

import transformers
from transformers import Trainer, AutoModelForCausalLM, TrainingArguments
import datasets

ds = datasets.load_dataset('oscar', "unshuffled_deduplicated_en", split='train', streaming=True).with_format("

We're trying to introduce Parquet into our team, and the largest blocker that we've seen is the dreaded "schemas are inconsistent" error message:

RuntimeError: Schemas are inconsistent, try using to_parquet(..., schema="infer"), or pass an explicit pyarrow schema. Such as to_parquet(..., schema={"column1": pa.string()})

This error message is super unhelpful: surely Dask knows what th

Describe the bug
series.unique() returns a cuDF.Series while it returns a numpy.ndarray for pandas.

Steps/Code to reproduce bug

In [1]: import cudf
In [2]: import pandas as pd

In [3]: type(pd.Series([1,1]).unique())
Out[3]: numpy.ndarray

In [4]: type(cudf.Series([1,1]).unique())
Out[4]: cudf.core.series.Series

Expected behavior
I would exp

Reading currencies, alphavantage returns a greeting note ("welcome") and this note raises an error in alphavantage.py line 363.

            elif "Note" in json_response and self.treat_info_as_error:
                raise ValueError(json_response["Note"])

For this reason, alphavantage does not work in home assistant.

I would like to convert a DataFrame to a JSON object the same way that Pandas does with to_dict().

toJSON() treats rows as elements in an array, and ignores the index labels. But to_dict() uses the index as keys.

Here is an example of what I have in mind:

function to_dict(df) {
    const rows = df.toJSON();
    const entries = df.index.map((e, i) => ({ [e]: rows[i] }));

pandas

Here are 16,193 public repositories matching this topic...

jakevdp / PythonDataScienceHandbook

pandas-dev / pandas

donnemartin / data-science-ipython-notebooks

tqdm / tqdm

huggingface / datasets

Describe the bug

Steps to reproduce the bug

waditu / tushare

microsoft / Data-Science-For-Beginners

dask / dask

mwaskom / seaborn

bbfamily / abu

ydataai / pandas-profiling

guipsamora / pandas_exercises

modin-project / modin

ranaroussi / yfinance

saulpw / visidata

codebasics / py

tangyudi / Ai-Learn

rapidsai / cudf

iamseancheney / python_for_data_analysis_2nd_chinese_version

BrambleXu / pydata-notebook

lux-org / lux

RomelTorres / alpha_vantage

biolab / orange3

man-group / dtale

javascriptdata / danfojs

pixie-io / pixie

ResidentMario / missingno

databricks / koalas

TarrySingh / Artificial-Intelligence-Deep-Learning-Machine-Learning-Tutorials

datawhalechina / joyful-pandas

Improve this page

Add this topic to your repo