-
Updated
Apr 14, 2022 - Jupyter Notebook
pandas
Here are 16,193 public repositories matching this topic...
-
Updated
Apr 3, 2022 - Python
Describe the bug
Streaming Datasets can't be pickled, so any interaction between them and multiprocessing results in a crash.
Steps to reproduce the bug
import transformers
from transformers import Trainer, AutoModelForCausalLM, TrainingArguments
import datasets
ds = datasets.load_dataset('oscar', "unshuffled_deduplicated_en", split='train', streaming=True).with_format("
-
Updated
Mar 28, 2022 - Python
- Base README.md
- Quizzes
- Introduction base README
- Defining Data Science README
- Defining Data Science assignment
- Ethics README
- Ethics assignment
- Defining Data README
- Defining Data assignment
- Stats and Probability README
- Stats and Probability assignment
- Working with Data base README
- Rel
We're trying to introduce Parquet into our team, and the largest blocker that we've seen is the dreaded "schemas are inconsistent" error message:
RuntimeError: Schemas are inconsistent, try using
to_parquet(..., schema="infer")
, or pass an explicit pyarrow schema. Such asto_parquet(..., schema={"column1": pa.string()})
This error message is super unhelpful: surely Dask knows what th
-
Updated
May 10, 2022 - Python
-
Updated
Dec 23, 2020 - Python
-
Updated
May 9, 2022 - Python
-
Updated
May 3, 2022 - Jupyter Notebook
-
Updated
May 9, 2022 - Python
-
Updated
May 9, 2022 - Python
-
Updated
May 9, 2022 - Python
-
Updated
Apr 16, 2022 - Jupyter Notebook
-
Updated
Feb 6, 2020
Describe the bug
series.unique()
returns a cuDF.Series
while it returns a numpy.ndarray
for pandas.
Steps/Code to reproduce bug
In [1]: import cudf
In [2]: import pandas as pd
In [3]: type(pd.Series([1,1]).unique())
Out[3]: numpy.ndarray
In [4]: type(cudf.Series([1,1]).unique())
Out[4]: cudf.core.series.Series
Expected behavior
I would exp
-
Updated
May 8, 2018 - Jupyter Notebook
-
Updated
May 2, 2022 - Python
Reading currencies, alphavantage returns a greeting note ("welcome") and this note raises an error in alphavantage.py line 363.
elif "Note" in json_response and self.treat_info_as_error:
raise ValueError(json_response["Note"])
For this reason, alphavantage does not work in home assistant.
-
Updated
May 9, 2022 - Python
-
Updated
May 8, 2022 - TypeScript
to_dict() equivalent
I would like to convert a DataFrame to a JSON object the same way that Pandas does with to_dict()
.
toJSON()
treats rows as elements in an array, and ignores the index labels. But to_dict()
uses the index as keys.
Here is an example of what I have in mind:
function to_dict(df) {
const rows = df.toJSON();
const entries = df.index.map((e, i) => ({ [e]: rows[i] }));
-
Updated
May 10, 2022 - C++
-
Updated
Feb 27, 2022 - Python
-
Updated
Apr 6, 2022 - Python
Improve this page
Add a description, image, and links to the pandas topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the pandas topic, visit your repo's landing page and select "manage topics."
Currently, we use flake8-rst for running flake8 in code snippets in rst files:
https://github.com/pandas-dev/pandas/blob/2e56a838cf5ed3058df16c11e5ebae862520bab7/.pre-commit-config.yaml#L95-L102
However, flake8-rst isn't maintained, and is currently run in its own environment with a different flake8 version because of incompatibilities with flake8 v4
Task here is: