Skip to content
#

Data Science

Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.

Here are 25,559 public repositories matching this topic...

lesteve
lesteve commented Feb 23, 2022

See in #22547

MatplotlibDeprecationWarning: Axes3D(fig) adding itself to the figure is deprecated since 3.4. Pass the keyword argument auto_add_to_figure=False and use fig.add_axes(ax) to suppress this warning. The default value of auto_add_to_figure will change to False in mpl3.5 and True values will no longer work in 3.6.  This is consistent with other Axes classes

We need to rep

Documentation good first issue help wanted
superset
rumbin
rumbin commented Jan 31, 2022

The Mixed Time-Series chart type allows for configuring the title of the primary and the secondary y-axis.
However, while only the title of the primary axis is shown next to the axis, the title of the secondary one is placed at the upper end of the axis where it gets hidden by bar values and zoom controls.

How to reproduce the bug

  1. Create a mixed time-series chart
  2. Configure axi
good first issue #bug validation:validated preset:cares

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • Updated Nov 4, 2021
  • Python
architkulkarni
architkulkarni commented Mar 24, 2022

Here's the reproduction:

import os
import tempfile
from pathlib import Path
from ray._private.runtime_env.packaging import _zip_directory
from zipfile import ZipFile

with tempfile.TemporaryDirectory() as tmp_dir:
    # Prepare test directory
    path = Path(tmp_dir)
    subdir = path / "subdir"
    subdir.mkdir(parents=True)
    file1 = subdir / "file1.txt"
    with file
bug good first issue P2
pytorch-lightning
niberger
niberger commented Mar 24, 2022

🐛 Bug

When using the cross validation loop from the example pl_examples/loop_examples/kfold.py with ddp_spawn strategy, it encounter a SIGABRT Exception and the program crash.

To Reproduce

import os
from pytorch_lightning import seed_everything, Trainer

from pl_examples.loop_examples.kfold import KFoldLoop, LitImageClassifier, MNISTKFoldDataModule


def run():
good first issue example
dash
tirkarthi
tirkarthi commented Jan 12, 2022

Python 3.10 added suggestions for AttributeError and NameError in the error messages. It seems the suggestions are not stored in the exception object but calculated when Error is displayed. There is a note that that this won't work with IPython but it will be good to see if it's feasible. Opening an issue for discussion.

https://bugs.python.org/issue38530
https://docs.python.org/3/whatsnew/3.

vpicouet
vpicouet commented Mar 23, 2022

Bug summary

I think xy[4] = .25, val[0] should be commented in /matplotlib/widgets. py", line 915, in set_val
as it prevents to initialized value for RangeSlider

Code for reproduction

import numpy as np
import matplotlib.pyplot as plt
from matplotlib.widgets import RangeSlider

# generate a fake image
np.random.seed(19680801)
N = 128
img = np.random.randn(N, N
topic: widgets/UI Good first issue
gensim
mpenkov
mpenkov commented Jun 22, 2021

In gensim/models/fasttext.py:

    model = FastText(
        vector_size=m.dim,
        vector_size=m.dim,
        window=m.ws,
        window=m.ws,
        epochs=m.epoch,
        epochs=m.epoch,
        negative=m.neg,
        negative=m.neg,
        # FIXME: these next 2 lines read in unsupported FB FT modes (loss=3 softmax or loss=4 onevsall,
        # or model=3 supervi
bug difficulty easy good first issue fasttext
AnirudhDagar
AnirudhDagar commented Jan 24, 2022

Although the results look nice and ideal in all TensorFlow plots and are consistent across all frameworks, there is a small difference (more of a consistency issue). The result training loss/accuracy plots look like they are sampling on a lesser number of points. It looks more straight and smooth and less wiggly as compared to PyTorch or MXNet.

It can be clearly seen in chapter 6([CNN Lenet](ht

tensorflow-adapt-track good first issue
nni
shenoynikhil98
shenoynikhil98 commented Mar 23, 2022

https://github.com/microsoft/nni/blob/8d5f643c64580bb26a7b10a3c4c9accf617f65b1/nni/compression/pytorch/speedup/jit_translate.py#L382

While trying to speedup my single shot detector, the following error comes up. Any way to fix this,

/usr/local/lib/python3.8/dist-packages/nni/compression/pytorch/speedup/jit_translate.py in forward(self, *args)
    363 
    364         def forward(self, *
danieldeutsch
danieldeutsch commented Jun 2, 2021

Is your feature request related to a problem? Please describe.
I typically used compressed datasets (e.g. gzipped) to save disk space. This works fine with AllenNLP during training because I can write my dataset reader to load the compressed data. However, the predict command opens the file and reads lines for the Predictor. This fails when it tries to load data from my compressed files.

Good First Issue Contributions welcome Feature request