Data Science

Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.

PR #22722 introduced a common method for the validation of the parameters of an estimator. We now need to use it in all estimators.

Please open one PR per estimator or family of estimators (if one inherits from another). The title of the PR should mention which estimator it's dealing with and the description of the PR should begin with towards #23462.

Steps

The estimator must define

The Mixed Time-Series chart type allows for configuring the title of the primary and the secondary y-axis.
However, while only the title of the primary axis is shown next to the axis, the title of the secondary one is placed at the upper end of the axis where it gets hidden by bar values and zoom controls.

How to reproduce the bug

Create a mixed time-series chart
Configure axi

What happened + What you expected to happen

When initializing a Ray Trainer, we provide a logdir argument, and the __init__ method of the Trainer stores it as a logdir class variable.

Then, when creating a Trainable with Trainer.to_tune_trainable(), it in-turn calls _create_tune_trainable(), which does not use self.logdir. So when tune_function is defined inside `_create_tu

Problem

See #3856 . Developer would like the ability to configure whether the developer menu or viewer menu is displayed while they are developing on cloud IDEs like Gitpod or Github Codespaces

Solution

Create a config option

showDeveloperMenu: true | false | auto

where

true: always shows the developer menu locally and while deployed
false: always sho

@Borda

Hi, I see that is_last_batch trainer property isn't documented in https://pytorch-lightning.readthedocs.io/en/stable/common/trainer.html#properties.

I was lucky to find it here.

I feel it would be helpful to have all properties listed there.

Thanks.

cc @Borda @rohitgr7 @akihironitta

Describe your context
Please provide us your environment, so we can easily reproduce the issue.

replace the result of pip list | grep dash below

dash                      2.0.0
dash-bootstrap-components 1.0.0

if frontend related, tell us your Browser, Version and OS
- OS: [e.g. iOS] Windows
- Browser [e.g. chrome, safari]: Chrome 96.0x, Edge 96.0x, Firefox

Currently when a job fails on GHA we upload all 109 MB across ~7k files which takes a surprisingly long time. What we really want is just the images that failed and the computed difference.

This is labeled as "good first issue" because there are no API designs here (it is all configuring CI).

Steps:

sort out how to identify the failed test images (there is a systematic naming conventi

The warnings at

https://ipython.readthedocs.io/en/stable/config/extensions/autoreload.html

do not mention the issues with reloading modules with enums:

Enum and Flag are compared by identity (is, even if == is used (similarly to None))
reloading a module, or importing the same module by a different name, creates new enums (look the same, but are not the same)

Although the results look nice and ideal in all TensorFlow plots and are consistent across all frameworks, there is a small difference (more of a consistency issue). The result training loss/accuracy plots look like they are sampling on a lesser number of points. It looks more straight and smooth and less wiggly as compared to PyTorch or MXNet.

It can be clearly seen in chapter 6([CNN Lenet](ht

In gensim/models/fasttext.py:

    model = FastText(
        vector_size=m.dim,
        vector_size=m.dim,
        window=m.ws,
        window=m.ws,
        epochs=m.epoch,
        epochs=m.epoch,
        negative=m.neg,
        negative=m.neg,
        # FIXME: these next 2 lines read in unsupported FB FT modes (loss=3 softmax or loss=4 onevsall,
        # or model=3 supervi

Bug with GPU Model

Currently, while using pruning methods like TaylorFOWeight Pruner, If I use a model on GPU for getting the metrics (as calculated for getting masks), it fails on line while creating masks. The reason why it fails i

Is your feature request related to a problem? Please describe.
I typically used compressed datasets (e.g. gzipped) to save disk space. This works fine with AllenNLP during training because I can write my dataset reader to load the compressed data. However, the predict command opens the file and reads lines for the Predictor. This fails when it tries to load data from my compressed files.

Data Science

Here are 27,808 public repositories matching this topic...

keras-team / keras

scikit-learn / scikit-learn

Steps

apache / superset

How to reproduce the bug

microsoft / ML-For-Beginners

GokuMohandas / Made-With-ML

CamDavidsonPilon / Probabilistic-Programming-and-Bayesian-Methods-for-Hackers

explosion / spaCy

donnemartin / data-science-ipython-notebooks

eriklindernoren / ML-From-Scratch

ray-project / ray

What happened + What you expected to happen

AMAI-GmbH / AI-Expert-Roadmap

eugeneyan / applied-ml

streamlit / streamlit

Problem

Solution

Lightning-AI / lightning

academic / awesome-datascience

plotly / dash

matplotlib / matplotlib

ipython / ipython

microsoft / Data-Science-For-Beginners

fastai / fastbook

d2l-ai / d2l-en

afshinea / stanford-cs-229-machine-learning

microsoft / recommenders

virgili0 / Virgilio

RaRe-Technologies / gensim

bharathgs / Awesome-pytorch-list

qax-os / excelize

microsoft / nni

Bug with GPU Model

rasbt / python-machine-learning-book

allenai / allennlp

Related Topics