Data Science

Data science is an interdisciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.

Describe the issue linked to the documentation

The "20 newsgroups text" dataset can be loaded in scikit-learn through its built-in dataset functions. The dataset contains some text that is considered culturally insensitive.

Suggest a potential alternative/fix

Add a section called "Data Considerations" to the dataset documentation, possibly above the "Recommendation" section.

Description

In chart maximize mode, the chart's three-dot menu is hidden behind the chart title panel.

Apache Arrow has a first-class tabular file format, Feather, that the Ray Datasets IO layer should support. Combined with Ray Datasets' existing .from_arrow() and .to_arrow() APIs, this would round out our "all-Arrow" experience, which should be as nice as possible given our "distributed Arrow dataset" positioning.

Implementation Note

We currently print a warning when a user sets both a default value in the function that defines the widget and a value via the widget's key in st.session_state.

While we certainly want to do this by default since doing both is not recommended, we should provide a

🚀 Feature

Refactor our internal language for master port and master address in the cluster environments and accelerators.

Motivation

Inclusive language

Pitch

Rename them to main_address and main_port.
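One way such a rename could be rolled out without breaking existing callers is sketched below. This is only an illustration: the class name ClusterEnvironmentSketch, the default values, and the idea of keeping deprecated aliases are assumptions, not something the request specifies.

```python
import warnings


class ClusterEnvironmentSketch:
    """Hypothetical stand-in for a cluster environment class."""

    def __init__(self, main_address: str = "127.0.0.1", main_port: int = 29500):
        # New, preferred attribute names (defaults are placeholders).
        self.main_address = main_address
        self.main_port = main_port

    @property
    def master_address(self) -> str:
        # Deprecated alias (assumption): warn, then forward to the new name.
        warnings.warn(
            "master_address is deprecated, use main_address",
            DeprecationWarning,
            stacklevel=2,
        )
        return self.main_address

    @property
    def master_port(self) -> int:
        # Deprecated alias (assumption): warn, then forward to the new name.
        warnings.warn(
            "master_port is deprecated, use main_port",
            DeprecationWarning,
            stacklevel=2,
        )
        return self.main_port
```

Reading env.master_port on an instance still returns the port but emits a DeprecationWarning, which gives downstream code time to migrate.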

In recent versions (I can't say exactly since when), there seems to be an off-by-one error in dcc.DatePickerRange. I set max_date_allowed = datetime.today().date(), but in the calendar, yesterday is the maximum date allowed. I see it in my apps, and it is also present in the first example on the DatePickerRange documentation page.
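A possible stopgap can be sketched as follows, assuming (the report does not confirm this) that the picker is treating max_date_allowed as an exclusive upper bound. The helper name inclusive_max_date is hypothetical, and only the standard library is used.

```python
from datetime import date, timedelta


def inclusive_max_date(last_selectable: date) -> date:
    """Return the day after `last_selectable`.

    Hypothetical helper: assumes the widget treats its upper bound as
    exclusive, so shifting it by one day keeps `last_selectable` pickable.
    """
    return last_selectable + timedelta(days=1)


# Value one might pass as max_date_allowed so that today stays selectable.
max_date_allowed = inclusive_max_date(date.today())
```

If the underlying behavior is fixed upstream, a shift like this would let users pick tomorrow, so it is best treated as temporary.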

The docs for IPython.core.interactiveshell.InteractiveShell.set_custom_exc have horribly mangled a warning message into a list of arguments. I can't work out at a glance why this is happening; it might be a sphinx.ext.napoleon bug, or a sphi

Problem

The folder contains "fontlist-v330.json" in my case.

Proposed solution

Move the directory to a subdirectory in %APPDATA%, where it belongs (as the name suggests).

In gensim/models/fasttext.py:

    model = FastText(
        vector_size=m.dim,
        window=m.ws,
        epochs=m.epoch,
        negative=m.neg,
        # FIXME: these next 2 lines read in unsupported FB FT modes (loss=3 softmax or loss=4 onevsall,
        # or model=3 supervi

Is your feature request related to a problem? Please describe.
I want to evaluate multiple datasets (same formatting, they can share the same dataset reader). The "evaluate" command takes much longer to load the model than to evaluate.

Describe the solution you'd like
Support passing multiple input files and output files to the "evaluate" command, so the model is loaded only once.
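The idea of loading once and evaluating many files can be sketched as below. This is not the library's actual command implementation: evaluate_many, load_model, and evaluate_file are hypothetical names, and the two callables stand in for whatever loads the archive and scores one dataset.

```python
from typing import Callable, Dict, Sequence, TypeVar

Model = TypeVar("Model")


def evaluate_many(
    load_model: Callable[[], Model],
    evaluate_file: Callable[[Model, str], Dict[str, float]],
    input_files: Sequence[str],
    output_files: Sequence[str],
) -> Dict[str, Dict[str, float]]:
    """Load the model once, then evaluate each input file in turn.

    Returns the metrics keyed by the matching output path; writing them
    to disk is left to the caller.
    """
    if len(input_files) != len(output_files):
        raise ValueError("need exactly one output file per input file")

    model = load_model()  # the expensive step, performed a single time

    results: Dict[str, Dict[str, float]] = {}
    for in_path, out_path in zip(input_files, output_files):
        results[out_path] = evaluate_file(model, in_path)
    return results
```

Pairing inputs and outputs positionally keeps the interface close to the single-file case, at the cost of requiring both lists to have the same length.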

**Describe alternatives you've cons

Here are 20,661 public repositories matching this topic...

keras-team / keras

scikit-learn / scikit-learn

apache / superset

GokuMohandas / MadeWithML

CamDavidsonPilon / Probabilistic-Programming-and-Bayesian-Methods-for-Hackers

donnemartin / data-science-ipython-notebooks

explosion / spaCy

eriklindernoren / ML-From-Scratch

academic / awesome-datascience

ray-project / ray

microsoft / ML-For-Beginners

streamlit / streamlit

PyTorchLightning / pytorch-lightning

plotly / dash

ipython / ipython

matplotlib / matplotlib

AMAI-GmbH / AI-Expert-Roadmap

virgili0 / Virgilio

fastai / fastbook

RaRe-Technologies / gensim

afshinea / stanford-cs-229-machine-learning

bharathgs / Awesome-pytorch-list

eugeneyan / applied-ml

rasbt / python-machine-learning-book

microsoft / recommenders

d2l-ai / d2l-en

hangtwenty / dive-into-machine-learning

allenai / allennlp

0xnr / awesome-bigdata

microsoft / nni
