Skip to content
#

Data Science

Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.

Here are 20,780 public repositories matching this topic...

glemaitre
glemaitre commented Aug 10, 2021

I just discover that we have a helper function to validate scalar:
https://scikit-learn.org/stable/modules/generated/sklearn.utils.check_scalar.html

Since this helper could help to get consistent error types and messages, I was wondering if we could make a long-running issue to introduce this helper everywhere possible.

I think this could be a good issue for first contributors and short spr

superset
kevin-struckhoff
kevin-struckhoff commented Aug 11, 2021

When the View Query Option is selected, the screen pops up in the middle of the web page and obscures the chart I'm working on. I'd like to be able to move it around so I can see the chart results and the query at the same time. It would be nice to also be able to resize the view query screen to minimize its' presence on the web page. There's a lot of blank space on the right side of the view quer

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • Updated May 13, 2021
  • Python
pytorch-lightning
dash
astromancer
astromancer commented Aug 4, 2021

Bug summary

Figures saved via the savefig method are not saved relative to the rcParams['savefig.directory'] path. Looking at the source code, it seems that this path is only prepended to the filename if the save request came via the interactive toolbar. Not sure if this is done intentionally, but it seems to me that the savefig.directory should always be respected.

In the exampl

gensim
c4n
c4n commented Jul 30, 2021

Is your feature request related to a problem? Please describe.
I want to evaluate multiple datasets (same formatting, they can share the same dataset reader). The "evaluate" command takes much longer to load the model than to evaluate.

Describe the solution you'd like
support passing multiple input files and output files to the "evaluate" command

**Describe alternatives you've cons

nni