Skip to content
#

Data Science

Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.

Here are 20,439 public repositories matching this topic...

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • Updated May 13, 2021
  • Python
dash
pytorch-lightning
ananthsub
ananthsub commented Jul 30, 2021

Discussed in PyTorchLightning/pytorch-lightning#8630

I have a single GPU, but I would like to spawn multiple replicas on that single GPU and train a model with DDP. Of course, each replica would have to use a smaller batch size in order to fit in memory. (For my use case, I am not interested in having a single replica with a large batch size). I tried to pas

anntzer
anntzer commented Jul 28, 2021

Bug summary

The new matplotlib.colors.make_norm_from_scale helper dynamically generates a norm class from a scale class. Currently, in the codebase, it is only used as a decorator to create "toplevel" classes (e.g., it is used to generate LogNorm from LogScale, etc.), but it can also be used within other functions to dynamically generate a norm class based on a user-given arbitrary scale (

gensim
c4n
c4n commented Jul 30, 2021

Is your feature request related to a problem? Please describe.
I want to evaluate multiple datasets (same formatting, they can share the same dataset reader). The "evaluate" command takes much longer to load the model than to evaluate.

Describe the solution you'd like
support passing multiple input files and output files to the "evaluate" command

**Describe alternatives you've cons

nni