-
Updated
Oct 4, 2020 - Python
distributed-training
Here are 44 public repositories matching this topic...
We would like to forward a particular 'key' column which is part of the features to appear alongside the predictions - this is to be able to identify to which set of features a particular prediction belongs to. Here is an example of predictions output using the tensorflow.contrib.estimator.multi_class_head:
{"classes": ["0", "1", "2", "3", "4", "5", "6", "7", "8", "9"],
"scores": [0.068196
I have the same hardware envs, same network, but I could not get the result as you, almost half as you. Any best practices and experience? thanks very much! for bytePS with 1 instance and 8 GPU, I have similar testing result.
-
Updated
Oct 2, 2020 - Python
-
Updated
Oct 3, 2020 - Go
-
Updated
Oct 4, 2020 - Python
-
Updated
Mar 12, 2020 - Python
-
Updated
Nov 19, 2018 - Python
-
Updated
May 13, 2019
-
Updated
Oct 2, 2020 - Go
-
Updated
Aug 7, 2020 - Python
Currently, AdaptDL will change the batch size whenever:
- The job is restarted.
- A new epoch is started.
This can cause the batch size to fluctuate frequently.
Instead, we should only change the batch size if the new batch size will cause a noticeable improvement in the predicted speedup (e.g. by 5% or more). Also consider adding a penalty term when finding the preferred batch size to
-
Updated
Jun 22, 2020 - Python
-
Updated
Jun 13, 2020 - Jupyter Notebook
-
Updated
Jun 5, 2020 - Python
-
Updated
Jun 11, 2020 - Python
-
Updated
Oct 2, 2020 - Java
-
Updated
Sep 28, 2020 - C++
-
Updated
Sep 7, 2020 - Python
-
Updated
Sep 23, 2020 - Jupyter Notebook
-
Updated
Dec 18, 2018 - Shell
-
Updated
Jul 8, 2019 - HCL
-
Updated
Sep 26, 2020 - Python
-
Updated
Feb 4, 2020 - Jupyter Notebook
-
Updated
Apr 16, 2019 - Python
-
Updated
Aug 25, 2020 - Python
-
Updated
Mar 17, 2020 - Python
-
Updated
Nov 14, 2018 - Python
-
Updated
Sep 26, 2019 - Python
Improve this page
Add a description, image, and links to the distributed-training topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the distributed-training topic, visit your repo's landing page and select "manage topics."
Please can you train ghostnet.
(i don't have the imagenet dataset)