
Differences between .data and .detach #6990

Closed
jay960702 opened this issue Apr 26, 2018 · 7 comments

@jay960702 commented Apr 26, 2018

Issue description

Hi all,

I am not clear about the difference between .data and .detach() in the latest PyTorch 0.4.
For example:

a = torch.tensor([1., 2., 3.], requires_grad=True)
b = a.data
c = a.detach()

So is b not the same as c?

Here is a part of the 'PyTorch 0.4.0 Migration Guide':

"However, .data can be unsafe in some cases. Any changes on x.data wouldn’t be tracked by autograd, and the computed gradients would be incorrect if x is needed in a backward pass. A safer alternative is to use x.detach(), which also returns a Tensor that shares data with requires_grad=False, but will have its in-place changes reported by autograd if x is needed in backward."

Can anyone explain this sentence in more detail: "but will have its in-place changes reported by autograd if x is needed in backward"? Thanks!
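
For reference, both calls return a tensor that shares storage with a and has requires_grad=False, which is why they look interchangeable at first glance. A minimal sketch, assuming PyTorch >= 0.4:

import torch

a = torch.tensor([1., 2., 3.], requires_grad=True)
b = a.data       # shares storage with a, requires_grad=False
c = a.detach()   # also shares storage with a, requires_grad=False

print(b.requires_grad, c.requires_grad)   # False False
print(b.data_ptr() == a.data_ptr())       # True: same underlying storage
print(c.data_ptr() == a.data_ptr())       # True: same underlying storage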

@zou3519 (Contributor) commented Apr 26, 2018

Here's an example. If you use .detach() instead of .data, gradient computation is guaranteed to be correct:

>>> a = torch.tensor([1,2,3.], requires_grad = True)
>>> out = a.sigmoid()
>>> c = out.detach()
>>> c.zero_()  
tensor([ 0.,  0.,  0.])

>>> out  # modified by c.zero_() !!
tensor([ 0.,  0.,  0.])

>>> out.sum().backward()  # Requires the original value of out, but that was overwritten by c.zero_()
RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation

As opposed to using .data:

>>> a = torch.tensor([1,2,3.], requires_grad = True)
>>> out = a.sigmoid()
>>> c = out.data
>>> c.zero_()
tensor([ 0.,  0.,  0.])

>>> out  # out  was modified by c.zero_()
tensor([ 0.,  0.,  0.])

>>> out.sum().backward()
>>> a.grad  # The result is very, very wrong because `out` changed!
tensor([ 0.,  0.,  0.])

I'll leave this issue open: we should add an example to the migration guide and clarify that section.

@jay960702 (Author) commented Apr 26, 2018

Hi Richard @zou3519

Thanks for your reply!
Your example is clear. Any in-place change on x.detach() will cause an error when x is needed in backward, so .detach() is the safer way to exclude subgraphs from gradient computation.
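
A minimal sketch of that stop-gradient pattern (the names pred and target are illustrative, not from this thread):

import torch

# Hypothetical stop-gradient pattern: the target branch is excluded from
# gradient computation via .detach(), so only `pred` receives gradients.
x = torch.randn(4, requires_grad=True)

pred = x * 2.0                 # branch we want to train
target = (x + 1.0).detach()    # branch treated as a constant by autograd

loss = ((pred - target) ** 2).sum()
loss.backward()

print(x.grad)                  # gradient flows only through `pred`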

@yl-jiang commented Mar 24, 2019

Why does the following code work?

>>> a = torch.tensor([1,2,3.], requires_grad = True)
>>> out = a.sigmoid().sum()
>>> c = out.data
>>> c.zero_()
tensor(0.)

>>> out
tensor(0.)

>>> out.backward()
>>> a.grad
tensor([0.1966, 0.1050, 0.0452])

@asanakoy (Contributor) commented Jul 25, 2019

Are there any use cases where .data is preferred over .detach()?
Is .data deprecated in PyTorch 1.x?

@peterzsj6 commented Sep 15, 2020

> [quoting @zou3519's example above]

Thank you for your example, but I saw someone else's video saying that autograd will check the version of the tensor to prevent this from happening. I am new to this, so I am a little confused.
https://youtu.be/MswxJw-8PvE?t=323
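
One way to look at the version counter that video refers to is the internal _version attribute (not a public API, so this is only an illustrative sketch and may behave differently across releases):

import torch

a = torch.tensor([1., 2., 3.], requires_grad=True)
out = a.sigmoid()

print(out._version)   # 0: no in-place modification yet

c = out.detach()
c.zero_()             # in-place op on a tensor sharing out's storage

print(out._version)   # bumped to 1: backward() compares this against the
                      # version recorded when the tensor was saved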

@wtiman520209 commented Jul 31, 2021

> [quoting @yl-jiang's question above]

Because the value of out is not used when computing the gradient (sum's backward does not need its output's value, and sigmoid's backward uses the intermediate sigmoid output, which is a different tensor from out), the computed gradient w.r.t. a is still correct even though out's value was changed. With tensor.detach(), autograd can detect whether tensors involved in computing the gradient have been changed in place, but tensor.data has no such check.
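
To make that concrete, here is a sketch contrasting the two situations under the semantics described in this thread (newer PyTorch releases may behave differently): modifying the final sum through .data is harmless because its value is never needed in backward, while modifying the intermediate sigmoid output the same way silently corrupts the gradient.

import torch

a = torch.tensor([1., 2., 3.], requires_grad=True)
s = a.sigmoid()        # sigmoid's backward needs this output value
out = s.sum()          # sum's backward does not need out's value

out.data.zero_()       # harmless: out is not saved for the backward pass
s.data.zero_()         # harmful: s IS needed, and .data hides the change

out.backward()
print(a.grad)          # silently wrong: tensor([0., 0., 0.])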
