
All Questions

1 vote
0 answers
104 views

Is the XLA-HLO different for each GPU device?

I have the code below to get the number of FLOPs in JAXPr.eqns. But when I run the code on different GPU devices, I get different FLOP counts. For example, on an A100 80GB GPU, I received more ...
YuGyoung Yun
1 vote
1 answer
199 views

Looking for a tool to predict the runtime of an XLA-HLO computational graph

I'm looking for a tool that prints the runtime when given an XLA-HLO computational graph. I know there is an HLO cost model (an analytical model) for printing the FLOPs of an operator node for computational ...
YuGyoung Yun
1 vote
1 answer
514 views

TensorFlow with XLA doesn't fully utilize CPU capacity

I have created a Monte Carlo simulation model implemented in TensorFlow 2.5. The model mostly consists of vector multiplications inside a tf.while_loop. I am benchmarking the performance on a Linux ...
photon1981