All Questions
Tagged with xla tensorflow2.0
3 questions
1
vote
0
answers
104
views
Is the XLA-HLO different for each GPU device?
I got the below code for get the number of the flops in JAXPr.eqns.
But when I run the code with different GPU devices, I get the different FLOPs numbers.
For example, A100 80GB GPU, I received more ...
1
vote
1
answer
199
views
looking for a tool to predict runtime of XLA-HLO computational graph
I'm looking for a tool to print the runtime when given the computational graph of XLA-HLO.
I know there are HLO cost model (analytical model) for print the FLOPs of operator node for computational ...
1
vote
1
answer
514
views
Tensorflow with XLA doesn't fully utilize CPU capacity
I have created a Monte-Carlo simulation model implemented in Tensorflow 2.5. The model mostly consists of vector multiplications inside a tf.while_loop. I am benchmarking the performance on a Linux ...