TensorFlow gives different results on INTEL and AMD CPUs #56529
Comments
Hi @tilakrayal, I had to reopen this issue because I wanted to refer back to the code for a clarification. I tried converting the input tensors (which are in float32) in the above code (test.py) to float64 before running the session, executed sess.run with the float64 tensors, and finally converted the float64 result back to float32. I used TF v2.3 to do this. For this modified test.py, the final re-converted answer from TF v2.3 matches the TF v2.9 answer for the original test.py, where only float32 tensors are used. And with this modified test.py, TF v2.3 no longer shows any difference across INTEL and AMD. Does that mean TF v2.9 implicitly uses 64-bit math even when the tensors are float32 while using tf.Session? Please let me know your views on this.

Note: please find below the code for this experiment.

test.py (modified)

import numpy as np
import tensorflow as tf
import os
os.environ['TF_CPP_MIN_LOG_LEVEL'] = '0' #debug mode
np.set_printoptions(precision=7, floatmode="fixed")
print ("We are using Tensorflow version", tf.__version__)
################ Executing in non-eager mode
print(tf.executing_eagerly())
tf.compat.v1.disable_v2_behavior()
print(tf.executing_eagerly())
################ float32 inputs
# float32 NumPy array
a = np.arange(100, dtype=np.float32)
# The same array with the same dtype in TensorFlow
a_tf = tf.constant(a, dtype=tf.float32)
############### Square root with NumPy
sqrt = np.sqrt(a)
print(sqrt)
print(type(sqrt))
sqrt.tofile('../np_exp/sqrt.raw')
a.tofile('../np_exp/a_np.raw')
############### Square root with TensorFlow
with tf.compat.v1.Session() as sess:
    print(a_tf.dtype)
    # Cast the float32 input up to float64 before taking the square root
    a_tf = tf.cast(a_tf, tf.float64)
    print(a_tf.dtype)
    sqrt_tf = sess.run(tf.sqrt(a_tf))
    print(sqrt_tf)
    print(type(sqrt_tf), sqrt_tf.dtype)
    # Convert the float64 result back down to float32
    sqrt_tf = sqrt_tf.astype('float32')
    print(sqrt_tf)
    print(type(sqrt_tf), sqrt_tf.dtype)
    sqrt_tf.tofile('./sqrt.raw')

Thank you!
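For reference, the float64 round trip can be checked with NumPy alone. The following is a minimal sketch (not part of the original test.py) that counts how many elements, if any, differ between a direct float32 sqrt and a float64 sqrt truncated back to float32:

import numpy as np

a = np.arange(100, dtype=np.float32)
# sqrt computed directly in float32
direct = np.sqrt(a)
# sqrt computed in float64, then truncated back to float32
roundtrip = np.sqrt(a.astype(np.float64)).astype(np.float32)
# Compare the two paths element by element
diff = np.abs(direct.astype(np.float64) - roundtrip.astype(np.float64))
print(np.count_nonzero(diff), "elements differ; max abs diff:", diff.max())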
Hi @tilakrayal, I found an example where even TF v2.9 doesn't give the same result on AMD and INTEL CPUs. Please use the same compare.py file to compare the output dumps. For the example below, I see a difference in the TF v2.9 dumps but not in the NumPy dumps. Please let me know how to resolve this. Thank you!

test_1.py

import numpy as np
import tensorflow as tf
print(tf.__version__)
print(tf.executing_eagerly())
tf.compat.v1.disable_v2_behavior()
print(tf.executing_eagerly())
x = np.array([
24.538005828857422,
18.491443634033203,
87.84298706054688,
14.998174667358398,
19.956972122192383,
8.451898574829102,
13.764676094055176,
8.09897518157959,
7.404653549194336,
27.5488338470459,
24.450223922729492,
30.647415161132812,
141.6356658935547,
12.053257942199707,
312.13714599609375,
3.22926926612854,
4.638835906982422,
5.29040002822876,
17.960458755493164,
24.698667526245117,
15.1753511428833,
63.419395446777344,
19.004623413085938,
1.5544555187225342,
83.3165054321289,
126.32454681396484,
88.05239868164062,
21.876977920532227,
0.2955206036567688,
7.340531349182129,
52.52980422973633,
9.621030807495117,
12.051604270935059,
783.888427734375,
34.49824523925781,
5.958560466766357,
11.733049392700195,
330.6155700683594,
17.118879318237305,
15.840741157531738,
4.088183879852295,
24.647945404052734,
0.08141398429870605,
17.852745056152344,
2.8441035747528076,
1.9566246271133423,
27.920806884765625,
15.26904010772705,
0.43852341175079346,
4.032613754272461,
5.299862384796143,
4.817346096038818,
23.03229331970215,
41.89274978637695,
21.65822410583496,
2.5178189277648926,
5.134634017944336,
11.193047523498535,
2.8467650413513184,
63.14961242675781,
26.825477600097656,
106.63838958740234,
14.791544914245605,
13.02008056640625,
0.5024715065956116,
9.236507415771484,
4.46705961227417,
12.719013214111328,
1.9253795146942139,
23.949853897094727,
1.091653823852539,
11.166877746582031,
90.74140167236328,
3.9178109169006348,
171.3546142578125,
18.018465042114258,
112.39656066894531,
24.57540512084961,
11.636059761047363,
18.60203742980957,
1.9045045375823975,
37.4972038269043,
87.38375854492188,
21.526424407958984,
19.116044998168945,
3.452946186065674,
8.576736450195312,
28.705116271972656,
23.794410705566406,
9.662251472473145,
4.736318111419678,
23.988988876342773,
6.401980400085449,
216.71377563476562,
27.839820861816406,
4.771946430206299
], np.float32)
y = np.array([0.0010000000474974513], np.float32)
x1 = np.array([
1.0001689195632935,
0.8780577182769775,
0.8868989944458008,
0.894711434841156,
0.944614827632904,
1.033501148223877,
2.1012930870056152,
1.6228097677230835,
1.1083400249481201,
1.2399554252624512,
1.5854648351669312,
0.7947320342063904,
0.9162285923957825,
1.3240278959274292,
0.8412530422210693,
1.233343482017517,
1.02128005027771,
0.9676418900489807,
1.2707772254943848,
1.1182206869125366,
1.1589807271957397,
1.3174852132797241,
0.8395540118217468,
0.5670318603515625,
1.9759927988052368,
1.3380725383758545,
2.000969886779785,
0.7145503759384155,
0.6299505233764648,
1.2315846681594849,
1.009231448173523,
0.6337988972663879,
0.6951360702514648,
0.9353156685829163,
0.9055959582328796,
0.9804213643074036,
0.609941303730011,
1.3276393413543701,
0.9258073568344116,
1.2211651802062988,
1.1829502582550049,
1.3320400714874268,
0.5720718502998352,
1.000986099243164,
1.21437668800354,
1.0850956439971924,
0.8027282357215881,
1.002602458000183,
2.269151449203491,
0.9027953743934631,
0.9045624136924744,
1.8146674633026123,
0.9488409757614136,
1.5350325107574463,
0.9284224510192871,
1.146897792816162,
1.0752787590026855,
0.7175253629684448,
1.0148614645004272,
0.9238283038139343,
1.018907904624939,
1.0761818885803223,
3.391054153442383,
1.348015546798706,
0.5962985754013062,
1.0098881721496582,
1.26979398727417,
1.4007360935211182,
1.1175923347473145,
2.063180685043335,
0.6979321837425232,
1.17123281955719,
0.9045684933662415,
1.1967644691467285,
1.0601277351379395,
1.3861143589019775,
1.0138226747512817,
0.8322133421897888,
1.0374521017074585,
1.0076591968536377,
0.6770474910736084,
1.9207227230072021,
1.059730052947998,
0.7887228727340698,
0.9072004556655884,
1.187535285949707,
1.1741851568222046,
1.766964316368103,
1.5409915447235107,
1.1530601978302002,
0.9437809586524963,
0.9482491612434387,
1.4560399055480957,
1.234002947807312,
1.2194430828094482,
1.246254324913025
], np.float32)
x2 = np.array([
-5.180166244506836,
3.7005016803741455,
-10.90788745880127,
4.012871265411377,
4.429512977600098,
0.3497477173805237,
-3.9566190242767334,
-1.8509852886199951,
-0.21952253580093384,
-0.06212245300412178,
-5.093024253845215,
-6.36647367477417,
10.501260757446289,
-0.23397648334503174,
26.599699020385742,
-0.043404266238212585,
-0.039516013115644455,
-0.23252145946025848,
-0.3733890950679779,
-8.798702239990234,
-1.458726167678833,
-10.43361759185791,
2.4384193420410156,
-0.4310355484485626,
-11.485552787780762,
-35.93326187133789,
-11.777127265930176,
2.0529074668884277,
0.17680668830871582,
0.3521166443824768,
-16.207395553588867,
3.020942211151123,
-2.683389902114868,
-31.045930862426758,
6.103168487548828,
0.9636617302894592,
-1.7291356325149536,
28.085447311401367,
-4.286806583404541,
-0.5121002793312073,
-0.05453529953956604,
-0.004477453883737326,
0.07280711829662323,
4.284749507904053,
-0.012386074289679527,
-0.1143072172999382,
4.45491361618042,
-0.6973512172698975,
0.061323754489421844,
0.31690242886543274,
0.02550177276134491,
-0.9771645665168762,
5.0788984298706055,
-5.232454299926758,
-5.046350479125977,
-0.2617790400981903,
-0.0006600793567486107,
-2.8525986671447754,
-0.12362352758646011,
8.20291519165039,
4.711668491363525,
10.515141487121582,
-4.041153430938721,
-0.16640359163284302,
-0.33183470368385315,
4.820765972137451,
0.13425064086914062,
0.009547995403409004,
-0.1804608851671219,
-5.927944660186768,
0.13916325569152832,
-0.5243402123451233,
-16.569602966308594,
-0.6729446649551392,
11.393829345703125,
-0.26097437739372253,
-21.679431915283203,
4.195556163787842,
0.19142165780067444,
3.6716952323913574,
0.3988632559776306,
-7.73115348815918,
-8.86845874786377,
3.6867213249206543,
-4.281424522399902,
-0.7083483338356018,
-17.665390014648438,
-6.130855560302734,
-7.333799839019775,
-0.12031694501638412,
-0.02339952066540718,
-5.354321479797363,
0.047780346125364304,
19.933006286621094,
0.3705201745033264,
-0.4028393626213074
], np.float32)
x3 = np.array([
1.2550855875015259,
0.8654487729072571,
0.6308754682540894,
0.9816827178001404,
1.0548738241195679,
0.012524792924523354,
-0.839571475982666,
-0.0006424338207580149,
-0.04649466648697853,
-0.037287574261426926,
0.06486305594444275,
0.9137144684791565,
0.8428932428359985,
-0.01175174955278635,
0.8376722931861877,
-0.030319122597575188,
-0.04582984372973442,
-0.018306005746126175,
0.09186185151338577,
0.17019498348236084,
-0.020119082182645798,
0.22337554395198822,
0.4282861351966858,
3.9934964179992676,
-0.4508689045906067,
0.4686657786369324,
-0.1897992491722107,
0.29383572936058044,
0.2193099409341812,
-0.1484094113111496,
0.31675228476524353,
0.5295975804328918,
1.8150707483291626,
2.0527231693267822,
1.086689829826355,
-0.017538614571094513,
2.948042869567871,
0.4877329468727112,
4.95603084564209,
-0.029376821592450142,
-0.07688342779874802,
-0.037382662296295166,
0.17505115270614624,
1.08339524269104,
-0.07510468363761902,
-0.024961132556200027,
0.540581464767456,
0.7190515398979187,
-3.110440492630005,
-0.057315193116664886,
-0.01705225743353367,
0.6796309947967529,
1.0900826454162598,
0.39591458439826965,
4.2813401222229,
-0.07372663915157318,
-0.04945394769310951,
2.074993133544922,
-0.020864002406597137,
0.39534062147140503,
1.0341440439224243,
1.0517014265060425,
-1.7383136749267578,
-0.033969223499298096,
3.984095335006714,
-0.300537109375,
-0.054953258484601974,
-0.05139319971203804,
-0.0513877235352993,
-0.6203671097755432,
-1.0038613080978394,
-0.060277827084064484,
-0.0579550676047802,
-0.03466716781258583,
1.1362496614456177,
-0.08225323259830475,
1.2022327184677124,
0.7995136976242065,
0.01857675611972809,
1.041009783744812,
0.25889095664024353,
-0.6596677303314209,
1.6385159492492676,
0.739734947681427,
4.812783718109131,
-0.05927223339676857,
-0.12917836010456085,
-0.684905469417572,
-0.426142156124115,
-0.0003853600355796516,
-0.020870279520750046,
4.774664878845215,
-0.11399747431278229,
0.8400925397872925,
0.018678396940231323,
-0.10171803832054138
], np.float32)
graph = tf.Graph()
with graph.as_default():
    xt = tf.constant(x, dtype=tf.float32)
    yt = tf.constant(y, dtype=tf.float32)
    x1t = tf.constant(x1, dtype=tf.float32)
    x2t = tf.constant(x2, dtype=tf.float32)
    x3t = tf.constant(x3, dtype=tf.float32)
    # t6 = x3 - x2 * (x1 * rsqrt(x + y))
    t1 = tf.add(xt, yt)
    t2 = tf.math.rsqrt(t1)
    t4 = tf.math.multiply(x1t, t2)
    t5 = tf.math.multiply(x2t, t4)
    t6 = tf.math.subtract(x3t, t5)
print(tf.__version__)
with tf.compat.v1.Session(graph=graph) as session:
    ans = session.run(t6)
    ans.tofile('./AMD/ans.raw')
# The same computation in NumPy for comparison
ans = np.subtract(x3, np.multiply(np.multiply(np.divide(1, np.sqrt(np.add(x, y))), x1), x2))
print(ans, ans.dtype)
ans.tofile('../np_test/AMD/ans.raw')
Hi @tilakrayal, the folder layout I used is:

tf_exp
  -- AMD
  -- INTEL
np_exp
  -- AMD
  -- INTEL

Run the script inside the tf_exp folder; it will create the relevant raw dumps in the respective folders. Change './AMD/ans.raw' to './INTEL/ans.raw' inside test_1.py when running on the INTEL CPU. You can change the relevant paths accordingly. Thank you!
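The exact commands are not given in the thread; a plausible sequence is sketched below. The compare.py arguments are purely hypothetical, since its interface is never shown above:

cd tf_exp
python test_1.py    # writes ./AMD/ans.raw or ./INTEL/ans.raw, depending on the path edited above
cd ..
python compare.py tf_exp/AMD/ans.raw tf_exp/INTEL/ans.raw    # hypothetical invocation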
Hi @tilakrayal, @gadagashwini, I explored the TensorFlow source code and finally got a TF build that doesn't show any difference across INTEL and AMD CPUs. I disabled the EIGEN_FAST_MATH macro (which is enabled by default) while building TF from source, and that worked (tried this on v1.15). Thank you!
@GChaitanya2001 Hey, I came across the same inconsistency issue when building TF from source. Do you mind sharing where you made the change to disable the macro? Thanks.
@xinario Hi, you can disable the macro by setting it to 0. You can use a --copt option to do that while building TensorFlow from source.
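For concreteness, a sketch of what such a build invocation could look like. The pip-package target shown is the standard TF v1.x build target; only the -DEIGEN_FAST_MATH=0 define is the essential part, and any other flags from your usual build (e.g. --config options) would be added alongside it:

bazel build --copt=-DEIGEN_FAST_MATH=0 //tensorflow/tools/pip_package:build_pip_package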
@GChaitanya2001 Thank you very much for the reply. Do you mind explaining a bit about the cause of the difference, i.e., why would EIGEN_FAST_MATH result in this difference on different CPUs? I also experienced a tiny difference with a customized build across Mac and Windows. I'm wondering if there are other types of optimization going on there.
Issue Type: Bug
Source: source
Tensorflow Version: 2.3
Custom Code: Yes
OS Platform and Distribution: Linux Ubuntu 18.04
Mobile device: No response
Python version:
Bazel version: 0.26.1
GCC/Compiler version: No response
CUDA/cuDNN version: No response
GPU model and memory: No response
Current Behaviour?
I tried running a sample sqrt computation on INTEL and AMD CPUs. I used a tolerance of 0.0000001, i.e., values are printed in the log output only if the difference between corresponding values is >= 0.0000001.
The values obtained on the INTEL and AMD CPUs do not match.
Note:
Steps to reproduce building TF from source: provided “-march=x86-64” for the --copt and --host_copt flags while doing the bazel build, and set --config=v1.

Standalone code to reproduce the issue
test.py
compare.py - script to compare raw files
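The actual compare.py is not included in this thread. Below is a minimal sketch of what such a raw-dump comparison could look like, assuming both files are flat float32 dumps written with ndarray.tofile; the 1e-7 tolerance is taken from the description above, everything else is an assumption:

import sys
import numpy as np

# Hypothetical reconstruction of compare.py: load two flat float32 dumps
# and print every element pair whose absolute difference is at least the
# tolerance mentioned in this issue (1e-7).
a = np.fromfile(sys.argv[1], dtype=np.float32)
b = np.fromfile(sys.argv[2], dtype=np.float32)
diff = np.abs(a.astype(np.float64) - b.astype(np.float64))
for i in np.nonzero(diff >= 1e-7)[0]:
    print(i, a[i], b[i], diff[i])
print("mismatches:", np.count_nonzero(diff >= 1e-7), "of", a.size)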
Sample commands to run
Relevant log output