Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

graph output of uff file for TensorRT #48

Open
JadBatmobile opened this issue Feb 20, 2019 · 3 comments
Open

graph output of uff file for TensorRT #48

JadBatmobile opened this issue Feb 20, 2019 · 3 comments

Comments

@JadBatmobile
Copy link

Hey Andres,

I believe i understand whats going on, but id like to be sure. the UFF file for deploying the model in tensorrt outputs the unnormalized logits layer from the neural network, as opposed to the mask?

@tano297
Copy link
Member

tano297 commented Feb 23, 2019

Correct!

@JadBatmobile
Copy link
Author

Great, I noticed that you do not do a softmax on the logits, you just take the argmax immediately, which i realize is equivalent.
I also see in the c++ deploy package, in netTRT.cpp, line 160:
_sizeof_out = num_classes * _size_in_pix * sizeof(int);

i am curious why you used sizeof(int), instead of sizeof(float), since the output logits are float32 types. Now, i recognize that sizeof(int) is actually = to sizeof(float), but im curious why you explicitly stated it as int

Thank you!

@JadBatmobile
Copy link
Author

Hey Andres, i have another question, this is unrelated and more theoretical, perhaps this is the wrong place, but id be down to email you instead.

When preprocessing images for segmentation learning with CNN's, there are multiple options, (1) take the mean for a pixel across the entire data set, and the std dev for the pixel across the data set, then subtract that mean from that pixel in each image and divide by the std dev. do this for all pixels. (2) for each image, find the mean value and std deviation of all your pixels, and normalize each pixel with those. Or (3) for each image, for each channel, find the mean value and deviation and normalize each pixel in each channel, then merge the channels.

I am wondering what your thoughts on those options are

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants