Customize continuous string tensor type for feature hashing on GPU. #48520
Labels: comp:gpu, stat:awaiting tensorflower, type:feature
System information
Background
In a recommender system, a large number of features are expressed as text, such as the name of a product or the behavior of a user in advertising.
Describe the feature and the current behavior/state.
Currently, TensorFlow stores the text of a string tensor in non-contiguous host memory. When hashing text features, this leads to poor performance, for several reasons:
Who will benefit with this feature?
I implemented Murmur hashing in CUDA, wrapped as a TensorFlow op. I found that copying the strings one by one consumes 90% of the time, so enabling a contiguous string tensor would bring a large improvement.
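To illustrate what a contiguous layout could look like, here is a minimal CPU-side Python sketch, not the CUDA op itself. It packs all strings into one flat byte buffer plus an offsets array, so the whole batch could be transferred in a single copy, and then hashes each slice. The helper names `pack_strings`, `hash_packed`, and `murmur3_32` are hypothetical, and the choice of the MurmurHash3 x86 32-bit variant is an assumption, since the exact Murmur variant is not stated above.

```python
from typing import List, Tuple

MASK32 = 0xFFFFFFFF


def _rotl32(x: int, r: int) -> int:
    """Rotate a 32-bit value left by r bits."""
    return ((x << r) | (x >> (32 - r))) & MASK32


def murmur3_32(data, seed: int = 0) -> int:
    """MurmurHash3 x86 32-bit over a bytes-like object (assumed variant)."""
    c1, c2 = 0xCC9E2D51, 0x1B873593
    h = seed & MASK32
    n = len(data)
    nblocks = n // 4
    # Body: process the input in 4-byte little-endian blocks.
    for i in range(nblocks):
        k = int.from_bytes(data[4 * i:4 * i + 4], "little")
        k = (k * c1) & MASK32
        k = _rotl32(k, 15)
        k = (k * c2) & MASK32
        h ^= k
        h = _rotl32(h, 13)
        h = (h * 5 + 0xE6546B64) & MASK32
    # Tail: mix in the remaining 0 to 3 bytes.
    tail = data[4 * nblocks:]
    k = 0
    if len(tail) >= 3:
        k ^= tail[2] << 16
    if len(tail) >= 2:
        k ^= tail[1] << 8
    if len(tail) >= 1:
        k ^= tail[0]
        k = (k * c1) & MASK32
        k = _rotl32(k, 15)
        k = (k * c2) & MASK32
        h ^= k
    # Finalization: mix in the length and avalanche the bits.
    h ^= n
    h ^= h >> 16
    h = (h * 0x85EBCA6B) & MASK32
    h ^= h >> 13
    h = (h * 0xC2B2AE35) & MASK32
    h ^= h >> 16
    return h


def pack_strings(strings: List[bytes]) -> Tuple[bytes, List[int]]:
    """Concatenate strings into one buffer; offsets[i]:offsets[i+1] is string i."""
    offsets = [0]
    for s in strings:
        offsets.append(offsets[-1] + len(s))
    return b"".join(strings), offsets


def hash_packed(buffer: bytes, offsets: List[int], seed: int = 0) -> List[int]:
    """Hash every string in the packed buffer without copying it out again."""
    view = memoryview(buffer)
    return [
        murmur3_32(view[offsets[i]:offsets[i + 1]], seed)
        for i in range(len(offsets) - 1)
    ]


features = [b"red shoes", b"", b"click:item_42"]
buffer, offsets = pack_strings(features)
hashes = hash_packed(buffer, offsets)
```

With this layout, a GPU kernel would need only two device buffers (bytes and offsets), and each thread could hash the slice between two consecutive offsets, instead of receiving one host-to-device copy per string.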
I'm new to the TensorFlow type system. How can I create a new tensor type, and what kinds of problems should I consider?