Add lookup table operator #1251

jantonguirao · 2019-09-12T15:41:04Z

Signed-off-by: Joaquin Anton janton@nvidia.com

Why we need this PR?

Need a lookup operator, e.g. lookup weights for label values

What happened in this PR?

Introduced LookupTable CPU and GPU implementations
Added python tests to cover the new operator

JIRA TASK: [DALI-1050]

jantonguirao · 2019-09-13T11:58:17Z

!build

dali-automaton · 2019-09-13T11:59:58Z

CI MESSAGE: [899100]: BUILD STARTED

dali-automaton · 2019-09-13T13:51:20Z

CI MESSAGE: [899100]: BUILD PASSED

dali/pipeline/data/tensor.h

dali/pipeline/operators/util/lookup_table.h

dali/pipeline/operators/util/lookup_table.cc

klecki · 2019-09-16T10:43:04Z

Btw, did you forget to add python tests?

Signed-off-by: Joaquin Anton <janton@nvidia.com>

dali/pipeline/data/tensor.h

dali/pipeline/operators/util/lookup_table.cc

dali/pipeline/operators/util/lookup_table.h

jantonguirao · 2019-09-16T11:06:11Z

Btw, did you forget to add python tests?

Yes, and the GPU implementation. I pushed it now

Signed-off-by: Joaquin Anton <janton@nvidia.com>

jantonguirao · 2019-09-16T14:45:29Z

!build

dali-automaton · 2019-09-16T14:49:57Z

CI MESSAGE: [902153]: BUILD STARTED

dali-automaton · 2019-09-16T15:41:35Z

CI MESSAGE: [902153]: BUILD PASSED

Signed-off-by: Joaquin Anton <janton@nvidia.com>

jantonguirao · 2019-09-17T11:28:48Z

!build

dali-automaton · 2019-09-17T11:30:03Z

CI MESSAGE: [903729]: BUILD STARTED

dali-automaton · 2019-09-17T12:55:53Z

CI MESSAGE: [903729]: BUILD PASSED

Signed-off-by: Joaquin Anton <janton@nvidia.com> Signed-off-by: Jianjun Liu <00liujj@163.com>

LiuHao-THU · 2020-02-10T13:54:30Z

ops, only int supported for input.

JanuszL · 2020-02-10T13:58:03Z

@LiuHao-THU - what is your use case and how you want it to work?

LiuHao-THU · 2020-02-10T14:13:56Z

@LiuHao-THU - what is your use case and how you want it to work?
Thank for the reply, here is my situation:

I'm Reimplement bayesian personalized ranking(a recommendation algorithm), the network is very small, however, I have over 200000 users in my datasets. Now, I'm using PyTorch data_loader,
the code is here. self.cand is a dictionary(lookup-table in my datasets, user as key, rating as value), however, after I set num works in data loader to a very large value, the CPU usage is very high, the network still runs in low GPU usage. so I want to load the datasets to GPU for preprocessing, however, the lookuptable range from [0, 65535].

"""

"""
class BPRData(data.Dataset):

def __init__(self, users, items, candidates, num_items, is_training = True):
	super(BPRData, self).__init__()
	self.users = users
	self.items = items
	self.is_training = is_training
	self.cand = candidates
	self.all = set([i for i in range(num_items)])
	self.device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

def __len__(self):
	return len(self.users)

def __getitem__(self, idx):
	if self.is_training == True:
		neg_items = list(self.all - set(self.cand[int(self.users[idx])]))
		indices = random.randint(0, len(neg_items) - 1)

	user = self.users[idx]
	item_i = self.items[idx]
	item_j = neg_items[indices] if \
			self.is_training else self.items[idx]

	return user, item_i, item_j

"""

JanuszL · 2020-02-10T17:43:30Z

@LiuHao-THU - the lookup was rather designed for signal processing, than for tabular data. Have you tried to check RAPIDS?
If you want to can also extend LookupTable operator to support a wider range of indexes.

jantonguirao force-pushed the lookup_table_op branch 3 times, most recently from efe97d3 to 9003501 Compare September 13, 2019 11:48

jantonguirao changed the title ~~[WIP] Add lookup table operator~~ Add lookup table operator Sep 13, 2019

jantonguirao force-pushed the lookup_table_op branch 2 times, most recently from d9ecef5 to 052f67c Compare September 13, 2019 11:58

jantonguirao requested review from klecki, awolant, mzient and szalpal September 13, 2019 12:00

NVIDIA deleted a comment from dali-automaton Sep 13, 2019

klecki requested changes Sep 16, 2019

View reviewed changes

Add lookup table operator

a6528b5

Signed-off-by: Joaquin Anton <janton@nvidia.com>

jantonguirao force-pushed the lookup_table_op branch from 052f67c to a6528b5 Compare September 16, 2019 10:59

szalpal reviewed Sep 16, 2019

View reviewed changes

Address PR comments

2b31d21

Signed-off-by: Joaquin Anton <janton@nvidia.com>

jantonguirao force-pushed the lookup_table_op branch from c32d382 to 2b31d21 Compare September 16, 2019 13:20

klecki approved these changes Sep 16, 2019

View reviewed changes

szalpal approved these changes Sep 17, 2019

View reviewed changes

Add float16.h

52dc852

Signed-off-by: Joaquin Anton <janton@nvidia.com>

jantonguirao force-pushed the lookup_table_op branch from 3cbe892 to 058b1eb Compare September 17, 2019 11:13

Address PR comments

c40a664

Signed-off-by: Joaquin Anton <janton@nvidia.com>

jantonguirao force-pushed the lookup_table_op branch from 058b1eb to c40a664 Compare September 17, 2019 11:28

jantonguirao merged commit 9feeeb7 into NVIDIA:master Sep 17, 2019

00liujj pushed a commit to 00liujj/DALI that referenced this pull request Oct 10, 2019

Add lookup table operator (NVIDIA#1251)

f87c305

Signed-off-by: Joaquin Anton <janton@nvidia.com> Signed-off-by: Jianjun Liu <00liujj@163.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add lookup table operator #1251

Add lookup table operator #1251

jantonguirao commented Sep 12, 2019 •

edited

Loading

jantonguirao commented Sep 13, 2019

dali-automaton commented Sep 13, 2019

dali-automaton commented Sep 13, 2019

klecki commented Sep 16, 2019

jantonguirao commented Sep 16, 2019

jantonguirao commented Sep 16, 2019

dali-automaton commented Sep 16, 2019

dali-automaton commented Sep 16, 2019

jantonguirao commented Sep 17, 2019

dali-automaton commented Sep 17, 2019

dali-automaton commented Sep 17, 2019

LiuHao-THU commented Feb 10, 2020

JanuszL commented Feb 10, 2020

LiuHao-THU commented Feb 10, 2020 •

edited

Loading

JanuszL commented Feb 10, 2020

Add lookup table operator #1251

Add lookup table operator #1251

Conversation

jantonguirao commented Sep 12, 2019 • edited Loading

Why we need this PR?

What happened in this PR?

jantonguirao commented Sep 13, 2019

dali-automaton commented Sep 13, 2019

dali-automaton commented Sep 13, 2019

klecki commented Sep 16, 2019

jantonguirao commented Sep 16, 2019

jantonguirao commented Sep 16, 2019

dali-automaton commented Sep 16, 2019

dali-automaton commented Sep 16, 2019

jantonguirao commented Sep 17, 2019

dali-automaton commented Sep 17, 2019

dali-automaton commented Sep 17, 2019

LiuHao-THU commented Feb 10, 2020

JanuszL commented Feb 10, 2020

LiuHao-THU commented Feb 10, 2020 • edited Loading

JanuszL commented Feb 10, 2020

jantonguirao commented Sep 12, 2019 •

edited

Loading

LiuHao-THU commented Feb 10, 2020 •

edited

Loading