`xnn_weights_cache_provider` look_up doesn't work?

I've recently been experimenting with XNNPACK's weight cache to reduce load time by caching packed weights and also reduce memory pressure for repeated weights across the same kernels.

I was experiementing with fully-connected operator and found that the weight cache was never being hit. I noticed that when using the apis to create the `xnn_weights_cache_t` we set the look up function to be `xnn_internal_weights_cache_look_up`:

https://github.com/google/XNNPACK/blob/85071b8b8729f63262484942fc9eb1c7c16525c4/src/runtime.c#L148

looking at this function, it looks like a placeholder function which would always return `XNN_CACHE_NOT_FOUND`:

https://github.com/google/XNNPACK/blob/85071b8b8729f63262484942fc9eb1c7c16525c4/src/cache.c#L491-L496

Now when I'm using the weights cache to create a runtime_t with only a fully connected operator, in the flow of creating the fully-connected operator, we look up the cache to see if the weights have been packed before, using xnn_weights_cache_look_up:

https://github.com/google/XNNPACK/blob/85071b8b8729f63262484942fc9eb1c7c16525c4/src/operators/fully-connected-nc.c#L154-L157

However this just uses the the placeholder function above, returning XNN_CACHE_NOT_FOUND:

https://github.com/google/XNNPACK/blob/85071b8b8729f63262484942fc9eb1c7c16525c4/src/cache.c#L530-L534

As a result, every look up would then fall to XNN_CACHE_NOT_FOUND, in which weights have to be repacked, and memory has to be allocated for the newly packed weights:

https://github.com/google/XNNPACK/blob/85071b8b8729f63262484942fc9eb1c7c16525c4/src/operators/fully-connected-nc.c#L159-L179

Am I looking at this incorrectly? Or is this a feature that is still a wip? Or is this a bug that is meant to be fixed in the future?



	size_t xnn_internal_weights_cache_look_up(
	struct xnn_internal_weights_cache* cache, const struct xnn_weights_cache_look_up_key* cache_key)
	{
	// The default implementation does not support this query.
	return XNN_CACHE_NOT_FOUND;
	}

	if (cache_offset == XNN_CACHE_NOT_FOUND) {
	void* weights_ptr = xnn_get_pointer_to_write_weights(
	fully_connected_op, aligned_total_weights_size, packed_weights_padding_byte);
	if (weights_ptr == NULL) {
	xnn_log_error(
	"failed to allocate %zu bytes for %s operator packed weights",
	packed_weights_size, xnn_operator_type_to_string(operator_type));
	goto error;
	}
	xnn_log_debug("allocated %zu bytes for packed weights in %s operator",
	aligned_total_weights_size, xnn_operator_type_to_string(operator_type));

	if (flags & XNN_FLAG_TRANSPOSE_WEIGHTS) {
	pack_gemm_gio_w(
	/groups=/1, output_channels, input_channels,
	nr, kr, sr,
	output_channels,
	kernel, bias, /scale=/NULL,
	weights_ptr,
	gemm_config->nr * extra_weights_bytes,
	packing_params);

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

`xnn_weights_cache_provider` look_up doesn't work? #6257

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

	if (use_weights_cache(fully_connected_op)) {
	cache_offset = xnn_weights_cache_look_up(
	fully_connected_op->weights_cache, &cache_key);
	}

	size_t xnn_weights_cache_look_up(
	xnn_weights_cache_t cache, const struct xnn_weights_cache_look_up_key* cache_key)
	{
	return cache->look_up(cache->context, cache_key);
	}

xnn_weights_cache_provider look_up doesn't work? #6257

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

`xnn_weights_cache_provider` look_up doesn't work? #6257