Metric collector should collect more GPU metrics #1937

### 🚀 The feature

I'm seeing customers run commands like the one below:

```
nvidia-smi --format=csv,noheader,nounits --query-gpu=utilization.gpu,utilization.memory,memory.total,memory.used,temperature.gpu,power.draw,clocks.current.sm,clocks.current.memory -l 10 > logs/gpu-stats.log &
```

However, in our existing metric_collector.py we only have metrics for utilization and memory used, while customers may also be interested in clock speed and power draw. We should make those available as well.

https://github.com/pytorch/serve/blob/e181fee71423d3496ad9f9c1d1f4e7ee199636d0/ts/metrics/system_metrics.py#L72-L88

### Motivation, pitch

This is easy enough to do using NVML, which already exposes this data; we just need to create new metric objects.
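As a rough sketch of what this could look like, the snippet below reads power draw, temperature, and SM/memory clocks per GPU through the `pynvml` bindings. It assumes `pynvml` (the `nvidia-ml-py` package) is what we would use to talk to NVML. The function names and the metric names (`GPUPowerDraw`, `GPUTemperature`, `GPUClockSM`, `GPUClockMemory`) are hypothetical and not part of the current code base.

```python
from typing import Any, Dict, List, Tuple


def collect_extra_gpu_stats(nvml: Any) -> List[Dict[str, float]]:
    """Read power, temperature and clock readings for every visible GPU.

    `nvml` is expected to be the imported `pynvml` module (assumption);
    passing it in keeps this sketch importable on machines without a GPU.
    """
    nvml.nvmlInit()
    try:
        stats = []
        for index in range(nvml.nvmlDeviceGetCount()):
            handle = nvml.nvmlDeviceGetHandleByIndex(index)
            stats.append(
                {
                    "index": index,
                    # NVML reports power usage in milliwatts.
                    "power_draw_w": nvml.nvmlDeviceGetPowerUsage(handle) / 1000.0,
                    "temperature_c": nvml.nvmlDeviceGetTemperature(
                        handle, nvml.NVML_TEMPERATURE_GPU
                    ),
                    # Clock readings are reported in MHz.
                    "sm_clock_mhz": nvml.nvmlDeviceGetClockInfo(
                        handle, nvml.NVML_CLOCK_SM
                    ),
                    "mem_clock_mhz": nvml.nvmlDeviceGetClockInfo(
                        handle, nvml.NVML_CLOCK_MEM
                    ),
                }
            )
        return stats
    finally:
        # Always release NVML, even if one of the queries raised.
        nvml.nvmlShutdown()


def to_metric_specs(
    stats: List[Dict[str, float]]
) -> List[Tuple[str, float, str, int]]:
    """Flatten readings into (name, value, unit, device_id) tuples.

    The metric names are hypothetical placeholders for new metric objects.
    """
    fields = [
        ("GPUPowerDraw", "power_draw_w", "W"),
        ("GPUTemperature", "temperature_c", "C"),
        ("GPUClockSM", "sm_clock_mhz", "MHz"),
        ("GPUClockMemory", "mem_clock_mhz", "MHz"),
    ]
    return [
        (name, gpu[key], unit, gpu["index"])
        for gpu in stats
        for name, key, unit in fields
    ]
```

On a host with an NVIDIA driver, this would be called as `to_metric_specs(collect_extra_gpu_stats(pynvml))` after `import pynvml`. Each resulting tuple could then be turned into a metric with the same `Level` and `device_id` dimensions the existing GPU metrics use.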

### Alternatives

_No response_

### Additional context

_No response_

    for value in info:
        dimension_gpu = [
            Dimension("Level", "Host"),
            Dimension("device_id", value["index"]),
        ]
        system_metrics.append(
            Metric(
                "GPUMemoryUtilization",
                value["mem_used_percent"],
                "percent",
                dimension_gpu,
            )
        )
        system_metrics.append(
            Metric("GPUMemoryUsed", value["mem_used"], "MB", dimension_gpu)
        )
