Roc curve /part 2 #13022

Open
gchinta1 opened this issue May 17, 2024 · 1 comment
Labels
question Further information is requested

Comments


gchinta1 commented May 17, 2024

Search before asking

Question

Hi Glenn, after I run your code the ROC is still not good. I tried using the detection confidence, thinking that might be the problem, but the ROC is still nan. I've included the code you gave me below; maybe you can help me again. All the classes are 0 since I am only using one class, and the last column is the confidence one. I am not using ground truth labels, only the confidence scores. Can you help with modifying the script for ground truth if that's what it needs? Thank you.

Additional

```python
import glob

import matplotlib.pyplot as plt
import pandas as pd
from sklearn.metrics import roc_curve, auc

# These will hold all your scores and true labels
all_scores = []
y_true = []

# Set this per file: 1 if the detections belong to the target class, 0 otherwise
class_label = 1

# Loop through all CSV files in your directory
for csv_file in glob.glob('labelsval1/*.csv'):
    data = pd.read_csv(csv_file, header=None)
    scores = data[1].tolist()  # adjust the index to match your confidence column
    all_scores.extend(scores)
    # Ensure to update y_true based on your actual data specifics, using 0 or 1 accordingly
    y_true.extend([class_label] * len(scores))

# Calculate ROC
fpr, tpr, thresholds = roc_curve(y_true, all_scores)
roc_auc = auc(fpr, tpr)

# Plotting
plt.figure()
plt.plot(fpr, tpr, color='darkorange', lw=2, label='ROC curve (area = %0.2f)' % roc_auc)
plt.plot([0, 1], [0, 1], color='navy', lw=2, linestyle='--')
plt.xlim([0.0, 1.0])
plt.ylim([0.0, 1.05])
plt.xlabel('False Positive Rate')
plt.ylabel('True Positive Rate')
plt.title('Receiver Operating Characteristic')
plt.legend(loc="lower right")
plt.show()
```

gchinta1 added the question label on May 17, 2024
glenn-jocher (Member) commented
Hello! 🚀 It looks like you're experiencing issues with generating the ROC curve because you're not using ground truth labels.

To properly calculate AUC for ROC, you will need the actual labels (ground truth) for each prediction to compare against. It seems you only have the detection confidence scores.
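For illustration, here is a minimal sketch (with made-up numbers, not your data) showing why the AUC comes out as nan when every entry in y_true belongs to the same class:

```python
from sklearn.metrics import roc_curve, auc

# Every ground-truth label is 0, as in your current loop
y_true = [0, 0, 0, 0]
scores = [0.9, 0.8, 0.2, 0.1]

fpr, tpr, _ = roc_curve(y_true, scores)
print(auc(fpr, tpr))  # nan: a ROC curve needs both positive and negative samples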

Here’s a method to modify your script to include ground truth labels:

  1. Update the y_true list in your loop to include actual labels from your dataset.
  2. Map your class labels to 0 or 1 based on the detection (presence or absence of your target class).

Modify this part of your script:

```python
# Assume your actual labels column in the CSV is the second column (index 1)
labels = data[1].tolist()  # adjust the index based on your data structure
...
y_true.extend(labels)
```

Ensure you're reading from the correct column index for labels and scores in your CSV. After this adjustment, the ROC calculation should function with accurate class labels.
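
Putting it together, here is a minimal sketch of the full loop, assuming the ground-truth label (0 or 1) is in the first column and the detection confidence is in the last column; adjust the indices to match your CSV layout:

```python
import glob

import pandas as pd
from sklearn.metrics import roc_curve, auc

all_scores = []
y_true = []

for csv_file in glob.glob('labelsval1/*.csv'):
    data = pd.read_csv(csv_file, header=None)
    # Assumed layout: ground-truth label (0 or 1) in the first column,
    # detection confidence in the last column; adjust if yours differs
    y_true.extend(data[0].astype(int).tolist())
    all_scores.extend(data.iloc[:, -1].tolist())

# y_true must contain both 0s and 1s, otherwise the AUC will be nan
fpr, tpr, thresholds = roc_curve(y_true, all_scores)
print('AUC: %.3f' % auc(fpr, tpr))
```

The plotting section of your script can stay as it is once fpr and tpr are computed from real labels.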

Hope this helps you move forward! 👍
