Hello,
First of all, thank you for the MIT license implementation of YOLOv9.
I want to report an issue when training with a custom dataset.
I am using the WIDER dataset and I have converted the annotations to YOLO format i.e. xywh.
However, when I attempted to train the yolov9-c model on the dataset, I found out that the annotations are wrong from the visualized ground truth, thus the model was trained wrongly with incorrect label loaded. I have attached a screenshot of the labels on ground truth visualized by wandb below.
My training script is:
task.epoch=100 dataset=widerface task.data.batch_size=8 model=v9-c device=cuda name=face use_wandb=True
The visualization of the annotated GT is attached as below:

I investigated the situation and figure out there might be an issue when loading the bbox in the file yolo/tools/data_loader.py line 125:
bbox = torch.tensor([cls, *valid_points.min(axis=0), *valid_points.max(axis=0)])
When debugging, bbox is returned as a list under [c w h x y] and not [c x y w h], where c is the class number. I attempted to reverse it to [c x y w h], but when loading the new dataset cache and training, the loss approaching NaN and infinity.
Could you help me with this issue? Thank you and best regards.
Hello,
First of all, thank you for the MIT license implementation of YOLOv9.
I want to report an issue when training with a custom dataset.
I am using the WIDER dataset and I have converted the annotations to YOLO format i.e. xywh.
However, when I attempted to train the yolov9-c model on the dataset, I found out that the annotations are wrong from the visualized ground truth, thus the model was trained wrongly with incorrect label loaded. I have attached a screenshot of the labels on ground truth visualized by wandb below.
My training script is:
task.epoch=100 dataset=widerface task.data.batch_size=8 model=v9-c device=cuda name=face use_wandb=TrueThe visualization of the annotated GT is attached as below:

I investigated the situation and figure out there might be an issue when loading the bbox in the file yolo/tools/data_loader.py line 125:
bbox = torch.tensor([cls, *valid_points.min(axis=0), *valid_points.max(axis=0)])When debugging, bbox is returned as a list under [c w h x y] and not [c x y w h], where c is the class number. I attempted to reverse it to [c x y w h], but when loading the new dataset cache and training, the loss approaching NaN and infinity.
Could you help me with this issue? Thank you and best regards.