The detection and segmentation code are similar. We explain how to use
the code using segmentation.
if the training loss cannot decrease initially, use class weight or balance class sampler. You can finetune without weight + normal random sampler at the end