摘要:Abstract:Rail defect detection by video cameras has recently gained much attention in both academia and industry. Rail image data has two properties. It is highly imbalanced towards the non-defective class and it has a large number of unlabeled data samples available for semi-supervised learning techniques. In this paper we investigate if positive defective candidates selected from the unlabeled data can help improve the balance between the two classes and gain performance on detecting a specific type of defects called Squats. We compare data sampling techniques as well and conclude that the semi-supervised techniques are a reasonable alternative for improving performance on applications such as rail track Squat detection from image data.