Object detection is to predict the bounding boxes and category labels for each object of interest. This paper proposed DETR (Detection Transformer) to predict all objects at once, trained end-to-end with a set loss function to perform bipartite matching between the predicted and groundtruth. It is found to perform better...
[more]
Prokhorenkova et al (2018) CatBoost: Unbiased boosting with categorical features
CatBoost is a library for random forest. This paper describes the key feature behind it.
[more]
Redmon et al (2016) You Only Look Once: Unified, Real-time Object Detection
This is the paper to propose YOLOv1 network, which reframed object detection as a regression problem. It is a single convolutional network that simultaneously predicts multiple bounding boxes and class probabilities for those boxes. It is to compare against R-CNN but faster and can see the entire image at once....
[more]
Liu et al (2016) SSD: Single Shot MultiBox Detector
This paper distinct from previous work in the sense that the older approach of object detection first hypothesize bounding boxes, resample features for each box, then apply a classifier. This paper proposed a network that does not resample for bounding box hypotheses but equally accurate. It can do high speed...
[more]
Wang et al (2020) HRNet
Classification networks such as AlexNet, VGGNet, GoogLeNet, ResNet are all reducing spatial size and produce a low-resolution representation. High-resolution representation is produced in U-net, for example, using dilated convolution and upsampling. This paper proposes HRNet that maintain high-resolution representation through the whole process. It starts from high resolution convolution stream...
[more]