detectron2系列：config软件包

2020-09-04 09:48

磐创AI

关注

＃ ROI HEADS options

＃－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－＃

＿C．MODEL．ROI＿HEADS ＝ CN（）

＿C．MODEL．ROI＿HEADS．NAME ＝＂Res5ROIHeads＂

＃ Number of foreground classes

＿C．MODEL．ROI＿HEADS．NUM＿CLASSES ＝ 80

＃ Names of the input feature maps to be used by ROI heads

＃ Currently all heads （box， mask，．．．） use the same input feature map list

＃ e．g．，［＂p2＂，＂p3＂，＂p4＂，＂p5＂］ is commonly used for FPN

＿C．MODEL．ROI＿HEADS．IN＿FEATURES ＝［＂res4＂］

＃ IOU overlap ratios ［IOU＿THRESHOLD］

＃ Overlap threshold for an RoI to be considered background （if ＜ IOU＿THRESHOLD）

＃ Overlap threshold for an RoI to be considered foreground （if ＞＝ IOU＿THRESHOLD）

＿C．MODEL．ROI＿HEADS．IOU＿THRESHOLDS ＝［0．5］

＿C．MODEL．ROI＿HEADS．IOU＿LABELS ＝［0， 1］

＃ RoI minibatch size ＊per image＊（number of regions of interest ［ROIs］）

＃ Total number of RoIs per training minibatch ＝

＃ ROI＿HEADS．BATCH＿SIZE＿PER＿IMAGE ＊ SOLVER．IMS＿PER＿BATCH

＃ E．g．， a common configuration is： 512 ＊ 16 ＝ 8192

＿C．MODEL．ROI＿HEADS．BATCH＿SIZE＿PER＿IMAGE ＝ 512

＃ Target fraction of RoI minibatch that is labeled foreground （i．e． class ＞ 0）

＿C．MODEL．ROI＿HEADS．POSITIVE＿FRACTION ＝ 0．25

＃ Only used on test mode

＃ Minimum score threshold （assuming scores in a ［0， 1］ range）； a value chosen to

＃ balance obtaining high recall with not having too many low precision

＃ detections that will slow down inference post processing steps （like NMS）

＃ A default threshold of 0．0 increases AP by ～0．2－0．3 but significantly slows down

＃ inference．

＿C．MODEL．ROI＿HEADS．SCORE＿THRESH＿TEST ＝ 0．05

＃ Overlap threshold used for non－maximum suppression （suppress boxes with

＃ IoU ＞＝ this threshold）

＿C．MODEL．ROI＿HEADS．NMS＿THRESH＿TEST ＝ 0．5

＃ If True， augment proposals with ground－truth boxes before sampling proposals to

＃ train ROI heads．

＿C．MODEL．ROI＿HEADS．PROPOSAL＿APPEND＿GT ＝ True

＃ Box Head

＿C．MODEL．ROI＿BOX＿HEAD ＝ CN（）

＃ C4 don＇t use head name option

＃ Options for non－C4 models： FastRCNNConvFCHead，

＿C．MODEL．ROI＿BOX＿HEAD．NAME ＝＂＂

＃ Default weights on （dx， dy， dw， dh） for normalizing bbox regression targets

＃ These are empirically chosen to approximately lead to unit variance targets

＿C．MODEL．ROI＿BOX＿HEAD．BBOX＿REG＿WEIGHTS ＝（10．0， 10．0， 5．0， 5．0）

＃ The transition point from L1 to L2 loss． Set to 0．0 to make the loss simply L1．

＿C．MODEL．ROI＿BOX＿HEAD．SMOOTH＿L1＿BETA ＝ 0．0

＿C．MODEL．ROI＿BOX＿HEAD．POOLER＿RESOLUTION ＝ 14

＿C．MODEL．ROI＿BOX＿HEAD．POOLER＿SAMPLING＿RATIO ＝ 0

＃ Type of pooling operation applied to the incoming feature map for each RoI

＿C．MODEL．ROI＿BOX＿HEAD．POOLER＿TYPE ＝＂ROIAlignV2＂

＿C．MODEL．ROI＿BOX＿HEAD．NUM＿FC ＝ 0

＃ Hidden layer dimension for FC layers in the RoI box head

＿C．MODEL．ROI＿BOX＿HEAD．FC＿DIM ＝ 1024

＿C．MODEL．ROI＿BOX＿HEAD．NUM＿CONV ＝ 0

＃ Channel dimension for Conv layers in the RoI box head

＿C．MODEL．ROI＿BOX＿HEAD．CONV＿DIM ＝ 256

＃ Normalization method for the convolution layers．

＃ Options：＂＂（no norm），＂GN＂，＂SyncBN＂．

＿C．MODEL．ROI＿BOX＿HEAD．NORM ＝＂＂

＃ Whether to use class agnostic for bbox regression

＿C．MODEL．ROI＿BOX＿HEAD．CLS＿AGNOSTIC＿BBOX＿REG ＝ False

＃ If true， RoI heads use bounding boxes predicted by the box head rather than proposal boxes．

＿C．MODEL．ROI＿BOX＿HEAD．TRAIN＿ON＿PRED＿BOXES ＝ False

＃ Cascaded Box Head

＿C．MODEL．ROI＿BOX＿CASCADE＿HEAD ＝ CN（）

＃ The number of cascade stages is implicitly defined by the length of the following two configs．

＿C．MODEL．ROI＿BOX＿CASCADE＿HEAD．BBOX＿REG＿WEIGHTS ＝（

（10．0， 10．0， 5．0， 5．0），

（20．0， 20．0， 10．0， 10．0），

（30．0， 30．0， 15．0， 15．0），

）

＿C．MODEL．ROI＿BOX＿CASCADE＿HEAD．IOUS ＝（0．5， 0．6， 0．7）

＃ Mask Head

＿C．MODEL．ROI＿MASK＿HEAD ＝ CN（）

＿C．MODEL．ROI＿MASK＿HEAD．NAME ＝＂MaskRCNNConvUpsampleHead＂

＿C．MODEL．ROI＿MASK＿HEAD．POOLER＿RESOLUTION ＝ 14

＿C．MODEL．ROI＿MASK＿HEAD．POOLER＿SAMPLING＿RATIO ＝ 0

＿C．MODEL．ROI＿MASK＿HEAD．NUM＿CONV ＝ 0 ＃ The number of convs in the mask head

＿C．MODEL．ROI＿MASK＿HEAD．CONV＿DIM ＝ 256

＃ Normalization method for the convolution layers．

＃ Options：＂＂（no norm），＂GN＂，＂SyncBN＂．

＿C．MODEL．ROI＿MASK＿HEAD．NORM ＝＂＂

＃ Whether to use class agnostic for mask prediction

＿C．MODEL．ROI＿MASK＿HEAD．CLS＿AGNOSTIC＿MASK ＝ False

＃ Type of pooling operation applied to the incoming feature map for each RoI

＿C．MODEL．ROI＿MASK＿HEAD．POOLER＿TYPE ＝＂ROIAlignV2＂

＃ Keypoint Head

＿C．MODEL．ROI＿KEYPOINT＿HEAD ＝ CN（）

＿C．MODEL．ROI＿KEYPOINT＿HEAD．NAME ＝＂KRCNNConvDeconvUpsampleHead＂

＿C．MODEL．ROI＿KEYPOINT＿HEAD．POOLER＿RESOLUTION ＝ 14

＿C．MODEL．ROI＿KEYPOINT＿HEAD．POOLER＿SAMPLING＿RATIO ＝ 0

＿C．MODEL．ROI＿KEYPOINT＿HEAD．CONV＿DIMS ＝ tuple（512 for ＿ in range（8））

＿C．MODEL．ROI＿KEYPOINT＿HEAD．NUM＿KEYPOINTS ＝ 17 ＃ 17 is the number of keypoints in COCO．

＃ Images with too few （or no） keypoints are excluded from training．

＿C．MODEL．ROI＿KEYPOINT＿HEAD．MIN＿KEYPOINTS＿PER＿IMAGE ＝ 1

＃ Normalize by the total number of visible keypoints in the minibatch if True．

＃ Otherwise， normalize by the total number of keypoints that could ever exist

＃ in the minibatch．

＃ The keypoint softmax loss is only calculated on visible keypoints．

＃ Since the number of visible keypoints can vary significantly between

＃ minibatches， this has the effect of up－weighting the importance of

＃ minibatches with few visible keypoints．（Imagine the extreme case of

＃ only one visible keypoint versus N： in the case of N， each one

＃ contributes 1／N to the gradient compared to the single keypoint

＃ determining the gradient direction）． Instead， we can normalize the

＃ loss by the total number of keypoints， if it were the case that all

＃ keypoints were visible in a full minibatch．（Returning to the example，

＃ this means that the one visible keypoint contributes as much as each

＃ of the N keypoints．）

＿C．MODEL．ROI＿KEYPOINT＿HEAD．NORMALIZE＿LOSS＿BY＿VISIBLE＿KEYPOINTS ＝ True

＃ Multi－task loss weight to use for keypoints

＃ Recommended values：

＃－ use 1．0 if NORMALIZE＿LOSS＿BY＿VISIBLE＿KEYPOINTS is True

＃－ use 4．0 if NORMALIZE＿LOSS＿BY＿VISIBLE＿KEYPOINTS is False

＿C．MODEL．ROI＿KEYPOINT＿HEAD．LOSS＿WEIGHT ＝ 1．0

＃ Type of pooling operation applied to the incoming feature map for each RoI

＿C．MODEL．ROI＿KEYPOINT＿HEAD．POOLER＿TYPE ＝＂ROIAlignV2＂

＃ Semantic Segmentation Head

＿C．MODEL．SEM＿SEG＿HEAD ＝ CN（）

＿C．MODEL．SEM＿SEG＿HEAD．NAME ＝＂SemSegFPNHead＂

＿C．MODEL．SEM＿SEG＿HEAD．IN＿FEATURES ＝［＂p2＂，＂p3＂，＂p4＂，＂p5＂］

＃ Label in the semantic segmentation ground truth that is ignored， i．e．， no loss is calculated for

＃ the correposnding pixel．

＿C．MODEL．SEM＿SEG＿HEAD．IGNORE＿VALUE ＝ 255

＃ Number of classes in the semantic segmentation head

＿C．MODEL．SEM＿SEG＿HEAD．NUM＿CLASSES ＝ 54

＃ Number of channels in the 3x3 convs inside semantic－FPN heads．

＿C．MODEL．SEM＿SEG＿HEAD．CONVS＿DIM ＝ 128

＃ Outputs from semantic－FPN heads are up－scaled to the COMMON＿STRIDE stride．

＿C．MODEL．SEM＿SEG＿HEAD．COMMON＿STRIDE ＝ 4

＃ Normalization method for the convolution layers． Options：＂＂（no norm），＂GN＂．

＿C．MODEL．SEM＿SEG＿HEAD．NORM ＝＂GN＂

＿C．MODEL．SEM＿SEG＿HEAD．LOSS＿WEIGHT ＝ 1．0

＿C．MODEL．PANOPTIC＿FPN ＝ CN（）

＃ Scaling of all losses from instance detection ／ segmentation head．

＿C．MODEL．PANOPTIC＿FPN．INSTANCE＿LOSS＿WEIGHT ＝ 1．0

＃ options when combining instance ＆ semantic segmentation outputs

＿C．MODEL．PANOPTIC＿FPN．COMBINE ＝ CN（｛＂ENABLED＂： True｝）

＿C．MODEL．PANOPTIC＿FPN．COMBINE．OVERLAP＿THRESH ＝ 0．5

＿C．MODEL．PANOPTIC＿FPN．COMBINE．STUFF＿AREA＿LIMIT ＝ 4096

＿C．MODEL．PANOPTIC＿FPN．COMBINE．INSTANCES＿CONFIDENCE＿THRESH ＝ 0．5

＃ RetinaNet Head

＿C．MODEL．RETINANET ＝ CN（）

＃ This is the number of foreground classes．

＿C．MODEL．RETINANET．NUM＿CLASSES ＝ 80

＿C．MODEL．RETINANET．IN＿FEATURES ＝［＂p3＂，＂p4＂，＂p5＂，＂p6＂，＂p7＂］

＃ Convolutions to use in the cls and bbox tower

＃ NOTE： this doesn＇t include the last conv for logits

＿C．MODEL．RETINANET．NUM＿CONVS ＝ 4

＃ IoU overlap ratio ［bg， fg］ for labeling anchors．

＃ Anchors with ＜ bg are labeled negative （0）

＃ Anchors with ＞＝ bg and ＜ fg are ignored （－1）

＃ Anchors with ＞＝ fg are labeled positive （1）

＿C．MODEL．RETINANET．IOU＿THRESHOLDS ＝［0．4， 0．5］

＿C．MODEL．RETINANET．IOU＿LABELS ＝［0，－1， 1］

＃ Prior prob for rare case （i．e． foreground） at the beginning of training．

＃ This is used to set the bias for the logits layer of the classifier subnet．

＃ This improves training stability in the case of heavy class imbalance．

＿C．MODEL．RETINANET．PRIOR＿PROB ＝ 0．01

＃ Inference cls score threshold， only anchors with score ＞ INFERENCE＿TH are

＃ considered for inference （to improve speed）

＿C．MODEL．RETINANET．SCORE＿THRESH＿TEST ＝ 0．05

＿C．MODEL．RETINANET．TOPK＿CANDIDATES＿TEST ＝ 1000

＿C．MODEL．RETINANET．NMS＿THRESH＿TEST ＝ 0．5

＃ Weights on （dx， dy， dw， dh） for normalizing Retinanet anchor regression targets

＿C．MODEL．RETINANET．BBOX＿REG＿WEIGHTS ＝（1．0， 1．0， 1．0， 1．0）

＃ Loss parameters

＿C．MODEL．RETINANET．FOCAL＿LOSS＿GAMMA ＝ 2．0

＿C．MODEL．RETINANET．FOCAL＿LOSS＿ALPHA ＝ 0．25

＿C．MODEL．RETINANET．SMOOTH＿L1＿LOSS＿BETA ＝ 0．1

＃ ResNe［X］t options （ResNets ＝｛ResNet， ResNeXt｝

＃ Note that parts of a resnet may be used for both the backbone and the head

＃ These options apply to both

＿C．MODEL．RESNETS ＝ CN（）

＿C．MODEL．RESNETS．DEPTH ＝ 50

＿C．MODEL．RESNETS．OUT＿FEATURES ＝［＂res4＂］＃ res4 for C4 backbone， res2．．5 for FPN backbone

＃ Number of groups to use； 1 ＝＝＞ ResNet；＞ 1 ＝＝＞ ResNeXt

＿C．MODEL．RESNETS．NUM＿GROUPS ＝ 1

＃ Options： FrozenBN， GN，＂SyncBN＂，＂BN＂

＿C．MODEL．RESNETS．NORM ＝＂FrozenBN＂

＃ Baseline width of each group．

＃ Scaling this parameters will scale the width of all bottleneck layers．

＿C．MODEL．RESNETS．WIDTH＿PER＿GROUP ＝ 64

＃ Place the stride 2 conv on the 1x1 filter

＃ Use True only for the original MSRA ResNet； use False for C2 and Torch models

＿C．MODEL．RESNETS．STRIDE＿IN＿1X1 ＝ True

＃ Apply dilation in stage ＂res5＂

＿C．MODEL．RESNETS．RES5＿DILATION ＝ 1

＃ Output width of res2． Scaling this parameters will scale the width of all 1x1 convs in ResNet

＿C．MODEL．RESNETS．RES2＿OUT＿CHANNELS ＝ 256

＿C．MODEL．RESNETS．STEM＿OUT＿CHANNELS ＝ 64

＃ Apply Deformable Convolution in stages

＃ Specify if apply deform＿conv on Res2， Res3， Res4， Res5

＿C．MODEL．RESNETS．DEFORM＿ON＿PER＿STAGE ＝［False， False， False， False］

＃ Use True to use modulated deform＿conv （DeformableV2， https：／／arxiv．org／abs／1811．11168）；

＃ Use False for DeformableV1．

＿C．MODEL．RESNETS．DEFORM＿MODULATED ＝ False

＃ Number of groups in deformable conv．

＿C．MODEL．RESNETS．DEFORM＿NUM＿GROUPS ＝ 1

<上一页 1 2 3 4 下一页> 余下全文

本地收藏打印推荐给朋友

声明： 本文由入驻维科号的作者撰写，观点仅代表作者本人，不代表OFweek立场。如有侵权或其他问题，请联系举报。

发表评论

共0条评论，0人参与

登录登录即可访问所有OFweek服务

用户名/邮箱/手机：
密码：
忘记密码？
用其他账号登录： QQ | 微信 | 新浪微博

请输入评论内容...

请输入评论/评论长度6~500个字

暂无评论

图片新闻