deep-learning - 用于实例分割的极简 coco 数据集：无法创建合法的数据集

Question

我尝试仅使用 pycocotools 转换数据集以进行实例分割。初始数据集由成对的灰度（左）+groundtruth（中）组成。地面实况图像产生两个二进制实例（红色）：

转换为 COCO 格式必须为每个灰度图像生成一个保存为 json 文件的字典。

由于我不明白如何使用 imantics 或 pycococreator，我尝试手动生成一个包含一个图像和两个实例的极简示例。整件事都可以在笔记本中找到。这里选择了第 130 张图像，字典制作如下：

N = 130
NUM_CATEGORIES = 2 # chrom:1, background :0
grey = data[N,:,:,0]
# dictionnary for image 130
## path to greyscaled image
dataset_root = os.path.join('.','dataset','shapes','train')
subset ='shapes_train'+'2020'
#annotations = 'annotations'
image_id = "{:04d}".format(N) #Possible bug here since
#print(image_id)
grey_file_name = os.path.join(image_id+'.png')
path_to_grey = os.path.join(dataset_root,subset, grey_file_name)

dict_to_130 = {}
dict_to_130['file_name']= path_to_grey

## grey shape
dict_to_130['height']= grey.shape[0]
dict_to_130['width']= grey.shape[1]

## the image id could be different from its index, here choose id=index=N
dict_to_130['image_id'] = N

### Prepare the dicts for annotation
#### bounding boxes : theres two instances in image 130:First instance
dict_to_130['annotations']= []
annotation_instance_01_dict = {}
annotation_instance_01_dict['bbox']=None
Bbox_0130_01 = mask_to_bbox_corners(mask1, mode='XYXY')

print("     ", type(Bbox_0130_01), type(Bbox_0130_01[0]))

annotation_instance_01_dict['bbox'] = Bbox_0130_01
annotation_instance_01_dict['bbox_mode']=0 #XYXY
annotation_instance_01_dict['category_id'] = NUM_CATEGORIES-1

annotation_instance_01_dict['segmentation']=None # A dict is used, How to handle several instances?
mask1 = mask1 > 0
### rle_instance_1 is a dict
###
### <byte> type issue !!!
###

rle_instance_1 = encode(np.asarray(mask1, order="F"))

print("rle_instance1 ",rle_instance_1)
print("rle_instance1['counts'] is of type:",type(rle_instance_1['counts']))

print("rle_instance1 ",rle_instance_1['counts'].decode("utf-8"))

counts_byte_to_utf8 = rle_instance_1['counts'].decode("utf-8")
rle_instance_1['counts'] = counts_byte_to_utf8
###
###
#cfg.INPUT.MASK_FORMAT='bitmask'
annotation_instance_01_dict['segmentation'] = rle_instance_1

dict_to_130['annotations'].append(annotation_instance_01_dict)

#### bounding boxes : theres two instances in image 130: second instance

annotation_instance_02_dict = {}
annotation_instance_02_dict['bbox']=None
Bbox_0130_02 = mask_to_bbox_corners(mask2, mode='XYXY')
print("     ", type(Bbox_0130_02))
annotation_instance_02_dict['bbox'] = Bbox_0130_02
annotation_instance_02_dict['bbox_mode']=0 #XYXY
annotation_instance_02_dict['category_id'] = NUM_CATEGORIES-1

annotation_instance_02_dict['segmentation']=None # A dict is used, How to handle several instances?
mask2 = mask2 > 0
### rle_instance_1 is a dict
rle_instance_2 = encode(np.asarray(mask2, order="F"))
#cfg.INPUT.MASK_FORMAT='bitmask'

###
### <byte> type issue !!!
###
rle_instance_2['counts'] = rle_instance_2['counts'].decode("utf-8")
###
annotation_instance_02_dict['segmentation'] = rle_instance_2
dict_to_130['annotations'].append(annotation_instance_02_dict)

可以查字典：

print(dict_to_130.keys())
print("    ", type(dict_to_130['height']), dict_to_130['height'])
print("    ", type(dict_to_130['width']), dict_to_130['width'])
print("    ", type(dict_to_130['image_id']), dict_to_130['image_id'])
print(dict_to_130['file_name'])
print(type(dict_to_130['annotations']))

print(dict_to_130['annotations'])
print(dict_to_130['annotations'][0].keys())
print(dict_to_130['annotations'][0]['segmentation'])
print(dict_to_130['annotations'][0]['segmentation'])
print(dict_to_130['annotations'][0]['segmentation'].keys())
print("    ",dict_to_130['annotations'][0]['segmentation']['size'],"---",type(dict_to_130['annotations'][0]['segmentation']['size'][0]))
print("    ",dict_to_130['annotations'][0]['segmentation']['counts'])
print("    ",type(dict_to_130['annotations'][0]['segmentation']['counts']))

产生：

dict_keys(['file_name', 'height', 'width', 'image_id', 'annotations'])
     <class 'int'> 190
     <class 'int'> 189
     <class 'int'> 130
./dataset/shapes/train/shapes_train2020/0130.png
<class 'list'>
[{'bbox': [98, 61, 131, 124], 'bbox_mode': 0, 'category_id': 1, 'segmentation': {'size': [190, 189], 'counts': 'cXb0:_57K5K6QKUOa4[1I3L4M3N2O1N2O0O2O001O001O00000O11N10O02O0O1N3J5D=I7J7E;I9HY`:'}}, {'bbox': [98, 61, 131, 124], 'bbox_mode': 0, 'category_id': 1, 'segmentation': {'size': [190, 189], 'counts': 'oU46f52^JI\\5c0I2N2N101N101O0000000O100O1000000O010O010O10O100000O11O00001O0000O02O00O02O000O100000000000O10O101O00O1000O0101O0000O10001OO11N10000O100O10O02O00O1O1000000O101N100000O02O000000O010001N010000000000000000O100000000000000O11O01OO10000000O10O1001O000000010OO10O10000000001O001N101O1N2N3M8D[JOUU4'}}]
dict_keys(['bbox', 'bbox_mode', 'category_id', 'segmentation'])
{'size': [190, 189], 'counts': 'cXb0:_57K5K6QKUOa4[1I3L4M3N2O1N2O0O2O001O001O00000O11N10O02O0O1N3J5D=I7J7E;I9HY`:'}
{'size': [190, 189], 'counts': 'cXb0:_57K5K6QKUOa4[1I3L4M3N2O1N2O0O2O001O001O00000O11N10O02O0O1N3J5D=I7J7E;I9HY`:'}
dict_keys(['size', 'counts'])
     [190, 189] --- <class 'int'>
     cXb0:_57K5K6QKUOa4[1I3L4M3N2O1N2O0O2O001O001O00000O11N10O02O0O1N3J5D=I7J7E;I9HY`:
     <class 'str'>

然后将字典保存为 json 文件（来自 colab 笔记本）：

with open(os.path.join('../gdrive','My Drive','Science','Data Science','dataset','shapes','train','annotations','instances_0130_data.json'), 'w') as f:
    json.dump(dict_to_130, f)

当我尝试检查 json 文件是否是有效的 coco 数据集时，问题就来了：

#import pycocotools.coco as coco
from pycocotools.coco import COCO
dataDir= os.path.join('../gdrive','My Drive','Science','Data Science','dataset','shapes','train')
dataType='0130_data'
annFile='%s/annotations/instances_%s.json'%(dataDir,dataType)
coco=COCO(annFile)

这里 pycocotools 抱怨如下：

loading annotations into memory...
Done (t=0.00s)
creating index...

---------------------------------------------------------------------------

KeyError                                  Traceback (most recent call last)

<ipython-input-19-bea8d533e4f4> in <module>()
----> 1 coco=COCO(annFile)

1 frames

/usr/local/lib/python3.6/dist-packages/pycocotools/coco.py in createIndex(self)
     95         if 'annotations' in self.dataset:
     96             for ann in self.dataset['annotations']:
---> 97                 imgToAnns[ann['image_id']].append(ann)
     98                 anns[ann['id']] = ann
     99 

KeyError: 'image_id'

deep-learning - 用于实例分割的极简 coco 数据集：无法创建合法的数据集

0 回答 0

Related

Reference