a blue sky with no clouds. man riding bicycle. a blue car parked on the street. traffic light on a pole.a traffic light. a city street scene. a building with a large roof. traffic light on a pole. a large tree in the street. a large window on the building.
对图像的内容进行精准目标检测、识别与描述,实现对图像语义的描述,从而使得计算机能够看懂图像,让机器具有看图说话的能力。