Upload an image. First computes a depth map via MiDaS, then applies SegFormer segmentation on the depth map.