Boosted by Multi-modal Large Language Models (MLLMs), text-guided universal segmentation models for the image and video domains have made rapid progress recently. However, these methods are often ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results