Abstract: Due to the limited scale of multimodal table understanding (MTU) data, model performance is constrained. A straightforward approach is to use multimodal large language models to obtain more ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results