
We have provided some samples of PhoneAgentBench, and we will open-source the complete data and code as soon as possible for the community to verify.


`mllm_testset.xlsx` is the main test set that contains APP-Rec, MM-RR, MM-NER, Mobile-FC, ACU. And `mt-plan_testset` is for MT-Plan task.
