Toggle navigation
OpenReview
.net
Login
×
Back to
ACMMM
ACMMM 2024 Conference Submissions
Dual-view Pyramid Network for Video Frame Interpolation
Yao Luo
,
Ming Yang
,
Jinhui Tang
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
GRFormer: Grouped Residual Self-Attention for Lightweight Single Image Super-Resolution
Yuzhen Li
,
Zehang Deng
,
Yuxin Cao
,
Lihua Liu
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Fine-grained Semantic Alignment with Transferred Person-SAM for Text-based Person Retrieval
Yihao Wang
,
Meng Yang
,
Rui Cao
Published: 20 Jul 2024, Last Modified: 05 Aug 2024
MM2024 Poster
Readers:
Everyone
Fact: Teaching MLLMs with Faithful, Concise and Transferable Rationales
Minghe Gao
,
Shuang Chen
,
Liang Pang
,
Yuan Yao
,
Jisheng Dang
,
Wenqiao Zhang
,
Juncheng Li
,
Siliang Tang
,
Yueting Zhuang
,
Tat-Seng Chua
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Trust Prophet or Not? Taking a Further Verification Step toward Accurate Scene Text Recognition
Anna Zhu
,
Ke Xiao
,
Bo Zhou
,
Runmin Wang
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
MSFNet: Multi-Scale Fusion Network for Brain-Controlled Speaker Extraction
Cunhang Fan
,
Jingjing Zhang
,
Hongyu Zhang
,
Wang Xiang
,
Jianhua Tao
,
Xinhui Li
,
Jiangyan Yi
,
Dianbo Sui
,
Zhao Lv
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
CAD Translator: An Effective Drive for Text to 3D Parametric Computer-Aided Design Generative Modeling
Xueyang Li
,
Yu Song
,
Yunzhong Lou
,
Xiangdong Zhou
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Document Registration: Towards Automated Labeling of Pixel-Level Alignment Between Warped-Flat Documents
Weiguang Zhang
,
Qiufeng Wang
,
Kaizhu Huang
,
Xiaowei Huang
,
Fengjun Guo
,
Xiaomeng Gu
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Modeling Event-level Causal Representation for Video Classification
Yuqing Wang
,
Lei Meng
,
Haokai Ma
,
Yuqing Wang
,
Haibei HUANG
,
Xiangxu Meng
Published: 20 Jul 2024, Last Modified: 04 Aug 2024
MM2024 Oral
Readers:
Everyone
FacialPulse: An Efficient RNN-based Depression Detection via Temporal Facial Landmarks
Ruiqi Wang
,
Jinyang Huang
,
Jie Zhang
,
Xin Liu
,
Xiang Zhang
,
Zhi Liu
,
Peng Zhao
,
Sigui Chen
,
Xiao Sun
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Oral
Readers:
Everyone
FedEvalFair: A Privacy-Preserving and Statistically Grounded Federated Fairness Evaluation Framework
Zhongchi Wang
,
Hailong Sun
,
Zhengyang Zhao
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
A Principled Approach to Natural Language Watermarking
Zhe Ji
,
Qiansiqi Hu
,
Yicheng Zheng
,
Liyao Xiang
,
Xinbing Wang
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
PASSION: Towards Effective Incomplete Multi-Modal Medical Image Segmentation with Imbalanced Missing Rates
Junjie Shi
,
Caozhi Shang
,
Zhaobin Sun
,
Li Yu
,
Xin Yang
,
Zengqiang Yan
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Oral
Readers:
Everyone
UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation
Xinyao Liao
,
Wei Wei
,
Dangyang Chen
,
Yuanyuanfu
Published: 20 Jul 2024, Last Modified: 04 Aug 2024
MM2024 Poster
Readers:
Everyone
Domain Knowledge Enhanced Vision-Language Pretrained Model for Dynamic Facial Expression Recognition
Liupeng Li
,
Yuhua Zheng
,
Shupeng Liu
,
Xiaoyin Xu
,
Taihao Li
Published: 20 Jul 2024, Last Modified: 05 Aug 2024
MM2024 Poster
Readers:
Everyone
PD-Refiner: An Underlying Surface Inheritance Refiner with Adaptive Edge-Aware Supervision for Point Cloud Denoising
Chengwei Zhang
,
Xueyi Zhang
,
Xianghu Yue
,
Mingrui Lao
,
Tao Jiang
,
Jiawei Wang
,
Fubo Zhang
,
Longyong Chen
Published: 20 Jul 2024, Last Modified: 01 Aug 2024
MM2024 Poster
Readers:
Everyone
Convert and Speak: Zero-shot Accent Conversion with Minimum Supervision
zhijun jia
,
Huaying Xue
,
Xiulian Peng
,
Yan Lu
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Highly Transferable Diffusion-based Unrestricted Adversarial Attack on Pre-trained Vision-Language Models
Wenzhuo Xu
,
Kai Chen
,
Ziyi Gao
,
Zhipeng Wei
,
Jingjing Chen
,
Yu-Gang Jiang
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization
GeunTaek Lim
,
Hyunwoo Kim
,
Joonsoo Kim
,
Yukyung Choi
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Universal Frequency Domain Perturbation for Single-Source Domain Generalization
liu chuang
,
Yichao Cao
,
Haogang Zhu
,
Xiu Su
Published: 20 Jul 2024, Last Modified: 02 Aug 2024
MM2024 Poster
Readers:
Everyone
SAM-MIL: A Spatial Contextual Aware Multiple Instance Learning Approach for Whole Slide Image Classification
Heng Fang
,
Sheng Huang
,
Wenhao Tang
,
Luwen Huangfu
,
Bo Liu
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Knowledge-Aware Artifact Image Synthesis with LLM-Enhanced Prompting and Multi-Source Supervision
Shengguang Wu
,
Zhenglun Chen
,
Qi Su
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
D$^3$U-Net: Dual-Domain Collaborative Optimization Deep Unfolding Network for Image Compressive Sensing
Kai Han
,
Jin Wang
,
Yunhui Shi
,
Nam Ling
,
Baocai Yin
Published: 20 Jul 2024, Last Modified: 01 Aug 2024
MM2024 Poster
Readers:
Everyone
Towards Medical Vision-Language Contrastive Pre-training via Study-Oriented Semantic Exploration
LIU BO
,
LU ZEXIN
,
Yan Wang
Published: 20 Jul 2024, Last Modified: 06 Aug 2024
MM2024 Poster
Readers:
Everyone
GLATrack: Global and Local Awareness for Open-Vocabulary Multiple Object Tracking
Guangyao Li
,
Yajun Jian
,
Yan Yan
,
Hanzi Wang
Published: 20 Jul 2024, Last Modified: 05 Aug 2024
MM2024 Poster
Readers:
Everyone
«
‹
2
3
4
5
6
7
8
9
10
11
›
»