Toggle navigation
OpenReview
.net
Login
×
Back to
ACMMM
ACMMM 2024 Conference Submissions
Differential-Perceptive and Retrieval-Augmented MLLM for Change Captioning
Xian Zhang
,
Haokun Wen
,
Jianlong Wu
,
Pengda Qin
,
Hui Xue'
,
Liqiang Nie
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Monocular Human-Object Reconstruction in the Wild
Chaofan Huo
,
Ye Shi
,
Jingya Wang
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
QNCD: Quantization Noise Correction for Diffusion Models
Huanpeng Chu
,
Wei Wu
,
Chengjie Zang
,
Kun Yuan
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
R2SFD: Improving Single Image Reflection Removal using Semantic Feature Dictionary
Green Rosh
,
Pawan Prasad B H
,
LOKESH R BOREGOWDA
,
Kaushik Mitra
Published: 20 Jul 2024, Last Modified: 06 Aug 2024
MM2024 Poster
Readers:
Everyone
Uncovering Capabilities of Model Pruning in Graph Contrastive Learning
Xueyuan Chen
,
Shangzhe Li
,
Junran Wu
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Unifying Spike Perception and Prediction: A Compact Spike Representation Model using Multi-scale Correlation
Kexiang Feng
,
Chuanmin Jia
,
Siwei Ma
,
Wen Gao
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Conditional Diffusion Model for Open-ended Video Question Answering
Xinyue Liu
,
Jiahui Wan
,
Linlin Zong
,
Bo Xu
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
A Progressive Skip Reasoning Fusion Method for Multi-Modal Classification
Qian Guo
,
Xinyan Liang
,
Yuhua Qian
,
Zhihua Cui
,
Jie Wen
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Oral
Readers:
Everyone
HS-Surf: A Novel High-Frequency Surface Shell Radiance Field to Improve Large-Scale Scene Rendering
Jiongming Qin
,
Fei LUO
,
Tuo Cao
,
Wenju Xu
,
Chunxia Xiao
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
An Active Masked Attention Framework for Many-to-Many Cross-Domain Recommendations
Feng Zhu
,
Xinxing Yang
,
Longfei Li
,
JUN ZHOU
Published: 20 Jul 2024, Last Modified: 06 Aug 2024
MM2024 Poster
Readers:
Everyone
HighlightRemover: Spatially Valid Pixel Learning for Image Specular Highlight Removal
Ling Zhang
,
Yidong Ma
,
Zhi Jiang
,
Weilei He
,
Zhongyun Bao
,
Gang Fu
,
Wenju Xu
,
Chunxia Xiao
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
New Job, New Gender? Measuring the Social Bias in Image Generation Models
Wenxuan Wang
,
Haonan Bai
,
Jen-tse Huang
,
Yuxuan WAN
,
Youliang Yuan
,
Haoyi Qiu
,
Nanyun Peng
,
Michael Lyu
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Oral
Readers:
Everyone
HeadSetOff: Enabling Photorealistic Video Conferencing on Economical VR Headsets
Yili Jin
,
Duan Xize
,
Fangxin Wang
,
Xue Liu
Published: 20 Jul 2024, Last Modified: 05 Aug 2024
MM2024 Oral
Readers:
Everyone
MFRGN: Multi-scale Feature Representation Generalization Network For Ground-to-Aerial Geo-localization
Yuntao Wang
,
Jinpu Zhang
,
Ruonan Wei
,
Wenbo Gao
,
Yuehuan Wang
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
MTSNet: Joint Feature Adaptation and Enhancement for Text-Guided Multi-view Martian Terrain Segmentation
Yang Fang
,
Xuefeng Rao
,
Xinbo Gao
,
Weisheng Li
,
Min Zijian
Published: 20 Jul 2024, Last Modified: 05 Aug 2024
MM2024 Poster
Readers:
Everyone
3D Scene De-occlusion in Neural Radiance Fields: A Framework for Obstacle Removal and Realistic Inpainting
Yi LIU
,
Xinyi LI
,
Shuai Wenjing
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Crossmodal Few-shot 3D Point Cloud Semantic Segmentation via View Synthesis
Ziyu Zhao
,
Pingping Cai
,
Canyu Zhang
,
Xiaoguang Li
,
Song Wang
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
FewVS: A Vision-Semantics Integration Framework for Few-Shot Image Classification
Zhuoling Li
,
Yong Wang
,
Kaitong Li
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
AdapMTL: Adaptive Pruning Framework for Multitask Learning Model
Mingcan Xiang
,
Steven Jiaxun Tang
,
Qizheng Yang
,
Hui Guan
,
Tongping Liu
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Agent Aggregator with Mask Denoise Mechanism for Histopathology Whole Slide Image Analysis
Xitong Ling
,
Minxi Ouyang
,
Yizhi Wang
,
Xinrui Chen
,
Renao Yan
,
Hongbochu
,
Junru Cheng
,
Tian Guan
,
Xiaoping Liu
,
Sufang Tian
,
Yonghong He
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Bridging the Modality Gap: Dimension Information Alignment and Sparse Spatial Constraint for Image-Text Matching
Xiang Ma
,
Xuemei Li
,
Lexin Fang
,
Caiming Zhang
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Toward Timeliness-Enhanced Loss Recovery for Large-Scale Live Streaming
Bo Wu
,
Tong Li
,
cheng luo
,
Xu Yan
,
FuYu Wang
,
Xinle Du
,
Ke Xu
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Oral
Readers:
Everyone
Vaccine Misinformation Detection in X using Cooperative Multimodal Framework
Usman Naseem
,
Adam Dunn
,
Matloob Khushi
,
Jinman Kim
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Oral
Readers:
Everyone
Selection and Reconstruction of Key Locals: A Novel Specific Domain Image-Text Retrieval Method
Yu Liao
,
Xinfeng Zhang
,
Rui Yang
,
Jianwei Tao
,
Bai Liu
,
Zhipeng Hu
,
Shuang Wang
,
Zeng Zhao
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Sparse Query Dense: Enhancing 3D Object Detection with Pseudo points
Mo Yujian
,
Yan Wu
,
Junqiao Zhao
,
Hou zhenjie
,
weiquan Huang
,
Hu Yinghao
,
Jijun Wang
,
Jun Yan
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Oral
Readers:
Everyone
«
‹
6
7
8
9
10
11
12
13
14
15
›
»