Toggle navigation
OpenReview
.net
Login
×
Back to
ACMMM
ACMMM 2024 Conference Submissions
Model-Based Non-Independent Distortion Cost Design for Effective JPEG Steganography
Yuanfeng Pan
,
Wenkang Su
,
Jiangqun Ni
,
Qingliang Liu
,
Yulin Zhang
,
Donghua Jiang
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling
Yixuan Zhou
,
Xiaoyu Qin
,
Zeyu Jin
,
Shuoyi Zhou
,
Shun Lei
,
Songtao Zhou
,
Zhiyong Wu
,
Jia Jia
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Oral
Readers:
Everyone
Towards Trustworthy MetaShopping: Studying Manipulative Audiovisual Designs in Virtual-Physical Commercial Platforms
Esmee Henrieke Anne de Haas
,
LIK-HANG LEE
,
Yiming Huang
,
Carlos BERMEJO FERNANDEZ
,
Pan Hui
,
Zijun Lin
Published: 20 Jul 2024, Last Modified: 05 Aug 2024
MM2024 Oral
Readers:
Everyone
Scene Diffusion: Text-driven Scene Image Synthesis Conditioning on a Single 3D Model
Xuan Han
,
Yihao Zhao
,
Mingyu You
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Oral
Readers:
Everyone
Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning
Yi Bin
,
Junrong Liao
,
Yujuan Ding
,
Haoxuan Li
,
Yang Yang
,
See-Kiong Ng
,
Heng Tao Shen
Published: 20 Jul 2024, Last Modified: 05 Aug 2024
MM2024 Poster
Readers:
Everyone
Stay Focused is All You Need for Adversarial Robustness
Bingzhi Chen
,
Ruihan Liu
,
Yishu Liu
,
Xiaozhao Fang
,
Jiahui Pan
,
Guangming Lu
,
Zheng Zhang
Published: 20 Jul 2024, Last Modified: 05 Aug 2024
MM2024 Poster
Readers:
Everyone
GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting
Hongyun Yu
,
Zhan Qu
,
Qihang Yu
,
Jianchuan Chen
,
Zhonghua Jiang
,
Zhiwen Chen
,
Shengyu Zhang
,
Jimin Xu
,
Fei Wu
,
chengfei lv
,
Gang Yu
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
SpeechCraft: A Fine-Grained Expressive Speech Dataset with Natural Language Description
Zeyu Jin
,
Jia Jia
,
Qixin Wang
,
Kehan Li
,
Shuoyi Zhou
,
Songtao Zhou
,
Xiaoyu Qin
,
Zhiyong Wu
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Mitigating World Biases: A Multimodal Multi-View Debiasing Framework for Fake News Video Detection
Zhi Zeng
,
Minnan Luo
,
Xiangzheng Kong
,
Huan Liu
,
Hao Guo
,
Hao Yang
,
Zihan Ma
,
Xiang Zhao
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
FRADE: Forgery-aware Audio-distilled Multimodal Learning for Deepfake Detection
Fan Nie
,
Jiangqun Ni
,
Jian Zhang
,
Bin Zhang
,
Weizhe Zhang
Published: 20 Jul 2024, Last Modified: 06 Aug 2024
MM2024 Poster
Readers:
Everyone
ReCoS: A Novel Benchmark for Cross-Modal Image-Text Retrieval in Complex Real-Life Scenarios
Xiaojun Chen
,
Jimeng Lou
,
Wenxi Huang
,
Ting Wan
,
Qin Zhang
,
Min Yang
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
LiteQUIC: Improving QoE of Video Streams by Reducing CPU Overhead of QUIC
Pengqiang Bi
,
Yifei Zou
,
Mengbai Xiao
,
Dongxiao Yu
,
yijunli
,
zhixiong.liu
,
qunxie
Published: 20 Jul 2024, Last Modified: 06 Aug 2024
MM2024 Oral
Readers:
Everyone
Dual-Modeling Decouple Distillation for Unsupervised Anomaly Detection
Xinyue Liu
,
Jianyuan Wang
,
Biao Leng
,
Shuo Zhang
Published: 20 Jul 2024, Last Modified: 06 Aug 2024
MM2024 Poster
Readers:
Everyone
Inferring 3D Occupancy Fields through Implicit Reasoning on Silhouette Images
Baorui Ma
,
Yu-Shen Liu
,
Matthias Zwicker
,
Zhizhong Han
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Dual-Branch Fusion with Style Modulation for Cross-Domain Few-Shot Semantic Segmentation
Qiuyu Kong
,
Jiangming Chen
,
Jiang Jie
,
Zanxi Ruan
,
KANG Lai
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Generalizing ISP Model by Unsupervised Raw-to-raw Mapping
Dongyu Xie
,
Chaofan Qiao
,
Lanyue Liang
,
Zhiwen Wang
,
Tianyu Li
,
Qiao Liu
,
Chongyi Li
,
Guoqing Wang
,
Yang Yang
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Oral
Readers:
Everyone
CIEASR:Contextual Image-Enhanced Automatic Speech Recognition for Improved Homophone Discrimination
Ziyi Wang
,
Yiming Rong
,
Deyang Jiang
,
Haoran Wu
,
Shiyu Zhou
,
Bo XU
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Vi2ACT:Video-enhanced Cross-modal Co-learning with Representation Conditional Discriminator for Few-shot Human Activity Recognition
Kang Xia
,
Wenzhong Li
,
Yimiao Shao
,
Sanglu Lu
Published: 20 Jul 2024, Last Modified: 06 Aug 2024
MM2024 Poster
Readers:
Everyone
Combating Visual Question Answering Hallucinations via Robust Multi-Space Co-Debias Learning
Jiawei Zhu
,
Yishu Liu
,
Huanjia Zhu
,
Hui Lin
,
Yuncheng Jiang
,
Zheng Zhang
,
Bingzhi Chen
Published: 20 Jul 2024, Last Modified: 05 Aug 2024
MM2024 Poster
Readers:
Everyone
DAFT-GAN: Dual Affine Transformation Generative Adversarial Network for Text-Guided Image Inpainting
Jihoon Lee
,
Yunhong Min
,
Hwidong Kim
,
Sangtae Ahn
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Bridging Gaps in Content and Knowledge for Multimodal Entity Linking
Pengfei Luo
,
Tong Xu
,
Che Liu
,
Suojuan Zhang
,
Linli Xu
,
Minglei Li
,
Enhong Chen
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
ResVG: Enhancing Relation and Semantic Understanding in Multiple Instances for Visual Grounding
Minghang Zheng
,
Jiahua Zhang
,
Qingchao Chen
,
Yuxin Peng
,
Yang Liu
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Zero-Shot Character Identification and Speaker Prediction in Comics via Iterative Multimodal Fusion
Yingxuan Li
,
Ryota Hinami
,
Kiyoharu Aizawa
,
Yusuke Matsui
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Oral
Readers:
Everyone
Query Augmentation with Brain Signals
Ziyi Ye
,
Jingtao Zhan
,
Qingyao Ai
,
Yiqun LIU
,
Maarten de Rijke
,
Christina Lioma
,
Tuukka Ruotsalo
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Oral
Readers:
Everyone
StableMoFusion: Towards Robust and Efficient Diffusion-based Motion Generation Framework
Yiheng Huang
,
Yang Hui
,
Chuanchen Luo
,
Yuxi Wang
,
Shibiao Xu
,
Zhaoxiang Zhang
,
Man Zhang
,
Junran Peng
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Oral
Readers:
Everyone
«
‹
1
2
3
4
5
6
7
8
9
10
›
»