ACMMM 2024 Conference Submissions
Progressive Local and Non-Local Interactive Networks with Deeply Discriminative Training for Image Deraining
Cong Wang, Liyan Wang, Jie Mu, Chengjin Yu, Wei Wang
Published: 20 Jul 2024, Last Modified: 01 Aug 2024
MM2024 Poster
Readers: Everyone
TiVA: Time-Aligned Video-to-Audio Generation
Xihua Wang, Yuyue Wang, Yihan Wu, Ruihua Song, Xu Tan, Zehua Chen, Hongteng Xu, Guodong Sui
Published: 20 Jul 2024, Last Modified: 06 Aug 2024
MM2024 Oral
Readers: Everyone
Collaborative Training of Tiny-Large Vision Language Models
Shichen Lu, Longteng Guo, Wenxuan Wang, Zijia Zhao, Tongtian Yue, Jing Liu, Si Liu
Published: 20 Jul 2024, Last Modified: 05 Aug 2024
MM2024 Poster
Readers: Everyone
Multi-Modal Inductive Framework for Text-Video Retrieval
Qian Li, Yucheng Zhou, Cheng Ji, Feihong Lu, Jianian Gong, Shangguang Wang, Jianxin Li
Published: 20 Jul 2024, Last Modified: 01 Aug 2024
MM2024 Poster
Readers: Everyone
Serial section microscopy image inpainting guided by axial optical flow
Yiran Cheng, Bintao He, Renmin Han, Fa Zhang
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers: Everyone
3D Question Answering for City Scene Understanding
Penglei Sun, Yaoxian Song, Xiang Liu, Xiaofei Yang, Qiang Wang, tiefeng li, Yang YANG, Xiaowen Chu
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers: Everyone
Semantic-aware Representation Learning for Homography Estimation
Yuhan Liu, Qianxin Huang, Siqi Hui, Jingwen Fu, Sanping Zhou, Kangyi Wu, Pengna Li, Jinjun Wang
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers: Everyone
DySarl: Dynamic Structure-Aware Representation Learning for Multimodal Knowledge Graph Reasoning
Kangzheng Liu, Feng Zhao, Yu Yang, Guandong Xu
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers: Everyone
Deformable NeRF using Recursively Subdivided Tetrahedra
Zherui Qiu, Chenqu Ren, Kaiwen Song, Xiaoyi Zeng, Leyuan Yang, Juyong Zhang
Published: 20 Jul 2024, Last Modified: 04 Aug 2024
MM2024 Poster
Readers: Everyone
UniGM: Unifying Multiple Pre-trained Graph Models via Adaptive Knowledge Aggregation
Jintao Chen, Fan Wang, Shengye Pang, Siwei Tan, Mingshuai Chen, Tiancheng Zhao, Meng Xi, Jianwei Yin
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers: Everyone
Self-Supervised Emotion Representation Disentanglement for Speech-Preserving Facial Expression Manipulation
Zhihua Xu, Tianshui Chen, Zhijing Yang, Chunmei Qing, Yukai Shi, Liang Lin
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Oral
Readers: Everyone
Streamable Portrait Video Editing with Probabilistic Pixel Correspondence
Xiaodi Li
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers: Everyone
Multi-Modality Co-Learning for Efficient Skeleton-based Action Recognition
Jinfu Liu, Chen Chen, Mengyuan Liu
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers: Everyone
Semantic Alignment for Multimodal Large Language Models
Tao Wu, Mengze Li, Jingyuan Chen, Wei Ji, Wang Lin, Jinyang Gao, Kun Kuang, Zhou Zhao, Fei Wu
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers: Everyone
Realistic Full-Body Motion Generation from Sparse Tracking with State Space Model
Kun Dong, Jian Xue, Zehai Niu, Xing Lan, Ke Lv, Qingyuan Liu, Xiaoyu Qin
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Oral
Readers: Everyone
AutoGraph: Enabling Visual Context via Graph Alignment in Open Domain Multi-Modal Dialogue Generation
Deji Zhao, Donghong Han, Ye Yuan, Bo Ning, Li Mengxiang, Zhongjiang He, Shuangyong Song
Published: 20 Jul 2024, Last Modified: 01 Aug 2024
MM2024 Poster
Readers: Everyone
AesMamba: Universal Image Aesthetic Assessment with State Space Models
Fei Gao, Yuhao Lin, Jiaqi Shi, Maoying Qiao, Nannan Wang
Published: 20 Jul 2024, Last Modified: 06 Aug 2024
MM2024 Oral
Readers: Everyone
AdaCoder: Adaptive Prompt Compression for Programmatic Visual Question Answering
Mahiro Ukai, Shuhei Kurita, Atsushi Hashimoto, Yoshitaka Ushiku, Nakamasa Inoue
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers: Everyone
Frequency Guidance Matters: Skeletal Action Recognition by Frequency-Aware Mixed Transformer
Wenhan Wu, Ce Zheng, Zihao Yang, Chen Chen, Srijan Das, Aidong Lu
Published: 20 Jul 2024, Last Modified: 01 Aug 2024
MM2024 Poster
Readers: Everyone
HybridFlow: Infusing Continuity into Masked Codebook for Extreme Low-Bitrate Image Compression
Lei Lu, Yanyue Xie, Wei Jiang, Wei Wang, Xue Lin, Yanzhi Wang
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers: Everyone
An In-depth Study of Bandwidth Allocation across Media Sources in Video Conferencing
Zejun Zhang, Xiao Zhu, Anlan Zhang, Feng Qian
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Oral
Readers: Everyone
Learning to Handle Large Obstructions in Video Frame Interpolation
Libo Long, Xiao Hu, Jochen Lang
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers: Everyone
256 Metaverse Recording Dataset
Patrick Steinert, Stefan Wagenpfeil, Ingo Frommholz, Matthias Hemmje
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers: Everyone
Hearing the Moment with MetaEcho! From Physical to Virtual in Synchronized Sound Recording
Zheng WEI, Yuzheng Chen, Wai Tong, Xuan Zong, Huamin Qu, Xian Xu, LIK-HANG LEE
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers: Everyone
Generating Prompts in Latent Space for Rehearsal-free Continual Learning
Chengyi Yang, Wentao Liu, Shisong Chen, Jiayin Qi, Aimin Zhou
Published: 20 Jul 2024, Last Modified: 06 Aug 2024
MM2024 Poster
Readers: Everyone