ACMMM 2024 Conference Submissions
Temporal Enhancement for Video Affective Content Analysis
Xin Li, Shangfei Wang, Xuandong Huang
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Oral
Readers:
Everyone
A Chinese Multimodal Social Video Dataset for Controversy Detection
Tianjiao Xu, Aoxuan Chen, Yuxi Zhao, Jinfei Gao, Tian Gan
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-ShoT Learning
Haojian Huang, Xiaozhennn Qiao, Zhuo Chen, Haodong Chen, Binyu Li, Zhe Sun, Mulin Chen, Xuelong Li
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Exploring Matching Rates: From Key Point Selection to Camera Relocalization
Hu Lin, Chengjiang Long, Yifeng Fei, qianchen xia, Erwei Yin, Baocai Yin, Xin Yang
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Oral
Readers:
Everyone
GaussianTalker: Real-Time Talking Head Synthesis with 3D Gaussian Splatting
Kyusun Cho, JoungBin Lee, Heeji Yoon, Yeobin Hong, Jaehoon Ko, Sangjun Ahn, Seungryong Kim
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Modal-Enhanced Semantic Modeling for Fine-Grained 3D Human Motion Retrieval
Haoyu Shi, Huaiwen Zhang
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
SSAT-Adapter: Enhancing Vision-Language Model Few-shot Learning with Auxiliary Tasks
Bowen Chen, Yun Sing Koh, Gillian Dobbie
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
InMu-Net: Advancing Multi-modal Intent Detection via Information Bottleneck and Multi-sensory Processing
Zhihong Zhu, Xuxin Cheng, Zhaorun Chen, Yuyan Chen, Yunyan Zhang, Xian Wu, Yefeng Zheng, Bowen Xing
Published: 20 Jul 2024, Last Modified: 05 Aug 2024
MM2024 Oral
Readers:
Everyone
Detecting Multimodal Situations with Insufficient Context and Abstaining from Baseless Predictions
Junzhang Liu, Zhecan Wang, Hammad Ayyubi, Haoxuan You, Chris Thomas, Rui Sun, Shih-Fu Chang, Kai-Wei Chang
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Learning A Low-Level Vision Generalist via Visual Task Prompt
Xiangyu Chen, Yihao Liu, Yuandong Pu, Wenlong Zhang, Jiantao Zhou, Yu Qiao, Chao Dong
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
PSM: Learning Probabilistic Embeddings for Multi-scale Zero-shot Soundscape Mapping
Subash Khanal, Eric Xing, Srikumar Sastry, Aayush Dhakal, Zhexiao Xiong, Adeel Ahmad, Nathan Jacobs
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
IGSPAD: Inverting 3D Gaussian Splatting for Pose-agnostic Anomaly Detection
Bolin Jiang, Yuqiu Xie, Jiawei Li, Naiqi Li, Bin Chen, Shu-Tao Xia
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
EAGLE: Egocentric AGgregated Language-video Engine
Jing Bi, Yunlong Tang, Luchuan Song, Ali Vosoughi, Nguyen Nguyen, Chenliang Xu
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
GRACE: GRadient-based Active Learning with Curriculum Enhancement for Multimodal Sentiment Analysis
Xinyu Li, Wenqing Ye, Yueyi Zhang, Xiaoyan Sun
Published: 20 Jul 2024, Last Modified: 06 Aug 2024
MM2024 Poster
Readers:
Everyone
In Situ 3D Scene Synthesis for Ubiquitous Embodied Interfaces
Haiyan Jiang, Song Leiyu, dongdong weng, Zhe Sun, Li Huiying, Xiaonuo Dongye, Zhenliang Zhang
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Laplacian Matrix Learning for Point Cloud Attribute Compression with Ternary Search-Based Adaptive Block Partition
Changhao Peng, Wei Gao
Published: 20 Jul 2024, Last Modified: 05 Aug 2024
MM2024 Poster
Readers:
Everyone
Improving Open-World Classification with Disentangled Foreground and Background Features
Choubo Ding, Guansong Pang
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Fractional Correspondence Framework in Detection Transformer
Masoumeh Zareapoor, Pourya Shamsolmoali, Huiyu Zhou, Yue Lu, Salvador Garcia
Published: 20 Jul 2024, Last Modified: 05 Aug 2024
MM2024 Poster
Readers:
Everyone
Reconstructing, Understanding, and Analyzing Relief Type Cultural Heritage from a Single Old Photo
Jiao PAN, Liang Li, Hiroshi Yamaguchi, Kyoko Hasegawa, Fadjar Ibnu Thufail, Brahmantara, Xiaojuan Ban, Satoshi Tanaka
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Oral
Readers:
Everyone
3D Priors-Guided Diffusion for Blind Face Restoration
Xiaobin Lu, Xiaobin Hu, Jun Luo, zhuben, paulruan, Wenqi Ren
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
Reliable Model Watermarking: Defending Against Theft without Compromising on Evasion
Hongyu Zhu, Sichu liang, Wentao Hu, Li Fangqi, Ju Jia, Shi-Lin Wang
Published: 20 Jul 2024, Last Modified: 06 Aug 2024
MM2024 Poster
Readers:
Everyone
DIG: Complex Layout Document Image Generation with Authentic-looking Text for Enhancing Layout Analysis
Dehao Ying, Fengchang Yu, Haihua Chen, Wei Lu
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone
HyperTime: Hyperparameter Optimization for Combating Temporal Distribution Shifts
Shaokun Zhang, Yiran Wu, Zhonghua Zheng, Qingyun Wu, Chi Wang
Published: 20 Jul 2024, Last Modified: 04 Aug 2024
MM2024 Poster
Readers:
Everyone
MMAL: Multi-Modal Analytic Learning for Exemplar-Free Audio-Visual Class Incremental Tasks
Xianghu Yue, Xueyi Zhang, Yiming Chen, Chengwei Zhang, Mingrui Lao, Huiping Zhuang, Xinyuan Qian, Haizhou Li
Published: 20 Jul 2024, Last Modified: 31 Jul 2024
MM2024 Poster
Readers:
Everyone
Similarity Preserving Transformer Cross-Modal Hashing for Video-Text Retrieval
qianxinhuang, Siyao Peng, Xiaobo Shen, Yun-Hao Yuan, Shirui Pan
Published: 20 Jul 2024, Last Modified: 21 Jul 2024
MM2024 Poster
Readers:
Everyone