2025
Dynamic Semantic-Aware Correlation Modeling for UAV Tracking, Advances in Neural Information Processing Systems (NeurIPS), 2025, CCF A.
Boosting Adversarial Transferability with Spatial Adversarial Alignment, Advances in Neural Information Processing Systems (NeurIPS), 2025, CCF A.
Enhancing Diffusion-based Unrestricted Adversarial Attacks via Adversary Preferences Alignment, Advances in Neural Information Processing Systems (NeurIPS), 2025, CCF A.
ForceVLA: Enhancing VLA Models with a Force-aware MoE for Contact-rich Manipulation, Advances in Neural Information Processing Systems (NeurIPS), 2025, CCF A.
Agent RL Scaling Law: Spontaneous Code Execution for Mathematical Problem Solving, Advances in Neural Information Processing Systems (NeurIPS), 2025, CCF A.
LVOS: A Benchmark for Large-scale Long-term Video Object Segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025, JCR Q1.
Exploring the Adversarial Robustness of Face Forgery Detection with Decision-based Black-box Attacks, Knowledge-Based Systems, 2025, JCR Q1. [pdf]
Improving Adversarial Transferability with Neighbourhood Gradient Information, Applied Soft Computing, 2025, JCR Q1. [pdf]
ClickVOS: Click Video Object Segmentation, IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2025, SCI Q1. [pdf]
Progressive Representation Learning for Weakly-Supervised Camouflage Object Detection, ACM International Conference on Multimedia (ACMMM), 2025, CCF A.
Scoring, Remember, and Reference: Catching Camouflaged Objects in Videos, IEEE/CVF International Conference on Computer Vision (ICCV), 2025, CCF A. [pdf]
Synthesizing Near-Boundary OOD Samples for Out-of-Distribution Detection, IEEE/CVF International Conference on Computer Vision (ICCV), 2025, CCF A. [pdf]
General Compression Framework for Efficient Transformer Object Tracking, IEEE/CVF International Conference on Computer Vision (ICCV), 2025, CCF A. [pdf]
VideoPure: Diffusion-based Adversarial Purification for Video Recognition, IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2025, SCI Q1. [pdf]
A Survey on RGB, 3D, and Multimodal Approaches for Unsupervised Industrial Image Anomaly Detection, Information Fusion, 2025, SCI Q1. [pdf]
D2SP: Dynamic Dual-Stage Purification Framework for Dual Noise Mitigation in Vision-based Affective Recognition, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025, CCF A. [pdf]
Self-supervised video object segmentation via pseudo label rectification, Pattern Recognition, 2025, CCF A. [pdf]
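VideoSAM: Open-World Video Segmentation, IEEE International Conference on Robotics and Automation (ICRA), 2025, CCF B. [pdf]
Component-aware Unsupervised Logical Anomaly Generation for Industrial Anomaly Detection, IEEE International Conference on Robotics and Automation (ICRA), 2025, CCF B. [pdf]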
MoE-NuSeg: Enhancing nuclei segmentation in histology images with a two-stage Mixture of Experts network, Alexandria Engineering Journal, 2025, SCI Q1. [pdf]
Observe finer to select better: Learning key frame extraction via semantic coherence for dynamic facial expression recognition in the wild, Information Sciences, 2025, SCI Q1. [pdf]
OUS: Bridging Scene Context and Facial Features to Overcome the Rigid Cognitive Problem, AAAI Conference on Artificial Intelligence, 2025, CCF A. [pdf]
OpenVIS: Open-vocabulary Video Instance Segmentation, AAAI Conference on Artificial Intelligence, 2025, CCF A. [pdf]
2024
Enhancing Environmental Robustness in Few-shot Learning via Conditional Representation Learning, IEEE Transactions on Image Processing (TIP), 2024, CCF A. [pdf]
DeTrack: In-model Latent Denoising Learning for Visual Object Tracking, Advances in Neural Information Processing Systems (NeurIPS), 2024, CCF A. [pdf]
MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution, Advances in Neural Information Processing Systems (NeurIPS), 2024, CCF A. [pdf]
LCGen: Mining in Low-Certainty Generation for View-consistent Text-to-3D, Advances in Neural Information Processing Systems (NeurIPS), 2024, CCF A. [pdf]
PanoVOS: Bridging non-panoramic and panoramic views with transformer for video segmentation, European Conference on Computer Vision (ECCV), 2024, CCF B. [pdf]
OneVOS: Unifying video object segmentation with all-in-one transformer framework, European Conference on Computer Vision (ECCV), 2024, CCF B. [pdf]
Adaptive multi-modal fusion of spatially variant kernel refinement with diffusion model for blind image super-resolution, European Conference on Computer Vision (ECCV), 2024, CCF B. [pdf]
Sampling to distill: Knowledge transfer from open-world data, ACM International Conference on Multimedia (ACMMM), 2024, CCF A. [pdf]
Visual-Language Collaborative Representation Network for Broad-Domain Few-Shot Image Classification, ACM International Conference on Multimedia (ACMMM), 2024, CCF A. [pdf]
TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning, ACM International Conference on Multimedia (ACMMM), 2024, CCF A. [pdf]
X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation, ACM International Conference on Multimedia (ACMMM), 2024, CCF A. [pdf]
All rivers run into the sea: Unified Modality Brain-Inspired Emotional Central Mechanism, ACM International Conference on Multimedia (ACMMM), 2024, CCF A. [pdf]
Suppressing Uncertainties in Degradation Estimation for Blind Super-Resolution, ACM International Conference on Multimedia (ACMMM), 2024, CCF A. [pdf]
Boosting the transferability of adversarial attacks with global momentum initialization, Expert Systems with Applications, 2024, SCI Q1. [pdf]
OneTracker: Unifying Visual Object Tracking with Foundation Models and Efficient Tuning, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024, CCF A. [pdf]
De-confounded Data-free Knowledge Distillation for Handling Distribution Shifts, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024, CCF A. [pdf]
Pixel-level Semantic Correspondence through Layout-aware Representation Learning and Multi-scale Matching Integration, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024, CCF A. [pdf]
A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image Synthesis, AAAI Conference on Artificial Intelligence, 2024, CCF A. [pdf]
Out of Thin Air: Exploring Data-Free Adversarial Robustness Distillation, AAAI Conference on Artificial Intelligence, 2024, CCF A. [pdf]
MGR3Net: Multigranularity Region Relation Representation Network for Facial Expression Recognition in Affective Robots, IEEE Transactions on Industrial Informatics (TII), 2024, SCI Q1. [pdf]
KADEL: Knowledge-Aware Denoising Learning for Commit Message Generation, ACM Transactions on Software Engineering and Methodology, 2024, CCF A. [pdf]
Correspondence-based Generative Bayesian Deep Learning for semi-supervised volumetric medical image segmentation, Computerized Medical Imaging and Graphics, 2024, SCI Q2. [pdf]
Attention in Attention for PET-CT Modality Consensus Lung Tumor Segmentation, IEEE International Conference on Multimedia and Expo (ICME), 2024, CCF B. [pdf]
Empower smart cities with sampling-wise dynamic facial expression recognition via frame-sequence contrastive learning, Computer Communications, 2024, SCI Q3. [pdf]
Mixed noise-guided mutual constraint framework for unsupervised anomaly detection in smart industries, Computer Communications, 2024, SCI Q3. [pdf]
A Casting Surface Dataset and Benchmark for Subtle and Confusable Defect Detection in Complex Contexts, IEEE Sensors Journal, 2024, SCI Q3. [pdf]
2023
ADPL: Adaptive Dual Path Learning for Domain Adaptation of Semantic Segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023, CCF A. [pdf]
Reading Relevant Feature from Global Representation Memory for Visual Object Tracking, Advances in Neural Information Processing Systems (NeurIPS), 2023, CCF A. [pdf]
Content-based Unrestricted Adversarial Attack, Advances in Neural Information Processing Systems (NeurIPS), 2023, CCF A. [pdf][code]
MSC-AD: A Multiscene Unsupervised Anomaly Detection Dataset for Small Defect Detection of Casting Surface, IEEE Transactions on Industrial Informatics (TII), 2023, SCI Q1. [pdf]
Towards End-to-End Unsupervised Saliency Detection with Self-Supervised Top-Down Context, ACM International Conference on Multimedia (ACMMM), 2023, CCF A. [pdf]
A Capture to Registration Framework for Realistic Image Super-Resolution in the Industry Environment, ACM International Conference on Multimedia (ACMMM), 2023, CCF A. [pdf]
SimulFlow: Simultaneously Extracting Feature and Identifying Target for Unsupervised Video Object Segmentation, ACM International Conference on Multimedia (ACMMM), 2023, CCF A. [pdf]
Exploring the Adversarial Robustness of Video Object Segmentation via One-shot Adversarial Attacks, ACM International Conference on Multimedia (ACMMM), 2023, CCF A. [pdf]
Towards Decision-based Sparse Attacks on Video Recognition, ACM International Conference on Multimedia (ACMMM), 2023, CCF A. [pdf]
Freq-HD: An Interpretable Frequency-based High-Dynamics Affective Clip Selection Method for in-the-Wild Facial Expression Recognition in Videos, ACM International Conference on Multimedia (ACMMM), 2023, CCF A. [pdf]
Plug-and-Play Feature Generation for Few-Shot Medical Image Classification, IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2023, CCF B. [pdf]
Multi-domain Information Fusion for Key-Points Guided GAN Inversion, Chinese Conference on Pattern Recognition and Computer Vision (PRCV), 2023, CCF C. [pdf]
Query-Efficient Decision-based Black-Box Patch Attack, IEEE Transactions on Information Forensics and Security (TIFS), 2023, CCF A. [pdf]
GSFormer: Geometric-Spatial Transformer on Point Cloud Completion, IEEE International Conference on Multimedia and Expo (ICME), 2023, CCF B. [pdf]
RankDNN: Learning to Rank for Few-Shot Learning, AAAI Conference on Artificial Intelligence, 2023, CCF A. [pdf][code]
Memory Network with Pixel-level Spatio-Temporal Learning for Visual Object Tracking, IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2023, SCI Q1. [pdf]
Go Closer To See Better: Camouflaged Object Detection via Object Area Amplification and Figure-ground Conversion, IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2023, SCI Q1. [pdf]
Hierarchical Visual Categories Modeling: A Joint Representation Learning and Density Estimation Framework for Out-of-Distribution Detection, IEEE/CVF International Conference on Computer Vision (ICCV), 2023, CCF A. [pdf][code]
Efficient Decision-based Black-box Patch Attacks on Video Recognition, IEEE/CVF International Conference on Computer Vision (ICCV), 2023, CCF A. [pdf][code]
Improving Generalization in Visual Reinforcement Learning via Conflict-aware Gradient Agreement Augmentation, IEEE/CVF International Conference on Computer Vision (ICCV), 2023, CCF A. [pdf]
LVOS: A Benchmark for Long-term Video Object Segmentation, IEEE/CVF International Conference on Computer Vision (ICCV), 2023, CCF A. [pdf][code]
TINYCOD: Tiny and Effective Model for Camouflaged Object Detection, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023, CCF B. [pdf]
Robust Video Object Segmentation with Restricted Attention, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023, CCF B. [pdf]
Gated Enhanced RPN and Hybrid-View for Few-Shot Object Detection, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023, CCF B. [pdf]
Yuzheng Wang, Zhaoyu Chen, Dingkang Yang, Yang Liu, Siao Liu, Wenqiang Zhang, Lizhe Qi, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023, CCF B. [pdf]
CiCo: Domain-Aware Sign Language Retrieval via Cross-Lingual Contrastive Learning, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, CCF A. [pdf]
Correspondence Transformers With Asymmetric Feature Learning and Matching Flow Super-Resolution, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, CCF A. [pdf]
MISC210K: A Large-Scale Dataset for Multi-Instance Semantic Correspondence, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, CCF A. [pdf][code]
2022
Adaptive online mutual learning bi-decoders for video object segmentation, IEEE Transactions on Image Processing (TIP), 2022, CCF A. [pdf]
Shape Matters: Deformable Patch Attack, European Conference on Computer Vision (ECCV), 2022, CCF B. [pdf][code]
MHKD-MVQA: Multimodal Hierarchical Knowledge Distillation for Medical Visual Question Answering, IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2022, CCF B. [pdf]
A large-scale empirical study of commit message generation: models, datasets and evaluation, Empirical Software Engineering, 2022, SCI Q2. [pdf]
Dual Cross-Attention for Video Object Segmentation via Uncertainty Refinement, IEEE Transactions on Multimedia (TMM), 2022, SCI Q1. [pdf]
Weakly Supervised Video Salient Object Detection via Point Supervision, ACM International Conference on Multimedia (ACMMM), 2022, CCF A. [pdf]
DPCNet: Dual Path Multi-Excitation Collaborative Network for Facial Expression Representation Learning in Videos, ACM International Conference on Multimedia (ACMMM), 2022, CCF A. [pdf]
Deep Knowledge Reasoning guided Disease Prediction, IEEE International Conference on Systems, Man, and Cybernetics (SMC), 2022, CCF C. [pdf]
Translational relation embeddings for multi-hop knowledge base question answering, Journal of Web Semantics, 2022, CCF B. [pdf]
Tokenizing Features for Fast Video Object Segmentation, IEEE International Conference on Multimedia and Expo (ICME), 2022, CCF B. [pdf]
Position-aware Joint Entity and Relation Extraction with Attention Mechanism, International Joint Conference on Artificial Intelligence (IJCAI), 2022, CCF A. [pdf]
Fast Video Object Segmentation via Dynamic YOLACT, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, CCF B. [pdf]
Type-Aware Medical Visual Question Answering, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, CCF B. [pdf]
Selective Scale Cascade Attention Network for Breast Cancer Histopathology Image Classification, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, CCF B. [pdf]
Efficient universal shuffle attack for visual object tracking, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, CCF B. [pdf]
Attribute Surrogates Learning and Spectral Tokens Pooling in Transformers for Few-shot Learning, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, CCF A. [pdf][code]
FERV39k: A Large-Scale Multi-Scene Dataset for Facial Expression Recognition in Videos, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, CCF A. [pdf][code]
Towards Practical Certifiable Patch Defense with Vision Transformer, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, CCF A. [pdf]
A Systematic Review on Affective Computing: Emotion Models, Databases, and Recent Advances, Information Fusion, 2022, SCI Q1. [pdf]
Weakly-Supervised Salient Object Detection Using Point Supervison, AAAI Conference on Artificial Intelligence, 2022, CCF A. [pdf][code]
2021
Adaptive selection of reference frames for video object segmentation, IEEE Transactions on Image Processing (TIP), 2021, CCF A. [pdf]
On the evaluation of commit message generation models: an experimental study, IEEE International Conference on Software Maintenance and Evolution (ICSME), 2021, CCF B. [pdf]
Modeling Event-Pair Relations in External Knowledge Graphs for Script Reasoning, Findings of the Association for Computational Linguistics (ACL), 2021, CCF A. [pdf]
Attention Driven Self-Similarity Capture for Motion Deblurring, IEEE International Conference on Multimedia and Expo (ICME), 2021, CCF B. [pdf]
Rpattack: Refined Patch Attack on General Object Detectors, IEEE International Conference on Multimedia and Expo (ICME), 2021, CCF B. [pdf]
Dual-Stream Network Based On Global Guidance for Salient Object Detection, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021, CCF B. [pdf]
Adaptable Ensemble Distillation, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021, CCF B. [pdf]
Triple Sequence Generative Adversarial Nets for Unsupervised Image Captioning, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021, CCF B. [pdf]
Improving Zero-Shot Cross-lingual Transfer for Multilingual Question Answering over Knowledge Graph, North American Chapter of the Association for Computational Linguistics (NAACL), 2021, CCF B. [pdf]
Dual Path Learning for Domain Adaptation of Semantic Segmentation, IEEE/CVF International Conference on Computer Vision (ICCV), 2021, CCF A. [pdf]
Global Cognition and Local Perception Network for Blind Image Deblurring, MultiMedia Modeling (MMM), 2021, CCF C. [pdf]