2025
MoE-NuSeg: Enhancing nuclei segmentation in histology images with a two-stage Mixture of Experts network, Alexandria Engineering Journal, 2025, SCI Q1. [pdf]
Observe finer to select better: Learning key frame extraction via semantic coherence for dynamic facial expression recognition in the wild, Information Sciences, 2025, SCI Q1. [pdf]
2024
DeTrack: In-model Latent Denoising Learning for Visual Object Tracking, Advances in Neural Information Processing Systems (NeurIPS), 2024, CCF A. [pdf]
MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution, Advances in Neural Information Processing Systems (NeurIPS), 2024, CCF A. [pdf]
LCGen: Mining in Low-Certainty Generation for View-consistent Text-to-3D, Advances in Neural Information Processing Systems (NeurIPS), 2024, CCF A. [pdf]
Panovos: Bridging non-panoramic and panoramic views with transformer for video segmentation, European conference on computer vision (ECCV), 2024, CCF B. [pdf]
Onevos: unifying video object segmentation with all-in-one transformer framework, European conference on computer vision (ECCV), 2024, CCF B. [pdf]
Adaptive multi-modal fusion of spatially variant kernel refinement with diffusion model for blind image super-resolution, European conference on computer vision (ECCV), 2024, CCF B. [pdf]
Sampling to distill: Knowledge transfer from open-world data, ACM International Conference on Multimedia (ACMMM), 2024, CCF A. [pdf]
Visual-Language Collaborative Representation Network for Broad-Domain Few-Shot Image Classification, ACM International Conference on Multimedia (ACMMM), 2024, CCF A. [pdf]
TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning, ACM International Conference on Multimedia (ACMMM), 2024, CCF A. [pdf]
X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation, ACM International Conference on Multimedia (ACMMM), 2024, CCF A. [pdf]
All rivers run into the sea: Unified Modality Brain-Inspired Emotional Central Mechanism, ACM International Conference on Multimedia (ACMMM), 2024, CCF A. [pdf]
Suppressing Uncertainties in Degradation Estimation for Blind Super-Resolution, ACM International Conference on Multimedia (ACMMM), 2024, CCF A. [pdf]
Boosting the transferability of adversarial attacks with global momentum initialization, Expert Systems with Applications, 2024, SCI Q1. [pdf]
OneTracker: Unifying Visual Object Tracking with Foundation Models and Efficient Tuning, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024, CCF A. [pdf]
De-confounded Data-free Knowledge Distillation for Handling Distribution Shifts, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024, CCF A. [pdf]
Pixel-level Semantic Correspondence through Layout-aware Representation Learning and Multi-scale Matching Integration, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024, CCF A. [pdf]
A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image Synthesis, AAAI Conference on Artificial Intelligence, 2024, CCF A. [pdf]
Out of Thin Air: Exploring Data-Free Adversarial Robustness Distillation, AAAI Conference on Artificial Intelligence, 2024, CCF A. [pdf]
MGR3Net: Multigranularity Region Relation Representation Network for Facial Expression Recognition in Affective Robots, IEEE Transactions on Industrial Informatics (TII), 2024, SCI Q1. [pdf]
KADEL: Knowledge-Aware Denoising Learning for Commit Message Generation, ACM Transactions on Software Engineering and Methodology, 2024, CCF A. [pdf]
Correspondence-based Generative Bayesian Deep Learning for semi-supervised volumetric medical image segmentation, Computerized Medical Imaging and Graphics, 2024, SCI Q2. [pdf]
Attention in Attention for PET-CT Modality Consensus Lung Tumor Segmentation, IEEE International Conference on Multimedia and Expo (ICME), 2024, CCF B. [pdf]
Empower smart cities with sampling-wise dynamic facial expression recognition via frame-sequence contrastive learning, Computer Communications, 2024, SCI Q3. [pdf]
Mixed noise-guided mutual constraint framework for unsupervised anomaly detection in smart industries, Computer Communications, 2024, SCI Q3. [pdf]
A Casting Surface Dataset and Benchmark for Subtle and Confusable Defect Detection in Complex Contexts, IEEE Sensors Journal, 2024, SCI Q3. [pdf]
2023
ADPL: Adaptive Dual Path Learning for Domain Adaptation of Semantic Segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023, CCF A. [pdf]
Reading Relevant Feature from Global Representation Memory for Visual Object Tracking, Advances in Neural Information Processing Systems (NeurIPS), 2023, CCF A. [pdf]
Content-based Unrestricted Adversarial Attack, Advances in Neural Information Processing Systems (NeurIPS), 2023, CCF A. [pdf][code]
MSC-AD: A Multiscene Unsupervised Anomaly Detection Dataset for Small Defect Detection of Casting Surface, IEEE Transactions on Industrial Informatics (TII), 2023, SCI Q1. [pdf]
Towards End-to-End Unsupervised Saliency Detection with Self-Supervised Top-Down Context, ACM International Conference on Multimedia (ACMMM), 2023, CCF A. [pdf]
A Capture to Registration Framework for Realistic Image Super-Resolution in the Industry Environment, ACM International Conference on Multimedia (ACMMM), 2023, CCF A. [pdf]
SimulFlow: Simultaneously Extracting Feature and Identifying Target for Unsupervised Video Object Segmentation, ACM International Conference on Multimedia (ACMMM), 2023, CCF A. [pdf]
Exploring the Adversarial Robustness of Video Object Segmentation via One-shot Adversarial Attacks, ACM International Conference on Multimedia (ACMMM), 2023, CCF A. [pdf]
Towards Decision-based Sparse Attacks on Video Recognition, ACM International Conference on Multimedia (ACMMM), 2023, CCF A. [pdf]
Freq-HD: An Interpretable Frequency-based High-Dynamics Affective Clip Selection Method for in-the-Wild Facial Expression Recognition in Videos, ACM International Conference on Multimedia (ACMMM), 2023, CCF A. [pdf]
Plug-and-Play Feature Generation for Few-Shot Medical Image Classification, 2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2023, CCF B. [pdf]
Multi-domain Information Fusion for Key-Points Guided GAN Inversion, Chinese Conference on Pattern Recognition and Computer Vision (PRCV), 2023, CCF C. [pdf]
Query-Efficient Decision-based Black-Box Patch Attack, IEEE Transactions on Information Forensics and Security (TIFS), 2023, CCF A. [pdf]
GSFormer: Geometric-Spatial Transformer on Point Cloud Completion, IEEE International Conference on Multimedia and Expo (ICME), 2023, CCF B. [pdf]
RankDNN: Learning to Rank for Few-Shot Learning, AAAI Conference on Artificial Intelligence, 2023, CCF A. [pdf][code]
Memory Network with Pixel-level Spatio-Temporal Learning for Visual Object Tracking, IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2023, SCI Q1. [pdf]
Go Closer To See Better: Camouflaged Object Detection via Object Area Amplification and Figure-ground Conversion, IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2023, SCI Q1. [pdf]
Hierarchical Visual Categories Modeling: A Joint Representation Learning and Density Estimation Framework for Out-of-Distribution Detection, IEEE/CVF International Conference on Computer Vision (ICCV), 2023, CCF A. [pdf] [code]
Efficient Decision-based Black-box Patch Attacks on Video Recognition, IEEE/CVF International Conference on Computer Vision (ICCV), 2023, CCF A. [pdf][code]
Improving Generalization in Visual Reinforcement Learning via Conflict-aware Gradient Agreement Augmentation, IEEE/CVF International Conference on Computer Vision (ICCV), 2023, CCF A. [pdf]
Lvos: A benchmark for long-term video object segmentation, IEEE/CVF International Conference on Computer Vision (ICCV), 2023, CCF A. [pdf][code]
TINYCOD: Tiny and Effective Model for Camouflaged Object Detection, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023, CCF B. [pdf]
Robust Video Object Segmentation with Restricted Attention, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023, CCF B. [pdf]
Gated Enhanced RPN and Hybrid-View for Few-Shot Object Detection, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023, CCF B. [pdf]
Yuzheng Wang, Zhaoyu Chen, Dingkang Yang, Yang Liu, Siao Liu, Wenqiang Zhang, Lizhe Qi, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023, CCF B. [pdf]
CiCo: Domain-Aware Sign Language Retrieval via Cross-Lingual Contrastive Learning, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, CCF A. [pdf]
Correspondence Transformers With Asymmetric Feature Learning and Matching Flow Super-Resolution, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, CCF A. [pdf]
MISC210K: A Large-Scale Dataset for Multi-Instance Semantic Correspondence, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, CCF A. [pdf][code]
2022
Adaptive online mutual learning bi-decoders for video object segmentation, IEEE Transactions on Image Processing (TIP), 2022, CCF A. [pdf]
Shape Matters: Deformable Patch Attack, European conference on computer vision (ECCV), 2022, CCF B. [pdf][code]
MHKD-MVQA: Multimodal Hierarchical Knowledge Distillation for Medical Visual Question Answering, IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2022, CCF B. [pdf]
A large-scale empirical study of commit message generation: models, datasets and evaluation, Empirical Software Engineering, 2022, SCI Q2. [pdf]
Dual Cross-Attention for Video Object Segmentation via Uncertainty Refinement, IEEE Transactions on Multimedia (TMM), 2022, SCI Q1. [pdf]
Weakly Supervised Video Salient Object Detection via Point Supervision, ACM International Conference on Multimedia (ACMMM), 2022, CCF A. [pdf]
DPCNet: Dual Path Multi-Excitation Collaborative Network for Facial Expression Representation Learning in Videos, ACM International Conference on Multimedia (ACMMM), 2022, CCF A. [pdf]
Deep Knowledge Reasoning guided Disease Prediction, IEEE International Conference on Systems, Man, and Cybernetics (SMC), 2022, CCF C. [pdf]
Translational relation embeddings for multi-hop knowledge base question answering, Journal of Web Semantics, 2022, CCF B. [pdf]
Tokenizing Features for Fast Video Object Segmentation, IEEE International Conference on Multimedia and Expo (ICME), 2022, CCF B. [pdf]
Position-aware Joint Entity and Relation Extraction with Attention Mechanism, International Joint Conference on Artificial Intelligence (IJCAI), 2022, CCF A. [pdf]
Fast Video Object Segmentation via Dynamic YOLACT, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, CCF B. [pdf]
Type-Aware Medical Visual Question Answering, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, CCF B. [pdf]
Selective Scale Cascade Attention Network for Breast Cancer Histopathology Image Classification, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, CCF B. [pdf]
Efficient universal shuffle attack for visual object tracking, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, CCF B. [pdf]
Attribute Surrogates Learning and Spectral Tokens Pooling in Transformers for Few-shot Learning, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022, CCF A. [pdf][code]
FERV39k: A Large-Scale Multi-Scene Dataset for Facial Expression Recognition in Videos, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022, CCF A. [pdf][code]
Towards Practical Certifiable Patch Defense with Vision Transformer, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022, CCF A. [pdf]
A Systematic Review on Affective Computing: Emotion Models, Databases, and Recent Advances, Information Fusion, 2022, SCI Q1. [pdf]
Weakly-Supervised Salient Object Detection Using Point Supervison, AAAI Conference on Artificial Intelligence, 2022, CCF A. [pdf][code]
2021
Adaptive selection of reference frames for video object segmentation, IEEE Transactions on Image Processing (TIP), 2021, CCF A. [pdf]
On the evaluation of commit message generation models: an experimental study, IEEE International Conference on Software Maintenance and Evolution (ICSME), 2021, CCF B. [pdf]
Modeling Event-Pair Relations in External Knowledge Graphs for Script Reasoning, Findings of the Association for Computational Linguistics (ACL), 2021, CCF A. [pdf]
Attention Driven Self-Similarity Capture for Motion Deblurring, IEEE International Conference on Multimedia and Expo (ICME), 2021, CCF B. [pdf]
Rpattack: Refined Patch Attack on General Object Detectors, IEEE International Conference on Multimedia and Expo (ICME), 2021, CCF B. [pdf]
Dual-Stream Network Based On Global Guidance for Salient Object Detection, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021, CCF B. [pdf]
Adaptable Ensemble Distillation, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021, CCF B. [pdf]
Triple Sequence Generative Adversarial Nets for Unsupervised Image Captioning, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021, CCF B. [pdf]
Improving Zero-Shot Cross-lingual Transfer for Multilingual Question Answering over Knowledge Graph, North American Chapter of the Association for Computational Linguistics (NAACL), 2021, CCF B. [pdf]
Dual Path Learning for Domain Adaptation of Semantic Segmentation, IEEE/CVF International Conference on Computer Vision (ICCV), 2021, CCF A. [pdf]
Global Cognition and Local Perception Network for Blind Image Deblurring, MultiMedia Modeling (MMM), 2021, CCF C. [pdf]