2026:
Zhang P, Zhang M, Wang T, et al. Beyond static cues: Detecting fine-grained forgeries via temporal inconsistencies in facial dynamics[J]. Information Sciences, 2026: 123165.
Zhang P, Yang Xinyu. Audio-to-3D: One-shot talking face generation with disentangled latent codes and diffusion control[J]. Neurocomputing, 2026: 133132.
Zhang P, Mu Z, Ji S, Wang X, Yang Xinyu. Disentangle to Edit: Instruction-Guided Latent Manipulation for 3D Facial Video Consistency[J]. Multimedia Systems, 2026.
Li, X., Yang Xinyu, Zhang, S., Sun, J. (2026). User-Adjustable Image Cropping Based on Visual Semantic Awareness. In: Kittler, J., et al. Pattern Recognition and Computer Vision. PRCV 2025. Lecture Notes in Computer Science, vol 16277. Springer, Singapore. https://doi.org/10.1007/978-981-95-5679-3_19
S. Zhang, Yang Xinyu and X. Bai, "Crop the Way You Like: Personalized Image Cropping by Integrating Subjective and Objective Features," in IEEE Transactions on Multimedia, doi: 10.1109/TMM.2026.3651046.(CCF-A,TMM)
Zhang S, Yang Xinyu. RetouchAgent: Towards Interactive and Explainable Image Retouching with MLLM Agents[C]//Proceedings of the AAAI Conference on Artificial Intelligence. 2026, 40(35): 29901-29910.(CCF-A,AAAI)
Chen H, Yang Xinyu, Zhu J, et al. Skill Path: Unveiling Language Skills from Circuit Graphs[C]//Proceedings of the AAAI Conference on Artificial Intelligence. 2026, 40(36): 30210-30217.(CCF-A,AAAI)
Chen H, Zhu J, Yang Xinyu, et al. CLUE: Conflict-guided Localization for LLM Unlearning Framework[C]//The Fourteenth International Conference on Learning Representations.(CCF-A,ICLR)
Du K, Kang Y, Yang Xinyu, et al. ST-HHOL: Spatio-Temporal Hierarchical Hypergraph Online Learning for Crime Prediction[C]//The Fourteenth International Conference on Learning Representations.(CCF-A,ICLR)
Du K, Yang Xinyu, Chen H. CaASR: A Causal Lens for Refining Temporal Action Segmentation[J]. IEEE Transactions on Multimedia, 2026.(CCF-A,TMM)
Hu G, Kollias D, Yang Xinyu. From Cognitive Priors to Instance Semantics: A Unified Framework for Multi-task Affective Computing[C]//Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2026: 8551-8562.
2025:
Luo J, Yang Xinyu, Wei J. Exploring Classical Piano Performance Generation with Expressive Music Variational AutoEncoder[C]//2025 IEEE International Conference on Systems, Man, and Cybernetics (SMC). IEEE, 2025: 1817-1822.
Luo J, Yang Xinyu, Herremans D. BandCondiNet: Parallel Transformers-based Conditional Popular Music Generation with Multi-View Features[J]. Expert Systems with Applications, 2025: 130059.
Lv Y, Luo J, Ju B, et al. Small Tunes Transformer: Exploring Macro and Micro-level Hierarchies for Skeleton-Conditioned Melody Generation[C]//International Conference on Multimedia Modeling. Singapore: Springer Nature Singapore, 2025: 30-43.
Li C, Yang Xinyu, Yang W, et al. VaF-LangSplat: Voxel-Aware Fusion Language Gaussian Splatting[C]//Proceedings of the 33rd ACM International Conference on Multimedia. 2025: 4952-4961.(CCF-A,MM)
Chen H, Yang Xinyu, Zhu J, et al. Quantifying semantic emergence in language models[C]//Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2025: 12041-12054.(CCF-A,ACL)
Chen H, Yang Xinyu, Du K. Towards Causal Relationship in Indefinite Data: New Datasets and Baseline Model[J]. Journal of Data-centric Machine Learning Research, 2025.
Chen H, Zhu J, Yang Xinyu, et al. Rethinking Circuit Completeness in Language Models: AND, OR, and ADDER Gates[C]//The Thirty-ninth Annual Conference on Neural Information Processing Systems.(CCF-A,NeurIPS)
Du K, Yang Xinyu, Chen H. Enhancing multivariate spatio-temporal forecasting via complete dynamic causal modeling[J]. Neural Networks, 2025, 191: 107826.
Zhao D, Yang Xinyu, Chen H. Debiasing the Fine-Grained Classification Task in LLMs with Bias-Aware PEFT[C]//Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2025: 14731-14746.(CCF-A,ACL)
Yang Xinyu, Zhao D, Chen H, et al. How to Enhance Causal Discrimination of Emotional Utterances: A Case on LLMs[J]. IEEE Transactions on Affective Computing, 2025.
Hu G, Kollias D, Yang Xinyu. Grounding Emotion Recognition with Visual Prototypes: VEGA-Revisiting CLIP in MERC[C]//Proceedings of the 33rd ACM International Conference on Multimedia. 2025: 5667-5676.(CCF-A,MM)
Mu Z, Yang Xinyu, Wang G. SepALM: Audio Language Models Are Error Correctors for Robust Speech Separation[C]//Proceedings of the 34th International Joint Conference on Artificial Intelligence, IJCAI 2025. International Joint Conferences on Artificial Intelligence, 2025: 8204-8212.(CCF-A,IJCAI)
Mu Z, Chen R, Li A, et al. From Continuous to Discrete: Cross-Domain Collaborative General Speech Enhancement via Hierarchical Language Models[C]//Proceedings of the 33rd ACM International Conference on Multimedia. 2025: 219-228.(CCF-A,MM)
2024:
2023:
2022:
2021:
2020: