publications | Weixiang (Sweson) Sun

2025

EfficientLLM: Efficiency in Large Language Models

Zhengqing Yuan^*, Weixiang Sun^*, Yixin Liu^*, and 8 more authors

arXiv preprint arXiv:2505.13840, 2025

SAMed-2: Selective Memory Enhanced Medical Segment Anything Model

Zhiling Yan, Sifan Song, Dingjie Song, and 8 more authors

arXiv preprint arXiv:2507.03698, 2025

LLMs4All: A Review on Large Language Models for Research and Applications in Academic Disciplines

Yanfang Ye^*, Zheyuan Zhang^*, Tianyi Ma^*, and 9 more authors

arXiv preprint arXiv:2509.19580, 2025

2024

Mora: Enabling generalist video generation via a multi-agent framework

Zhengqing Yuan^*, Yixin Liu^*, Weixiang Sun^*, and 8 more authors

arXiv preprint arXiv:2403.13248, 2024

Text-to-video generation has made significant strides, but replicating the capabilities of advanced systems like OpenAI’s Sora remains challenging due to their closed-source nature. Existing open-source methods struggle to achieve comparable performance, often hindered by ineffective agent collaboration and inadequate training data quality. In this paper, we introduce Mora, a novel multi-agent framework that leverages existing open-source modules to replicate Sora’s functionalities. We address these fundamental limitations by proposing three key techniques: (1) multi-agent fine-tuning with a self-modulation factor to enhance inter-agent coordination, (2) a data-free training strategy that uses large models to synthesize training data, and (3) a human-in-the-loop mechanism combined with multimodal large language models for data filtering to ensure high-quality training datasets. Our comprehensive experiments on six video generation tasks demonstrate that Mora achieves performance comparable to Sora on VBench, outperforming existing open-source methods across various tasks. Specifically, in the text-to-video generation task, Mora achieved a Video Quality score of 0.800, surpassing Sora’s 0.797 and outperforming all other baseline models across six key metrics. Additionally, in the image-to-video generation task, Mora achieved a perfect Dynamic Degree score of 1.00, demonstrating exceptional capability in enhancing motion realism and achieving higher Imaging Quality than Sora. These results highlight the potential of collaborative multi-agent systems and human-in-the-loop mechanisms in advancing text-to-video generation. More visualization results of our work are available at https://mora-2025.github.io/.

Bora: Biomedical generalist video generation model

Weixiang Sun^*, Xiaocao You^*, Ruizhe Zheng^*, and 5 more authors

arXiv preprint arXiv:2407.08944, 2024

TTT-Unet: Enhancing U-Net with test-time training layers for biomedical image segmentation

Rong Zhou^*, Zhengqing Yuan^*, Zhiling Yan^*, and 7 more authors

arXiv preprint arXiv:2409.11299, 2024

Medical Unlearnable Examples: Securing Medical Data from Unauthorized Training via Sparsity-Aware Local Masking

Weixiang Sun, Yixin Liu, Zhiling Yan, and 2 more authors

arXiv preprint arXiv:2403.10573, 2024

AdvLogo: Adversarial patch attack against object detectors based on diffusion models

Boming Miao, Chunxiao Li, Yao Zhu, and 4 more authors

arXiv preprint arXiv:2409.07002, 2024

2023

Research and application of improved neural network optimization algorithm

Weixiang Sun

In Third International Symposium on Computer Engineering and Intelligent Communications (ISCEIC 2022), 2023