Updated on 2025.07.01
Model Merging
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-26 | DuET: Dual Incremental Object Detection via Exemplar-Free Task Arithmetic | Munish Monga et.al. | 2506.21260 | null |
2025-06-22 | SE-Merging: A Self-Enhanced Approach for Dynamic Model Merging | Zijun Chen et.al. | 2506.18135 | null |
2025-06-19 | Subspace-Boosted Model Merging | Ronald Skorobogat et.al. | 2506.16506 | null |
2025-06-18 | video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language Models | Changli Tang et.al. | 2506.15220 | null |
2025-06-17 | Knowledge Adaptation as Posterior Correction | Mohammad Emtiyaz Khan et.al. | 2506.14262 | null |
2025-06-16 | Position: Pause Recycling LoRAs and Prioritize Mechanisms to Uncover Limits and Effectiveness | Mei-Yen Chen et.al. | 2506.13479 | null |
2025-06-16 | CALM: Consensus-Aware Localized Merging for Multi-Task Learning | Kunda Yan et.al. | 2506.13406 | null |
2025-06-16 | The Butterfly Effect: Neural Network Training Trajectories Are Highly Sensitive to Initial Conditions | Devin Kwok et.al. | 2506.13234 | null |
2025-06-14 | Model Merging for Knowledge Editing | Zichuan Fu et.al. | 2506.12384 | link |
2025-06-16 | Generative Representational Learning of Foundation Models for Recommendation | Zheli Zhou et.al. | 2506.11999 | null |
2025-06-13 | A correlation-permutation approach for speech-music encoders model merging | Fabian Ritter-Gutierrez et.al. | 2506.11403 | null |
2025-06-11 | Harmonizing and Merging Source Models for CLIP-based Domain Generalization | Yuhe Ding et.al. | 2506.09446 | null |
2025-06-13 | Merging Smarter, Generalizing Better: Enhancing Model Merging on OOD Data | Bingjie Zhang et.al. | 2506.09093 | null |
2025-06-13 | Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model | Ailin Huang et.al. | 2506.08967 | null |
2025-06-11 | Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models | Yanzhao Zhang et.al. | 2506.05176 | null |
2025-06-05 | StatsMerging: Statistics-Guided Model Merging via Task-Specific Teacher Distillation | Ranjith Merugu et.al. | 2506.04567 | link |
2025-06-05 | HALoS: Hierarchical Asynchronous Local SGD over Slow Networks for Geo-Distributed Large Language Model Training | Geon-Woo Kim et.al. | 2506.04531 | null |
2025-06-04 | Out-of-Distribution Graph Models Merging | Yidi Wang et.al. | 2506.03674 | null |
2025-05-30 | Continual Learning in Vision-Language Models via Aligned Model Merging | Ghada Sokar et.al. | 2506.03189 | null |
2025-06-03 | FroM: Frobenius Norm-Based Data-Free Adaptive Model Merging | Zijian Li et.al. | 2506.02478 | null |
2025-06-02 | Is Extending Modality The Right Path Towards Omni-Modality? | Tinghui Zhu et.al. | 2506.01872 | null |
2025-06-01 | FedRPCA: Enhancing Federated LoRA Aggregation Using Robust PCA | Divyansh Jhunjhunwala et.al. | 2506.01194 | null |
2025-05-31 | AutoMixAlign: Adaptive Data Mixing for Multi-Task Preference Optimization in LLMs | Nicholas E. Corrado et.al. | 2506.00569 | null |
2025-05-29 | Towards Minimizing Feature Drift in Model Merging: Layer-wise Task Vector Fusion for Adaptive Knowledge Integration | Wenju Sun et.al. | 2505.23859 | null |
2025-05-29 | Understanding Mode Connectivity via Parameter Space Symmetry | Bo Zhao et.al. | 2505.23681 | null |
2025-05-29 | Merge-Friendly Post-Training Quantization for Multi-Target Domain Adaptation | Juncheol Shin et.al. | 2505.23651 | link |
2025-05-29 | Merge Hijacking: Backdoor Attacks to Model Merging of Large Language Models | Zenghui Yuan et.al. | 2505.23561 | null |
2025-05-29 | Navigating the Accuracy-Size Trade-Off with Flexible Model Merging | Akash Dhasade et.al. | 2505.23209 | null |
2025-05-29 | Decom-Renorm-Merge: Model Merging on the Right Space Improves Multitasking | Yuatyong Chaichana et.al. | 2505.23117 | null |
2025-05-28 | Unraveling LoRA Interference: Orthogonal Subspaces for Robust Model Merging | Haobo Zhang et.al. | 2505.22934 | null |
2025-05-28 | Learning Composable Chains-of-Thought | Fangcong Yin et.al. | 2505.22635 | null |
2025-05-29 | Train with Perturbation, Infer after Merging: A Two-Stage Framework for Continual Learning | Haomiao Qiu et.al. | 2505.22389 | null |
2025-05-29 | Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition | Hanting Chen et.al. | 2505.22375 | null |
2025-05-28 | LaMDAgent: An Autonomous Framework for Post-Training Pipeline Optimization via LLM Agents | Taro Yano et.al. | 2505.21963 | null |
2025-06-03 | Why Do More Experts Fail? A Theoretical Analysis of Model Merging | Zijing Wang et.al. | 2505.21226 | link |
2025-05-26 | Robust fine-tuning of speech recognition models via model merging: application to disordered speech | Alexandre Ducorroy et.al. | 2505.20477 | null |
2025-05-26 | SeMe: Training-Free Language Model Merging via Semantic Alignment | Jian Gu et.al. | 2505.20144 | null |
2025-05-26 | Unifying Multimodal Large Language Model Capabilities and Modalities via Model Merging | Yongxian Wei et.al. | 2505.19892 | link |
2025-05-24 | Neural Parameter Search for Slimmer Fine-Tuned Models and Better Transfer | Guodong Du et.al. | 2505.18713 | null |
2025-05-23 | The Unreasonable Effectiveness of Model Merging for Cross-Lingual Transfer in LLMs | Lucas Bandarkar et.al. | 2505.18356 | null |
2025-05-23 | AstroMLab 4: Benchmark-Topping Performance in Astronomy Q&A with a 70B-Parameter Domain-Specialized Reasoning Model | Tijmen de Haan et.al. | 2505.17592 | null |
2025-05-23 | Analyzing Mitigation Strategies for Catastrophic Forgetting in End-to-End Training of Spoken Language Models | Chi-Yuan Hsiao et.al. | 2505.17496 | null |
2025-05-22 | Locate-then-Merge: Neuron-Level Parameter Fusion for Mitigating Catastrophic Forgetting in Multimodal LLMs | Zeping Yu et.al. | 2505.16703 | null |
2025-05-22 | CodeMerge: Codebook-Guided Model Merging for Robust Test-Time Adaptation in Autonomous Driving | Huitong Yang et.al. | 2505.16524 | null |
2025-05-22 | NAN: A Training-Free Solution to Coefficient Estimation in Model Merging | Chongjie Si et.al. | 2505.16148 | null |
2025-05-21 | Merge to Mix: Mixing Datasets via Model Merging | Zhixu Silvia Tao et.al. | 2505.16066 | null |
2025-05-21 | Decouple and Orthogonalize: A Data-Free Framework for LoRA Merging | Shenghe Zheng et.al. | 2505.15875 | null |
2025-05-21 | Model Merging is Secretly Certifiable: Non-Vacuous Generalisation Bounds for Low-Shot Learning | Taehoon Kim et.al. | 2505.15798 | null |
2025-05-20 | Local Mixtures of Experts: Essentially Free Test-Time Training via Model Merging | Ryo Bertolissi et.al. | 2505.14136 | null |
2025-05-20 | Activation-Guided Consensus Merging for Large Language Models | Yuxuan Yao et.al. | 2505.14009 | null |
2025-05-18 | Scalable Strategies for Continual Learning with Replay | Truman Hickok et.al. | 2505.12512 | null |
2025-05-22 | Model Merging in Pre-training of Large Language Models | Yunshui Li et.al. | 2505.12082 | null |
2025-05-17 | MINGLE: Mixtures of Null-Space Gated Low-Rank Experts for Test-Time Continual Model Merging | Zihuan Qiu et.al. | 2505.11883 | null |
2025-05-16 | Mergenetic: a Simple Evolutionary Model Merging Library | Adrian Robert Minut et.al. | 2505.11427 | link |
2025-05-16 | RanDeS: Randomized Delta Superposition for Multi-Model Compression | Hangyu Zhou et.al. | 2505.11204 | link |
2025-05-16 | Hybrid Monetary Ecosystems: Integrating Stablecoins and Fiat in the Future of Currency Systems | Hongzhe Wen et.al. | 2505.10997 | null |
2025-05-16 | MergeBench: A Benchmark for Merging Domain-Specialized LLMs | Yifei He et.al. | 2505.10833 | link |
2025-05-21 | A Modular Approach for Clinical SLMs Driven by Synthetic Data with Pre-Instruction Tuning, Model Merging, and Clinical-Tasks Alignment | Jean-Philippe Corbeil et.al. | 2505.10717 | null |
2025-05-14 | Chisme: Fully Decentralized Differentiated Deep Learning for Edge Intelligence | Harikrishna Kuttivelil et.al. | 2505.09854 | null |
2025-05-12 | MixBridge: Heterogeneous Image-to-Image Backdoor Attack through Mixture of Schrödinger Bridges | Shixi Qin et.al. | 2505.08809 | link |
2025-05-13 | Aya Vision: Advancing the Frontier of Multilingual Multimodality | Saurabh Dash et.al. | 2505.08751 | null |
2025-05-14 | CAT Merging: A Training-Free Approach for Resolving Conflicts in Model Merging | Wenju Sun et.al. | 2505.06977 | null |
2025-05-10 | QoS-Efficient Serving of Multiple Mixture-of-Expert LLMs Using Partial Runtime Reconfiguration | HamidReza Imani et.al. | 2505.06481 | null |
2025-05-08 | Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging | Shiqi Chen et.al. | 2505.05464 | link |
2025-04-29 | Unified Multi-Task Learning & Model Fusion for Efficient Language Model Guardrailing | James O’Neill et.al. | 2504.19333 | null |
2025-04-26 | Dynamic Fisher-weighted Model Merging via Bayesian Optimization | Sanwoo Lee et.al. | 2504.18992 | null |
2025-04-24 | PatientDx: Merging Large Language Models for Protecting Data-Privacy in Healthcare | Jose G. Moreno et.al. | 2504.17360 | null |
2025-04-20 | Mitigating Parameter Interference in Model Merging via Sharpness-Aware Fine-Tuning | Yeoreum Lee et.al. | 2504.14662 | link |
2025-04-17 | ImPart: Importance-Aware Delta-Sparsification for Improved Model Compression and Merging in LLMs | Yan Yang et.al. | 2504.13237 | null |
2025-04-16 | Enhanced Battery Capacity Estimation in Data-Limited Scenarios through Swarm Learning | Jiawei Zhang et.al. | 2504.12444 | null |
2025-04-16 | DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging | Tianhui Song et.al. | 2504.12364 | link |
2025-04-15 | Single-Input Multi-Output Model Merging: Leveraging Foundation Models for Dense Multi-Task Learning | Juan Garcia Giraldo et.al. | 2504.11268 | null |
2025-04-15 | Leveraging Submodule Linearity Enhances Task Arithmetic Performance in LLMs | Rui Dai et.al. | 2504.10902 | link |
2025-04-24 | FedMerge: Federated Personalization via Model Merging | Shutong Chen et.al. | 2504.06768 | null |
2025-04-15 | SEA-LION: Southeast Asian Languages in One Network | Raymond Ng et.al. | 2504.05747 | null |
2025-04-06 | MASS: MoErging through Adaptive Subspace Selection | Donato Crisostomi et.al. | 2504.05342 | null |
2025-04-06 | Exact Unlearning of Finetuning Data via Model Merging at Scale | Kevin Kuo et.al. | 2504.04626 | null |
2025-03-28 | Breach in the Shield: Unveiling the Vulnerabilities of Large Language Models | Runpeng Dai et.al. | 2504.03714 | null |
2025-04-03 | BECAME: BayEsian Continual Learning with Adaptive Model MErging | Mei Li et.al. | 2504.02666 | null |
2025-04-02 | OpenThaiGPT 1.6 and R1: Thai-Centric Open Source and Reasoning Large Language Models | Sumeth Yuenyong et.al. | 2504.01789 | null |
2025-04-14 | Command A: An Enterprise-Ready Large Language Model | Team Cohere et.al. | 2504.00698 | null |
2025-03-31 | AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization | Yiyang Du et.al. | 2503.23733 | link |
2025-03-29 | Efficient Inference for Large Reasoning Models: A Survey | Yue Liu et.al. | 2503.23077 | link |
2025-03-28 | Breaking Language Barriers in Visual Language Models via Multilingual Textual Regularization | Iñigo Pikabea et.al. | 2503.22577 | null |
2025-03-28 | AdaRank: Adaptive Rank Pruning for Enhanced Model Merging | Chanhyuk Lee et.al. | 2503.22178 | link |
2025-03-27 | Model Assembly Learning with Heterogeneous Layer Weight Merging | Yi-Kai Zhang et.al. | 2503.21657 | null |
2025-03-27 | Reinforced Model Merging | Jiaqi Han et.al. | 2503.21272 | link |
2025-03-27 | ZJUKLAB at SemEval-2025 Task 4: Unlearning via Model Merging | Haoming Xu et.al. | 2503.21088 | link |
2025-03-26 | Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging | Han Wu et.al. | 2503.20641 | link |
2025-03-23 | Personalized Language Models via Privacy-Preserving Evolutionary Model Merging | Kyuyoung Kim et.al. | 2503.18008 | null |
2025-03-21 | SafeMERGE: Preserving Safety Alignment in Fine-Tuned Large Language Models via Selective Layer-Wise Model Merging | Aladin Djuhera et.al. | 2503.17239 | link |
2025-03-25 | FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization | Hao Mark Chen et.al. | 2503.12649 | link |
2025-03-12 | Efficient Multi-Task Inferencing: Model Merging with Gromov-Wasserstein Feature Alignment | Luyang Fang et.al. | 2503.09774 | null |
2025-03-12 | From Task-Specific Models to Unified Systems: A Review of Model Merging Approaches | Wei Ruan et.al. | 2503.08998 | null |
2025-03-11 | Modular Customization of Diffusion Models via Blockwise-Parameterized Low-Rank Adaptation | Mingkang Zhu et.al. | 2503.08575 | null |
2025-03-11 | Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors | Runxi Cheng et.al. | 2503.08099 | null |
2025-03-10 | Self-supervised Normality Learning and Divergence Vector-guided Model Merging for Zero-shot Congenital Heart Disease Detection in Fetal Ultrasound Videos | Pramit Saha et.al. | 2503.07799 | null |
2025-03-08 | Disrupting Model Merging: A Parameter-Level Defense Without Sacrificing Accuracy | Wei Junhao et.al. | 2503.07661 | null |
2025-03-10 | Task Vector Quantization for Memory-Efficient Model Merging | Youngeun Kim et.al. | 2503.06921 | null |
2025-03-12 | Analyzing the Role of Permutation Invariance in Linear Mode Connectivity | Keyao Zhan et.al. | 2503.06001 | null |
2025-03-07 | Disentangling Task Interference within Neurons: Model Merging in Alignment with Neuronal Mechanisms | Zitao Fang et.al. | 2503.05320 | null |
2025-03-05 | Extrapolation Merging: Keep Improving With Extrapolation and Merging | Yiguan Lin et.al. | 2503.04834 | null |
2025-03-07 | LEWIS (LayEr WIse Sparsity) – A Training Free Guided Model Merging Approach | Hetarth Chopra et.al. | 2503.03874 | null |
2025-03-27 | GNNMerge: Merging of GNN Models Without Accessing Training Data | Vipul Garg et.al. | 2503.03384 | link |
2025-03-03 | Superficial Self-Improved Reasoners Benefit from Model Merging | Xiangchi Yuan et.al. | 2503.02103 | null |
2025-02-26 | CABS: Conflict-Aware and Balanced Sparsification for Enhancing Model Merging | Zongzhen Yang et.al. | 2503.01874 | null |
2025-03-03 | Multi-Level Collaboration in Model Merging | Qi Li et.al. | 2503.01268 | null |
2025-03-02 | Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think | Jie Tian et.al. | 2503.00948 | link |
2025-03-01 | BadJudge: Backdoor Vulnerabilities of LLM-as-a-Judge | Terry Tong et.al. | 2503.00596 | null |
2025-02-27 | In-Model Merging for Enhancing the Robustness of Medical Imaging Classification Models | Hu Wang et.al. | 2502.20516 | null |
2025-02-27 | Granite Embedding Models | Parul Awasthy et.al. | 2502.20204 | null |
2025-02-27 | Layer-Aware Task Arithmetic: Disentangling Task-Specific and Instruction-Following Knowledge | Yan-Lun Chen et.al. | 2502.20186 | null |
2025-02-22 | Recurrent Knowledge Identification and Fusion for Language Model Continual Learning | Yujie Feng et.al. | 2502.17510 | null |
2025-02-26 | Low-Rank and Sparse Model Merging for Multi-Lingual Speech Recognition and Translation | Qiuming Zhao et.al. | 2502.17380 | null |
2025-02-24 | Low-rank bias, weight decay, and model merging in neural networks | Ilja Kuzborskij et.al. | 2502.17340 | null |
2025-02-24 | Parameter Efficient Merging for Multimodal Large Language Models with Complementary Parameter Adaptation | Fanhu Zeng et.al. | 2502.17159 | null |
2025-02-24 | LED-Merging: Mitigating Safety-Utility Conflicts in Model Merging with Location-Election-Disjoint | Qianli Ma et.al. | 2502.16770 | null |
2025-02-22 | Merger-as-a-Stealer: Stealing Targeted PII from Aligned LLMs with Model Merging | Lin Lu et.al. | 2502.16094 | null |
2025-02-22 | MedForge: Building Medical Foundation Models Like Open Source Software Development | Zheling Tan et.al. | 2502.16055 | link |
2025-02-21 | Mixup Model Merge: Enhancing Model Merging Performance through Randomized Linear Interpolation | Yue Zhou et.al. | 2502.15434 | link |
2025-02-19 | Transferring Textual Preferences to Vision-Language Understanding through Model Merging | Chen-An Li et.al. | 2502.13487 | null |
2025-02-18 | Scalable Model Merging with Progressive Layer-wise Distillation | Jing Xu et.al. | 2502.12706 | link |
2025-02-18 | Speech-FT: A Fine-tuning Strategy for Enhancing Speech Representation Models Without Compromising Generalization Ability | Tzu-Quan Lin et.al. | 2502.12672 | null |
2025-02-19 | Sens-Merging: Sensitivity-Guided Parameter Balancing for Merging Large Language Models | Shuqi Liu et.al. | 2502.12420 | null |
2025-02-17 | Optimal Brain Iterative Merging: Mitigating Interference in LLM Merging | Zhixiang Wang et.al. | 2502.12217 | null |
2025-02-17 | Merging Language and Domain Specific Models: The Impact on Technical Vocabulary Acquisition | Thibault Rousset et.al. | 2502.12001 | null |
2025-02-17 | Be Cautious When Merging Unfamiliar LLMs: A Phishing Model Capable of Stealing Privacy | Zhenyuan Guo et.al. | 2502.11533 | link |
2025-02-16 | Asymmetric Conflict and Synergy in Post-training for LLM-based Multilingual Machine Translation | Tong Zheng et.al. | 2502.11223 | null |
2025-02-15 | Bone Soups: A Seek-and-Soup Model Merging Approach for Controllable Multi-Objective Generation | Guofu Xie et.al. | 2502.10762 | null |
2025-02-15 | LoRE-Merging: Exploring Low-Rank Estimation For Large Language Model Merging | Zehua Liu et.al. | 2502.10749 | null |
2025-02-15 | 1bit-Merging: Dynamic Quantized Merging for Large Language Models | Shuqi Liu et.al. | 2502.10743 | null |
2025-02-15 | Superpose Singular Features for Model Merging | Haiquan Qiu et.al. | 2502.10698 | null |
2025-02-14 | Towards Watermarking of Open-Source LLMs | Thibaud Gloaguen et.al. | 2502.10525 | null |
2025-02-09 | MERGE$^3$: Efficient Evolutionary Merging on Consumer-grade GPUs | Tommaso Mencattini et.al. | 2502.10436 | link |
2025-02-14 | STAR: Spectral Truncation and Rescale for Model Merging | Yu-Ang Lee et.al. | 2502.10339 | link |
2025-02-17 | Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging – An Open Recipe | Kunat Pipatanakul et.al. | 2502.09056 | null |
2025-02-13 | Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging | Jinluan Yang et.al. | 2502.06876 | null |
2025-02-07 | No Task Left Behind: Isotropic Model Merging with Common and Task-Specific Subspaces | Daniel Marczak et.al. | 2502.04959 | link |
2025-02-11 | Mediator: Memory-efficient LLM Merging with Less Parameter Conflicts and Uncertainty Based Routing | Kunfeng Lai et.al. | 2502.04411 | null |
2025-02-06 | Fine, I’ll Merge It Myself: A Multi-Fidelity Framework for Automated Model Merging | Guinan Su et.al. | 2502.04030 | link |
2025-02-04 | Activation-Informed Merging of Large Language Models | Amin Heyrani Nobari et.al. | 2502.02421 | link |
2025-02-03 | Efficient Model Editing with Task Vector Bases: A Theoretical Framework and Scalable Approach | Siqi Zeng et.al. | 2502.01015 | link |
2025-02-17 | MergeME: Model Merging Techniques for Homogeneous and Heterogeneous MoEs | Yuhang Zhou et.al. | 2502.00997 | null |
2025-01-31 | BTS: Harmonizing Specialized Experts into a Generalist LLM | Qizhen Zhang et.al. | 2502.00075 | null |
2025-01-31 | Norm-Bounded Low-Rank Adaptation | Ruigang Wang et.al. | 2501.19050 | null |
2025-01-25 | Task Arithmetic in Trust Region: A Training-Free Model Merging Approach to Navigate Knowledge Conflicts | Wenju Sun et.al. | 2501.15065 | null |
2025-01-16 | Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging | Anke Tang et.al. | 2501.09522 | link |
2025-01-14 | Selective Attention Merging for low resource tasks: A case study of Child ASR | Natarajan Balaji Shankar et.al. | 2501.08468 | link |
2025-01-11 | Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent | Yongxian Wei et.al. | 2501.01230 | link |
2024-12-29 | Training-free Heterogeneous Model Merging | Zhengqi Xu et.al. | 2501.00061 | link |
2024-12-15 | ChipAlign: Instruction Alignment in Large Language Models for Chip Design via Geodesic Interpolation | Chenhui Deng et.al. | 2412.19819 | null |
2024-12-27 | Safeguard Fine-Tuned LLMs Through Pre- and Post-Tuning Model Merging | Hua Farn et.al. | 2412.19512 | null |
2024-12-26 | Tint Your Models Task-wise for Improved Multi-task Model Merging | Aecheon Jung et.al. | 2412.19098 | link |
2024-12-22 | Parameter-Efficient Interventions for Enhanced Model Merging | Marcin Osial et.al. | 2412.17023 | null |
2024-12-20 | Non-Uniform Parameter-Wise Model Merging | Albert Manuel Orozco Camacho et.al. | 2412.15467 | null |
2024-12-19 | Multi-concept Model Immunization through Differentiable Model Merging | Amber Yijia Zheng et.al. | 2412.15320 | link |
2024-12-18 | Channel Merging: Preserving Specialization for Merged Experts | Mingyang Zhang et.al. | 2412.15283 | null |
2024-12-18 | Rethink the Evaluation Protocol of Model Merging on Classification Task | Fanshuang Kong et.al. | 2412.13526 | link |
2024-12-11 | Revisiting Weight Averaging for Model Merging | Jiho Choi et.al. | 2412.12153 | link |
2024-12-09 | SUPERMERGE: An Approach For Gradient-Based Model Merging | Haoyu Yang et.al. | 2412.10416 | null |
2024-12-11 | How to Weight Multitask Finetuning? Fast Previews via Bayesian Model-Merging | Hugo Monzón Maldonado et.al. | 2412.08147 | null |
2024-12-09 | How to Merge Your Multimodal Models Over Time? | Sebastian Dziadzio et.al. | 2412.06712 | link |
2024-12-08 | Large Language Models Merging for Enhancing the Link Stealing Attack on Graph Neural Networks | Faqian Guan et.al. | 2412.05830 | null |
2024-12-05 | Aya Expanse: Combining Research Breakthroughs for a New Multilingual Frontier | John Dang et.al. | 2412.04261 | null |
2024-12-12 | If You Can’t Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs | Muhammad Khalifa et.al. | 2412.04144 | null |
2024-12-04 | Training-Free Mitigation of Language Reasoning Degradation After Multimodal Instruction Tuning | Neale Ratzlaff et.al. | 2412.03467 | null |
Continual Learning
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-27 | Advancements and Challenges in Continual Reinforcement Learning: A Comprehensive Review | Amara Zuffer et.al. | 2506.21899 | null |
2025-06-27 | A Survey of Continual Reinforcement Learning | Chaofan Pan et.al. | 2506.21872 | null |
2025-06-26 | Why Neural Network Can Discover Symbolic Structures with Gradient-based Training: An Algebraic and Geometric Foundation for Neurosymbolic Reasoning | Peihao Wang et.al. | 2506.21797 | null |
2025-06-26 | Continual Self-Supervised Learning with Masked Autoencoders in Remote Sensing | Lars Möllenbrok et.al. | 2506.21312 | null |
2025-06-26 | DuET: Dual Incremental Object Detection via Exemplar-Free Task Arithmetic | Munish Monga et.al. | 2506.21260 | null |
2025-06-26 | CL-Splats: Continual Learning of Gaussian Splatting with Local Optimization | Jan Ackermann et.al. | 2506.21117 | null |
2025-06-26 | Little By Little: Continual Learning via Self-Activated Sparse Mixture-of-Rank Adaptive Learning | Haodong Lu et.al. | 2506.21035 | null |
2025-06-25 | A Survey of AI for Materials Science: Foundation Models, LLM Agents, Datasets, and Tools | Minh-Hao Van et.al. | 2506.20743 | null |
2025-06-24 | Leveraging Lightweight Generators for Memory Efficient Continual Learning | Christiaan Lamers et.al. | 2506.19692 | null |
2025-06-24 | ChordPrompt: Orchestrating Cross-Modal Prompt Synergy for Multi-Domain Incremental Learning in CLIP | Zhiyuan Wang et.al. | 2506.19608 | null |
2025-06-24 | Dialogic Pedagogy for Large Language Models: Aligning Conversational AI with Proven Theories of Learning | Russell Beale et.al. | 2506.19484 | null |
2025-06-22 | Leveraging Large Language Model for Intelligent Log Processing and Autonomous Debugging in Cloud AI Platforms | Cheng Ji et.al. | 2506.17900 | null |
2025-06-21 | Pathway-based Progressive Inference (PaPI) for Energy-Efficient Continual Learning | Suyash Gaurav et.al. | 2506.17848 | null |
2025-06-21 | VLA-OS: Structuring and Dissecting Planning Representations and Paradigms in Vision-Language-Action Models | Chongkai Gao et.al. | 2506.17561 | null |
2025-06-20 | Continual Learning with Columnar Spiking Neural Networks | Denis Larionov et.al. | 2506.17169 | null |
2025-06-20 | The Importance of Being Lazy: Scaling Limits of Continual Learning | Jacopo Graldi et.al. | 2506.16884 | null |
2025-06-19 | Energy-Based Transfer for Reinforcement Learning | Zeyun Deng et.al. | 2506.16590 | null |
2025-06-19 | Weight Factorization and Centralization for Continual Learning in Speech Recognition | Enes Yavuz Ugan et.al. | 2506.16574 | null |
2025-06-19 | From LLM-anation to LLM-orchestrator: Coordinating Small Models for Data Labeling | Yao Lu et.al. | 2506.16393 | null |
2025-06-18 | Task-Agnostic Experts Composition for Continual Learning | Luigi Quarantiello et.al. | 2506.15566 | null |
2025-06-18 | An efficient forgetting-aware fine-tuning framework for pretrained universal machine-learning interatomic potentials | Jisu Kim et.al. | 2506.15223 | link |
2025-06-17 | MEAL: A Benchmark for Continual Multi-Agent Reinforcement Learning | Tristan Tomilin et.al. | 2506.14990 | link |
2025-06-19 | A Comprehensive Survey on Continual Learning in Generative Models | Haiyang Guo et.al. | 2506.13045 | link |
2025-06-14 | Branch, or Layer? Zeroth-Order Optimization for Continual Learning of Vision-Language Models | Ziwei Liu et.al. | 2506.12409 | null |
2025-06-14 | EKPC: Elastic Knowledge Preservation and Compensation for Class-Incremental Learning | Huaijie Wang et.al. | 2506.12351 | null |
2025-06-13 | Quantum-Inspired Differentiable Integral Neural Networks (QIDINNs): A Feynman-Based Architecture for Continuous Learning Over Streaming Data | Oscar Boullosa Dapena et.al. | 2506.12111 | null |
2025-06-13 | Dynamic Mixture of Curriculum LoRA Experts for Continual Multimodal Instruction Tuning | Chendi Ge et.al. | 2506.11672 | null |
2025-06-10 | Trustworthy AI for Medicine: Continuous Hallucination Detection and Elimination with CHECK | Carlos Garcia-Fernandez et.al. | 2506.11129 | null |
2025-06-12 | Continual Hyperbolic Learning of Instances and Classes | Melika Ayoughi et.al. | 2506.10710 | null |
2025-06-13 | Saturation Self-Organizing Map | Igor Urbanik et.al. | 2506.10680 | link |
2025-06-12 | EXPEREPAIR: Dual-Memory Enhanced LLM-based Repository-Level Program Repair | Fangwen Mu et.al. | 2506.10484 | link |
2025-06-12 | TreeLoRA: Efficient Continual Learning via Layer-Wise LoRAs Guided by a Hierarchical Gradient-Similarity Tree | Yu-Yang Qian et.al. | 2506.10355 | link |
2025-06-11 | Analytic Task Scheduler: Recursive Least Squares Based Method for Continual Learning in Embodied Foundation Models | Lipei Xie et.al. | 2506.09623 | null |
2025-06-11 | ErrorEraser: Unlearning Data Bias for Improved Continual Learning | Xuemei Cao et.al. | 2506.09347 | null |
2025-06-10 | Online Learning Control Strategies for Industrial Processes with Application for Loosening and Conditioning | Yue Wu et.al. | 2506.08983 | null |
2025-06-13 | LLaVA-c: Continual Improved Visual Instruction Tuning | Wenzhuo Liu et.al. | 2506.08666 | null |
2025-06-10 | Hierarchical Neural Collapse Detection Transformer for Class Incremental Object Detection | Duc Thanh Pham et.al. | 2506.08562 | null |
2025-06-09 | SHIELD: Secure Hypernetworks for Incremental Expansion Learning Defense | Patryk Krukowski et.al. | 2506.08255 | null |
2025-06-09 | Cognitive Weave: Synthesizing Abstracted Knowledge with a Spatio-Temporal Resonance Graph | Akash Vishwakarma et.al. | 2506.08098 | link |
2025-06-09 | DPFormer: Dynamic Prompt Transformer for Continual Learning | Sheng-Kai Huang et.al. | 2506.07414 | null |
2025-06-07 | Contextual Experience Replay for Self-Improvement of Language Agents | Yitao Liu et.al. | 2506.06698 | null |
2025-06-07 | Breaking Data Silos: Towards Open and Scalable Mobility Foundation Models via Generative Continual Learning | Yuan Yuan et.al. | 2506.06694 | null |
2025-06-07 | Non-Intrusive Load Monitoring Based on Image Load Signatures and Continual Learning | Olimjon Toirov et.al. | 2506.06637 | null |
2025-06-06 | Optimal Rates in Continual Linear Regression via Increasing Regularization | Ran Levinstein et.al. | 2506.06501 | null |
2025-06-06 | Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning | Yuheng Lei et.al. | 2506.05985 | link |
2025-06-06 | Action-Adaptive Continual Learning: Enabling Policy Generalization under Dynamic Action Spaces | Chaofan Pan et.al. | 2506.05702 | null |
2025-06-05 | MLLM-CL: Continual Learning for Multimodal Large Language Models | Hongbo Zhao et.al. | 2506.05453 | null |
2025-06-05 | Federated Isolation Forest for Efficient Anomaly Detection on Edge IoT Systems | Pavle Vasiljevic et.al. | 2506.05138 | null |
2025-06-05 | Lifelong Evolution: Collaborative Learning between Large and Small Language Models for Continuous Emergent Fake News Detection | Ziyi Zhou et.al. | 2506.04739 | null |
2025-06-05 | Hierarchical-Task-Aware Multi-modal Mixture of Incremental LoRA Experts for Embodied Continual Learning | Ziqi Jia et.al. | 2506.04595 | null |
2025-06-04 | The Latent Space Hypothesis: Toward Universal Medical Representation Learning | Salil Patel et.al. | 2506.04515 | null |
2025-06-04 | Replay Can Provably Increase Forgetting | Yasaman Mahdaviyeh et.al. | 2506.04377 | null |
2025-06-04 | A Generative Adaptive Replay Continual Learning Model for Temporal Knowledge Graph Reasoning | Zhiyu Zhang et.al. | 2506.04083 | null |
2025-06-05 | Adapt before Continual Learning | Aojun Lu et.al. | 2506.03956 | link |
2025-06-05 | Rethinking the Stability-Plasticity Trade-off in Continual Learning from an Architectural Perspective | Aojun Lu et.al. | 2506.03951 | link |
2025-06-03 | The Future of Continual Learning in the Era of Foundation Models: Three Key Directions | Jack Bell et.al. | 2506.03320 | null |
2025-05-30 | Continual Learning in Vision-Language Models via Aligned Model Merging | Ghada Sokar et.al. | 2506.03189 | null |
2025-06-03 | Learned Controllers for Agile Quadrotors in Pursuit-Evasion Games | Alejandro Sanchez Roncero et.al. | 2506.02849 | null |
2025-06-01 | EWGN: Elastic Weight Generation and Context Switching in Deep Learning | Shriraj P. Sawant et.al. | 2506.02065 | null |
2025-06-02 | Class Incremental Learning for Algorithm Selection | Mate Botond Nemeth et.al. | 2506.01545 | null |
2025-06-01 | Continual-MEGA: A Large-scale Benchmark for Generalizable Continual Anomaly Detection | Geonu Lee et.al. | 2506.00956 | null |
2025-06-01 | Adaptive, Efficient and Fair Resource Allocation in Cloud Datacenters leveraging Weighted A3C Deep Reinforcement Learning | Suchi Kumari et.al. | 2506.00929 | null |
2025-05-31 | Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn | Hongyao Tang et.al. | 2506.00592 | null |
2025-05-31 | Flashbacks to Harmonize Stability and Plasticity in Continual Learning | Leila Mahmoodi et.al. | 2506.00477 | null |
2025-05-31 | iDPA: Instance Decoupled Prompt Attention for Incremental Medical Object Detection | Huahui Yi et.al. | 2506.00406 | null |
2025-05-30 | Unlocking the Power of Rehearsal in Continual Learning: A Theoretical Perspective | Junze Deng et.al. | 2506.00205 | null |
2025-05-30 | Chameleon: A MatMul-Free Temporal Convolutional Network Accelerator for End-to-End Few-Shot and Continual Learning from Sequential Data | Douwe den Blanken et.al. | 2505.24852 | link |
2025-05-30 | CL-LoRA: Continual Low-Rank Adaptation for Rehearsal-Free Class-Incremental Learning | Jiangpeng He et.al. | 2505.24816 | link |
2025-05-30 | Rehearsal with Auxiliary-Informed Sampling for Audio Deepfake Detection | Falih Gozi Febrinanto et.al. | 2505.24486 | null |
2025-05-30 | When Large Multimodal Models Confront Evolving Knowledge: Challenges and Pathways | Kailin Jiang et.al. | 2505.24449 | link |
2025-05-30 | Rethinking Continual Learning with Progressive Neural Collapse | Zheng Wang et.al. | 2505.24254 | null |
2025-05-29 | BIRD: Behavior Induction via Representation-structure Distillation | Galen Pogoncheff et.al. | 2505.23933 | null |
2025-05-29 | LADA: Scalable Label-Specific CLIP Adapter for Continual Learning | Mao-Lin Luo et.al. | 2505.23271 | link |
2025-05-28 | IRS: Incremental Relationship-guided Segmentation for Digital Pathology | Ruining Deng et.al. | 2505.22855 | link |
2025-06-04 | MAC-Gaze: Motion-Aware Continual Calibration for Mobile Gaze Tracking | Yaxiong Lei et.al. | 2505.22769 | null |
2025-05-28 | Less, but Better: Efficient Multilingual Expansion for LLMs via Layer-wise Mixture-of-Experts | Xue Zhang et.al. | 2505.22582 | null |
2025-05-28 | Efficient Precision-Scalable Hardware for Microscaling (MX) Processing in Robotics Learning | Stef Cuyckens et.al. | 2505.22404 | null |
2025-05-29 | Train with Perturbation, Infer after Merging: A Two-Stage Framework for Continual Learning | Haomiao Qiu et.al. | 2505.22389 | null |
2025-05-29 | SplitLoRA: Balancing Stability and Plasticity in Continual Learning Through Gradient Space Splitting | Haomiao Qiu et.al. | 2505.22370 | null |
2025-05-28 | Budget-Adaptive Adapter Tuning in Orthogonal Subspaces for Continual Learning in LLMs | Zhiyi Wan et.al. | 2505.22358 | null |
2025-05-28 | Versatile Cardiovascular Signal Generation with a Unified Diffusion Transformer | Zehua Chen et.al. | 2505.22306 | null |
2025-05-28 | Apax: A Flexible and Performant Framework For The Development of Machine-Learned Interatomic Potentials | Moritz René Schäfer et.al. | 2505.22168 | null |
2025-05-28 | Efficiently Enhancing General Agents With Hierarchical-categorical Memory | Changze Qiao et.al. | 2505.22006 | null |
2025-05-28 | Continual Learning Beyond Experience Rehearsal and Full Model Surrogates | Prashant Bhat et.al. | 2505.21942 | null |
2025-05-27 | M3S-UPD: Efficient Multi-Stage Self-Supervised Learning for Fine-Grained Encrypted Traffic Classification with Unknown Pattern Discovery | Yali Yuan et.al. | 2505.21462 | null |
2025-05-28 | Understanding the behavior of representation forgetting in continual learning | Joonkyu Kim et.al. | 2505.20970 | null |
2025-05-27 | Continual Learning on CLIP via Incremental Prompt Tuning with Intrinsic Textual Anchors | Haodong Lu et.al. | 2505.20680 | null |
2025-05-26 | Hierarchical Bayesian estimation for continual learning during model-informed precision dosing | Franziska Thoma et.al. | 2505.20240 | null |
2025-05-26 | Continuous Learning for Children’s ASR: Overcoming Catastrophic Forgetting with Elastic Weight Consolidation and Synaptic Intelligence | Edem Ahadzi et.al. | 2505.20216 | null |
2025-05-28 | Data-Distill-Net: A Data Distillation Approach Tailored for Reply-based Continual Learning | Wenyang Liao et.al. | 2505.20135 | null |
2025-05-26 | Beyond Freezing: Sparse Tuning Enhances Plasticity in Continual Learning with Pre-Trained Models | Huan Zhang et.al. | 2505.19943 | link |
2025-05-26 | Cut out and Replay: A Simple yet Versatile Strategy for Multi-Label Online Continual Learning | Xinrui Wang et.al. | 2505.19680 | link |
2025-05-27 | STRAP: Spatio-Temporal Pattern Retrieval for Out-of-Distribution Generalization | Haoyu Zhang et.al. | 2505.19547 | null |
2025-05-26 | MM-Prompt: Cross-Modal Prompt Tuning for Continual Visual Question Answering | Xu Li et.al. | 2505.19455 | link |
2025-05-25 | A General Theory of Growth, Employment, and Technological Change: Experiential Matrix Theory and the Transition from GDP to Humanist Experiential Growth in the Age of Artificial Intelligence | Christian Callaghan et.al. | 2505.19045 | null |
2025-05-24 | Can LLMs Alleviate Catastrophic Forgetting in Graph Continual Learning? A Systematic Study | Ziyang Cheng et.al. | 2505.18697 | link |
2025-05-24 | Exemplar-Free Continual Learning for State Space Models | Isaac Ning Lee et.al. | 2505.18604 | null |
2025-05-24 | Learning without Isolation: Pathway Protection for Continual Learning | Zhikang Chen et.al. | 2505.18568 | link |
2025-05-24 | Knowledge Grafting of Large Language Models | Guodong Du et.al. | 2505.18502 | link |
2025-05-23 | CarbonFlex: Enabling Carbon-aware Provisioning and Scheduling for Cloud Clusters | Walid A. Hanafy et.al. | 2505.18357 | null |
2025-05-23 | Dynamic Dual Buffer with Divide-and-Conquer Strategy for Online Continual Learning | Congren Dai et.al. | 2505.18101 | null |
2025-05-23 | Evolving Machine Learning: A Survey | Ignacio Cabrera Martin et.al. | 2505.17902 | null |
2025-05-23 | What is the role of memorization in Continual Learning? | Jędrzej Kozal et.al. | 2505.17664 | null |
2025-05-23 | Analyzing Mitigation Strategies for Catastrophic Forgetting in End-to-End Training of Spoken Language Models | Chi-Yuan Hsiao et.al. | 2505.17496 | null |
2025-05-23 | Towards Heterogeneous Continual Graph Learning via Meta-knowledge Distillation | Guiquan Sun et.al. | 2505.17458 | null |
2025-05-22 | LiloDriver: A Lifelong Learning Framework for Closed-loop Motion Planning in Long-tail Autonomous Driving Scenarios | Huaiyuan Yao et.al. | 2505.17209 | link |
2025-05-22 | Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning | Jiaru Zou et.al. | 2505.16270 | link |
2025-05-21 | Bayesian Ensembling: Insights from Online Optimization and Empirical Bayes | Daniel Waxman et.al. | 2505.15638 | link |
2025-05-21 | Exploring LLM-Generated Feedback for Economics Essays: How Teaching Assistants Evaluate and Envision Its Use | Xinyi Lu et.al. | 2505.15596 | link |
2025-05-21 | Gated Integration of Low-Rank Adaptation for Continual Learning of Language Models | Yan-Shuo Liang et.al. | 2505.15424 | link |
2025-05-21 | A Unified Gradient-based Framework for Task-agnostic Continual Learning-Unlearning | Zhehao Huang et.al. | 2505.15178 | null |
2025-05-20 | UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Large Language Models | Xiaojie Gu et.al. | 2505.14679 | link |
2025-05-20 | Listen, Analyze, and Adapt to Learn New Attacks: An Exemplar-Free Class Incremental Learning Method for Audio Deepfake Source Tracing | Yang Xiao et.al. | 2505.14601 | null |
2025-05-20 | Contrastive Consolidation of Top-Down Modulations Achieves Sparsely Supervised Continual Learning | Viet Anh Khoa Tran et.al. | 2505.14125 | null |
2025-05-22 | Place Recognition: A Comprehensive Review, Current Challenges and Future Directions | Zhenyu Li et.al. | 2505.14068 | link |
2025-05-20 | StPR: Spatiotemporal Preservation and Routing for Exemplar-Free Video Class-Incremental Learning | Huaijie Wang et.al. | 2505.13997 | null |
2025-05-19 | FlexFed: Mitigating Catastrophic Forgetting in Heterogeneous Federated Learning in Pervasive Computing Environments | Sara Alosaime et.al. | 2505.13576 | null |
2025-05-19 | LiBOG: Lifelong Learning for Black-Box Optimizer Generation | Jiyuan Pei et.al. | 2505.13025 | link |
2025-05-18 | Scalable Strategies for Continual Learning with Replay | Truman Hickok et.al. | 2505.12512 | null |
2025-05-18 | AFCL: Analytic Federated Continual Learning for Spatio-Temporal Invariance of Non-IID Data | Jianheng Tang et.al. | 2505.12245 | null |
2025-05-18 | ACU: Analytic Continual Unlearning for Efficient and Exact Forgetting with Privacy Preservation | Jianheng Tang et.al. | 2505.12239 | null |
2025-05-17 | Growable and Interpretable Neural Control with Online Continual Learning for Autonomous Lifelong Locomotion Learning Machines | Arthicha Srisuchinnawong et.al. | 2505.12029 | link |
2025-05-17 | Parameter Efficient Continual Learning with Dynamic Low-Rank Adaptation | Prashant Shivaram Bhat et.al. | 2505.11998 | null |
2025-05-17 | LifelongAgentBench: Evaluating LLM Agents as Lifelong Learners | Junhao Zheng et.al. | 2505.11942 | link |
2025-05-17 | How can Diffusion Models Evolve into Continual Generators? | Jingren Liu et.al. | 2505.11936 | null |
2025-05-17 | MINGLE: Mixtures of Null-Space Gated Low-Rank Experts for Test-Time Continual Model Merging | Zihuan Qiu et.al. | 2505.11883 | null |
2025-05-17 | AnalyticKWS: Towards Exemplar-Free Analytic Class Incremental Learning for Small-footprint Keyword Spotting | Yang Xiao et.al. | 2505.11817 | null |
2025-05-17 | Continuous Subspace Optimization for Continual Learning | Quan Cheng et.al. | 2505.11816 | null |
2025-05-17 | CL-BioGAN: Biologically-Inspired Cross-Domain Continual Learning for Hyperspectral Anomaly Detection | Jianing Wang et.al. | 2505.11796 | null |
2025-05-17 | CL-CaGAN: Capsule differential adversarial continuous learning for cross-domain hyperspectral anomaly detection | Jianing Wang et.al. | 2505.11793 | null |
2025-05-16 | Questioning Representational Optimism in Deep Learning: The Fractured Entangled Representation Hypothesis | Akarsh Kumar et.al. | 2505.11581 | link |
2025-05-16 | Multi-Modal Multi-Task (M3T) Federated Foundation Models for Embodied AI: Potentials and Challenges for Edge Integration | Kasra Borazjani et.al. | 2505.11191 | null |
2025-05-16 | Privacy-Aware Lifelong Learning | Ozan Özdenizci et.al. | 2505.10941 | link |
2025-05-15 | A Conformal Predictive Measure for Assessing Catastrophic Forgetting | Ioannis Pitsiorlas et.al. | 2505.10677 | null |
2025-05-15 | Advancing Multiple Instance Learning with Continual Learning for Whole Slide Imaging | Xianrui Li et.al. | 2505.10649 | null |
2025-05-15 | Incorporating brain-inspired mechanisms for multimodal learning in artificial intelligence | Xiang He et.al. | 2505.10176 | link |
2025-05-15 | ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts | Jing-Cheng Pang et.al. | 2505.10010 | link |
2025-05-15 | Task-Core Memory Management and Consolidation for Long-term Continual Learning | Tianyu Huai et.al. | 2505.09952 | null |
2025-05-15 | Reinforced Interactive Continual Learning via Real-time Noisy Human Feedback | Yutao Yang et.al. | 2505.09925 | null |
2025-05-14 | Preserving Plasticity in Continual Learning with Adaptive Linearity Injection | Seyed Roozbeh Razavi Rohani et.al. | 2505.09486 | null |
2025-05-13 | Continual Reinforcement Learning via Autoencoder-Driven Task and New Environment Recognition | Zeki Doruk Erden et.al. | 2505.09003 | null |
2025-05-13 | Enhancing Software Development with Context-Aware Conversational Agents: A User Study on Developer Interactions with Chatbots | Glaucia Melo et.al. | 2505.08648 | null |
2025-05-13 | PrePrompt: Predictive prompting for class incremental learning | Libo Huang et.al. | 2505.08586 | link |
2025-05-13 | GradMix: Gradient-based Selective Mixup for Robust Data Augmentation in Class-Incremental Learning | Minsu Kim et.al. | 2505.08528 | null |
2025-05-13 | Attention-based Generative Latent Replay: A Continual Learning Approach for WSI Analysis | Pratibha Kumari et.al. | 2505.08524 | null |
2025-05-13 | Low-Complexity Inference in Continual Learning via Compressed Knowledge Transfer | Zhenrong Liu et.al. | 2505.08327 | null |
2025-05-12 | Beyond CLIP Generalization: Against Forward&Backward Forgetting Adapter for Continual Learning of Vision-Language Models | Songlin Dong et.al. | 2505.07690 | null |
2025-05-16 | Prototype Augmented Hypernetworks for Continual Learning | Neil De La Fuente et.al. | 2505.07450 | null |
2025-05-12 | Adaptive, Robust and Scalable Bayesian Filtering for Online Learning | Gerardo Duran-Martin et.al. | 2505.07267 | null |
2025-05-12 | Ranking-aware Continual Learning for LiDAR Place Recognition | Xufei Wang et.al. | 2505.07198 | null |
2025-05-11 | Replay-Based Continual Learning with Dual-Layered Distillation and a Streamlined U-Net for Efficient Text-to-Image Generation | Md. Naimur Asif Borno et.al. | 2505.06995 | null |
2025-05-09 | Neuro-Symbolic Concepts | Jiayuan Mao et.al. | 2505.06191 | null |
2025-05-09 | CogniSNN: A First Exploration to Random Graph Architecture based Spiking Neural Networks with Enhanced Expandability and Neuroplasticity | Yongsheng Huang et.al. | 2505.05992 | null |
2025-05-09 | Elastic Weight Consolidation for Full-Parameter Continual Pre-Training of Gemma2 | Vytenis Šliogeris et.al. | 2505.05946 | null |
2025-05-08 | MARK: Memory Augmented Refinement of Knowledge | Anish Ganguli et.al. | 2505.05177 | null |
2025-05-09 | Replay to Remember (R2R): An Efficient Uncertainty-driven Unsupervised Continual Learning Framework Using Generative Replay | Sriram Mandalika et.al. | 2505.04787 | null |
2025-05-01 | AI-Driven IRM: Transforming insider risk management with adaptive scoring and LLM-based threat detection | Lokesh Koli et.al. | 2505.03796 | null |
2025-05-05 | Efficient Continual Learning in Keyword Spotting using Binary Neural Networks | Quynh Nguyen-Phuong Vu et.al. | 2505.02469 | null |
2025-05-04 | Lifelong Whole Slide Image Analysis: Online Vision-Language Adaptation and Past-to-Present Gradient Distillation | Doanh C. Bui et.al. | 2505.01984 | null |
2025-05-02 | Monitoring morphometric drift in lifelong learning segmentation of the spinal cord | Enamundram Naga Karthik et.al. | 2505.01364 | null |
2025-05-02 | Fast and Low-Cost Genomic Foundation Models via Outlier Removal | Haozheng Luo et.al. | 2505.00598 | link |
2025-05-01 | SacFL: Self-Adaptive Federated Continual Learning for Resource-Constrained End Devices | Zhengyi Zhong et.al. | 2505.00365 | link |
2025-04-30 | Birdie: Natural Language-Driven Table Discovery Using Differentiable Search Index | Yuxiang Guo et.al. | 2504.21282 | null |
2025-04-30 | Memorization and Knowledge Injection in Gated LLMs | Xu Pan et.al. | 2504.21239 | null |
2025-04-29 | Legilimens: Performant Video Analytics on the System-on-Chip Edge | Murali Ramanujam et.al. | 2504.21136 | null |
2025-04-29 | Improvements of Dark Experience Replay and Reservoir Sampling towards Better Balance between Consolidation and Plasticity | Taisuke Kobayashi et.al. | 2504.20932 | null |
2025-04-29 | Partitioned Memory Storage Inspired Few-Shot Class-Incremental learning | Renye Zhang et.al. | 2504.20797 | null |
2025-04-28 | FedCCL: Federated Clustered Continual Learning Framework for Privacy-focused Energy Forecasting | Michael A. Helcig et.al. | 2504.20282 | null |
2025-04-27 | Personalized Artificial General Intelligence (AGI) via Neuroscience-Inspired Continuous Learning Systems | Rajeev Gupta et.al. | 2504.20109 | null |
2025-04-28 | Mitigating Catastrophic Forgetting in the Incremental Learning of Medical Images | Sara Yavari et.al. | 2504.20033 | null |
2025-04-28 | JailbreaksOverTime: Detecting Jailbreak Attacks Under Distribution Shift | Julien Piet et.al. | 2504.19440 | link |
2025-04-24 | QuantBench: Benchmarking AI Methods for Quantitative Investment | Saizhuo Wang et.al. | 2504.18600 | null |
2025-04-25 | Action Flow Matching for Continual Robot Learning | Alejandro Murillo-Gonzalez et.al. | 2504.18471 | link |
2025-04-25 | Enhancing Pre-Trained Model-Based Class-Incremental Learning through Neural Collapse | Kun He et.al. | 2504.18437 | null |
2025-04-25 | PropRAG: Guiding Retrieval with Beam Search over Proposition Paths | Jingjin Wang et.al. | 2504.18070 | null |
2025-04-25 | POET: Prompt Offset Tuning for Continual Human Action Adaptation | Prachi Garg et.al. | 2504.18059 | null |
2025-04-25 | Memory Reviving, Continuing Learning and Beyond: Evaluation of Pre-trained Encoders and Decoders for Multimodal Machine Translation | Zhuang Yu et.al. | 2504.18012 | null |
2025-04-24 | Mathematics of Continual Learning | Liangzu Peng et.al. | 2504.17963 | null |
2025-04-24 | Replay to Remember: Retaining Domain Knowledge in Streaming Language Models | Sneh Pillai et.al. | 2504.17780 | null |
2025-04-24 | Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning | Mingqi Yuan et.al. | 2504.17490 | null |
2025-04-24 | Perturbed Gradient Descent via Convex Quadratic Approximation for Nonconvex Bilevel Optimization | Nazanin Abolfazli et.al. | 2504.17215 | null |
2025-04-23 | Social sustainability through engagement in a training context with tools such as the Native Podcast and Facebook social network | Danielle Mbambe Bebey et.al. | 2504.16964 | null |
2025-04-23 | Noise-Tolerant Coreset-Based Class Incremental Continual Learning | Edison Mucllari et.al. | 2504.16763 | null |
2025-04-23 | Dynamic Time-aware Continual User Representation Learning | Seungyoon Choi et.al. | 2504.16501 | null |
2025-04-22 | ZeroSlide: Is Zero-Shot Classification Adequate for Lifelong Learning in Whole-Slide Image Analysis in the Era of Pathology Vision-Language Foundation Models? | Doanh C. Bui et.al. | 2504.15627 | null |
2025-04-22 | Few-Shot Vision-Language Action-Incremental Policy Learning | Mingchen Song et.al. | 2504.15517 | null |
2025-04-21 | Bayesian Federated Learning for Continual Training | Usevalad Milasheuski et.al. | 2504.15328 | null |
2025-04-21 | Single-loop Algorithms for Stochastic Non-convex Optimization with Weakly-Convex Constraints | Ming Yang et.al. | 2504.15243 | null |
2025-04-21 | Position: Bayesian Statistics Facilitates Stakeholder Participation in Evaluation of Generative AI | Yanan Long et.al. | 2504.15211 | null |
2025-04-20 | Semi-parametric Memory Consolidation: Towards Brain-like Deep Continual Learning | Geng Liu et.al. | 2504.14727 | null |
2025-04-20 | Evaluating Temporal Plasticity in Foundation Time Series Models for Incremental Fine-tuning | Jia Liu et.al. | 2504.14677 | null |
2025-04-20 | Phoenix: A Motion-based Self-Reflection Framework for Fine-grained Robotic Action Correction | Wenke Xia et.al. | 2504.14588 | link |
2025-04-20 | Meta-Thinking in LLMs via Multi-Agent Reinforcement Learning: A Survey | Ahsan Bilal et.al. | 2504.14520 | null |
2025-04-18 | Parameter-Efficient Continual Fine-Tuning: A Survey | Eric Nuertey Coleman et.al. | 2504.13822 | null |
2025-04-18 | MEGA: Second-Order Gradient Alignment for Catastrophic Forgetting Mitigation in GFSCIL | Jinhui Pang et.al. | 2504.13691 | null |
2025-04-18 | Bayesian continual learning and forgetting in neural networks | Djohan Bonnet et.al. | 2504.13569 | link |
2025-04-18 | LoRA-Based Continual Learning with Constraints on Critical Parameter Changes | Shimou Ling et.al. | 2504.13407 | link |
2025-04-18 | A mean teacher algorithm for unlearning of language models | Yegor Klochkov et.al. | 2504.13388 | link |
2025-04-17 | Convergence and Implicit Bias of Gradient Descent on Continual Linear Classification | Hyunji Jung et.al. | 2504.12712 | null |
2025-04-16 | Continual Learning Strategies for 3D Engineering Regression Problems: A Benchmarking Study | Kaira M. Samuel et.al. | 2504.12503 | link |
2025-04-16 | Lifelong and Universal Machine Learning Potentials for Chemical Reaction Network Explorations | Marco Eckhoff et.al. | 2504.11933 | null |
2025-04-15 | MULTI-LF: A Unified Continuous Learning Framework for Real-Time DDoS Detection in Multi-Environment Networks | Furqan Rustam et.al. | 2504.11575 | null |
2025-04-17 | Adaptive Decision Boundary for Few-Shot Class-Incremental Learning | Linhao Li et.al. | 2504.10976 | link |
2025-04-14 | Adaptive Synaptogenesis Implemented on a Nanomagnetic Platform | Faiyaz Elahi Mullick et.al. | 2504.10767 | null |
2025-04-16 | Self-Controlled Dynamic Expansion Model for Continual Learning | Runqing Wu et.al. | 2504.10561 | null |
2025-04-14 | Continual learning for rotating machinery fault diagnosis with cross-domain environmental and operational variations | Diogo Risca et.al. | 2504.10151 | null |
2025-04-16 | BoTTA: Benchmarking on-device Test Time Adaptation | Michal Danilowski et.al. | 2504.10149 | null |
2025-04-13 | How new data permeates LLM knowledge and how to dilute it | Chen Sun et.al. | 2504.09522 | null |
2025-04-13 | Tin-Tin: Towards Tiny Learning on Tiny Devices with Integer-based Neural Network Training | Yi Hu et.al. | 2504.09405 | null |
2025-04-17 | CAShift: Benchmarking Log-Based Cloud Attack Detection under Normality Shift | Jiongchi Yu et.al. | 2504.09115 | link |
2025-04-11 | Adaptive Additive Parameter Updates of Vision Transformers for Few-Shot Continual Learning | Kyle Stein et.al. | 2504.08982 | null |
2025-04-11 | Diachronic and synchronic variation in the performance of adaptive machine learning systems: The ethical challenges | Joshua Hatherley et.al. | 2504.08861 | null |
2025-04-09 | FM-LoRA: Factorized Low-Rank Meta-Prompting for Continual Learning | Xiaobing Yu et.al. | 2504.08823 | null |
2025-04-14 | Task-conditioned Ensemble of Expert Models for Continuous Learning | Renu Sharma et.al. | 2504.08626 | link |
2025-04-11 | Enhancing knowledge retention for continual learning with domain-specific adapters and features gating | Mohamed Abbas Hedjazi et.al. | 2504.08613 | null |
2025-04-11 | Boosting-inspired online learning with transfer for railway maintenance | Diogo Risca et.al. | 2504.08554 | null |
2025-04-11 | Proxy-Anchor and EVT-Driven Continual Learning Method for Generalized Category Discovery | Alireza Fathalizadeh et.al. | 2504.08550 | link |
2025-04-11 | Explainability and Continual Learning meet Federated Learning at the Network Edge | Thomas Tsouparopoulos et.al. | 2504.08536 | null |
2025-04-11 | CMIP-CIL: A Cross-Modal Benchmark for Image-Point Class Incremental Learning | Chao Qi et.al. | 2504.08422 | link |
2025-04-10 | Rethinking the Foundations for Continual Reinforcement Learning | Michael Bowling et.al. | 2504.08161 | null |
2025-04-10 | LoRAX: LoRA eXpandable Networks for Continual Synthetic Image Attribution | Danielle Sullivan-Pao et.al. | 2504.08149 | link |
2025-04-10 | LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation | Juzheng Zhang et.al. | 2504.07448 | link |
2025-04-09 | Prototype-Based Continual Learning with Label-free Replay Buffer and Cluster Preservation Loss | Agil Aghasanli et.al. | 2504.07240 | null |
2025-04-09 | Sculpting Subspaces: Constrained Full Fine-Tuning in LLMs for Continual Learning | Nikhil Shivakumar Nayak et.al. | 2504.07097 | link |
2025-04-09 | SEE: Continual Fine-tuning with Sequential Ensemble of Experts | Zhilin Wang et.al. | 2504.06664 | link |
2025-04-09 | DUKAE: DUal-level Knowledge Accumulation and Ensemble for Pre-Trained Model-Based Continual Learning | Songze Li et.al. | 2504.06521 | null |
2025-04-08 | Meta-Continual Learning of Neural Fields | Seungyoon Woo et.al. | 2504.05806 | null |
2025-04-08 | Continual Learning of Multiple Cognitive Functions with Brain-inspired Temporal Development Mechanism | Bing Han et.al. | 2504.05621 | null |
2025-04-07 | Embodied Perception for Test-time Grasping Detection Adaptation with Knowledge Infusion | Jin Liu et.al. | 2504.04795 | null |
2025-04-06 | Better Rates for Random Task Orderings in Continual Linear Models | Itay Evron et.al. | 2504.04579 | null |
2025-04-05 | Memory-Statistics Tradeoff in Continual Learning with Structural Regularization | Haoran Li et.al. | 2504.04039 | null |
2025-04-04 | Outlook Towards Deployable Continual Learning for Particle Accelerators | Kishansingh Rajput et.al. | 2504.03793 | null |
2025-04-03 | BECAME: BayEsian Continual Learning with Adaptive Model MErging | Mei Li et.al. | 2504.02666 | null |
2025-04-03 | The Self-Learning Agent with a Progressive Neural Network Integrated Transformer | Ajay Sivakumar et.al. | 2504.02489 | null |
2025-04-03 | Distributed Log-driven Anomaly Detection System based on Evolving Decision Making | Zhuoran Tan et.al. | 2504.02322 | null |
2025-04-02 | TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining | Jeffrey Li et.al. | 2504.02107 | link |
2025-03-31 | Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems | Bang Liu et.al. | 2504.01990 | link |
2025-04-03 | AI-Driven Framework for Multi-Service Multi-Modal Devices in NextG ORAN Systems | Mrityunjoy Gain et.al. | 2504.01730 | null |
2025-04-01 | Catastrophic Forgetting in LLMs: A Comparative Analysis Across Language Tasks | Naimul Haque et.al. | 2504.01241 | null |
2025-04-01 | Gradient-free Continual Learning | Grzegorz Rypeść et.al. | 2504.01219 | null |
2025-04-01 | Energy Weighted Learning Progress Guided Interleaved Multi-Task Learning | Hanne Say et.al. | 2504.00707 | null |
2025-04-01 | Continual Cross-Modal Generalization | Yan Xia et.al. | 2504.00561 | null |
2025-03-31 | MetaCLBench: Meta Continual Learning Benchmark on Resource-Constrained Edge Devices | Sijia Li et.al. | 2504.00174 | null |
2025-03-31 | Advances in Continual Graph Learning for Anti-Money Laundering Systems: A Comprehensive Review | Bruno Deprez et.al. | 2503.24259 | link |
2025-03-30 | If an LLM Were a Character, Would It Know Its Own Story? Evaluating Lifelong Learning in LLMs | Siqi Fan et.al. | 2503.23514 | null |
2025-03-30 | Pareto Continual Learning: Preference-Conditioned Learning and Adaption for Dynamic Stability-Plasticity Trade-off | Song Lai et.al. | 2503.23390 | link |
2025-03-30 | Language Guided Concept Bottleneck Models for Interpretable Continual Learning | Lu Yu et.al. | 2503.23283 | link |
2025-03-29 | VLM-C4L: Continual Core Dataset Learning with Corner Case Optimization via Vision-Language Models for Autonomous Driving | Haibo Hu et.al. | 2503.23046 | null |
2025-03-28 | MediTools – Medical Education Powered by LLMs | Amr Alshatnawi et.al. | 2503.22769 | link |
2025-03-26 | Ancestral Mamba: Enhancing Selective Discriminant Space Model with Online Visual Prototype Learning for Efficient and Robust Discriminant Approach | Jiahao Qin et.al. | 2503.22729 | null |
2025-03-28 | Efficient Continual Learning through Frequency Decomposition and Integration | Ruiqi Liu et.al. | 2503.22175 | null |
2025-03-28 | Beyond Background Shift: Rethinking Instance Replay in Continual Semantic Segmentation | Hongmei Yin et.al. | 2503.22136 | link |
2025-03-28 | A Proposal for Networks Capable of Continual Learning | Zeki Doruk Erden et.al. | 2503.22068 | null |
2025-03-27 | Stochastic Engrams for Efficient Continual Learning with Binarized Neural Networks | Isabelle Aguilar et.al. | 2503.21436 | null |
2025-03-27 | LLaVA-CMoE: Towards Continual Mixture of Experts for Large Vision-Language Models | Hengyuan Zhao et.al. | 2503.21227 | null |
2025-03-27 | KAC: Kolmogorov-Arnold Classifier for Continual Learning | Yusong Hu et.al. | 2503.21076 | null |
2025-03-25 | Dynamic Allocation Hypernetwork with Adaptive Model Recalibration for Federated Continual Learning | Xiaoming Qi et.al. | 2503.20808 | link |
2025-03-26 | Continual learning via probabilistic exchangeable sequence modelling | Hanwen Xing et.al. | 2503.20725 | null |
2025-03-26 | IAP: Improving Continual Learning of Vision-Language Models via Instance-Aware Prompting | Hao Fu et.al. | 2503.20612 | link |
2025-03-26 | Modality-Independent Brain Lesion Segmentation with Privacy-aware Continual Learning | Yousef Sadegheih et.al. | 2503.20326 | link |
2025-03-25 | Experience Replay Addresses Loss of Plasticity in Continual Learning | Jiuqi Wang et.al. | 2503.20018 | null |
2025-03-25 | Continual Learning With Quasi-Newton Methods | Steven Vander Eeckt et.al. | 2503.19939 | null |
2025-03-25 | Domain-incremental White Blood Cell Classification with Privacy-aware Continual Learning | Pratibha Kumari et.al. | 2503.19819 | null |
2025-03-24 | Continual Reinforcement Learning for HVAC Systems Control: Integrating Hypernetworks and Transfer Learning | Gautham Udayakumar Bekal et.al. | 2503.19212 | null |
2025-03-23 | LoRA Subtraction for Drift-Resistant Space in Exemplar-Free Continual Learning | Xuan Liu et.al. | 2503.18985 | link |
2025-03-24 | Autonomous Generation of Sub-goals for Lifelong Learning in Robots | Emanuel Fallas Hernández et.al. | 2503.18914 | null |
2025-03-25 | Feature Calibration enhanced Parameter Synthesis for CLIP-based Class-incremental Learning | Juncen Guo et.al. | 2503.18672 | null |
2025-03-24 | Parental Guidance: Efficient Lifelong Learning through Evolutionary Distillation | Octi Zhang et.al. | 2503.18531 | null |
2025-03-24 | Global Convergence of Continual Learning on Non-IID Data | Fei Zhu et.al. | 2503.18511 | null |
2025-03-24 | Knowledge Graph Enhanced Generative Multi-modal Models for Class-Incremental Learning | Xusheng Cao et.al. | 2503.18403 | null |
2025-03-24 | Do Your Best and Get Enough Rest for Continual Learning | Hankyul Kang et.al. | 2503.18371 | link |
2025-03-23 | Dynamic Allocation Hypernetwork with Adaptive Model Recalibration for FCL | Xiaoming Qi et.al. | 2503.18064 | link |
2025-03-22 | Lifelong Evolution of Swarms | Lorenzo Leuzzi et.al. | 2503.17763 | null |
2025-03-22 | Reducing Class-wise Confusion for Incremental Learning with Disentangled Manifolds | Huitong Chen et.al. | 2503.17677 | link |
2025-03-21 | On-Device Federated Continual Learning on RISC-V-based Ultra-Low-Power SoC for Intelligent Nano-Drone Swarms | Lars Kröger et.al. | 2503.17436 | null |
2025-03-21 | Replay4NCL: An Efficient Memory Replay-based Methodology for Neuromorphic Continual Learning in Embedded AI Systems | Mishal Fatima Minhas et.al. | 2503.17061 | null |
2025-03-21 | Restoring Forgotten Knowledge in Non-Exemplar Class Incremental Learning through Test-Time Semantic Evolution | Haori Lu et.al. | 2503.16793 | null |
2025-03-18 | Adaptive Drift Compensation for Soft Sensorized Finger Using Continual Learning | Nilay Kushawaha et.al. | 2503.16540 | null |
2025-03-20 | Towards Automatic Continual Learning: A Self-Adaptive Framework for Continual Instruction Tuning | Peiyi Lin et.al. | 2503.15924 | null |
2025-03-19 | Technical Report for the 5th CLVision Challenge at CVPR: Addressing the Class-Incremental with Repetition using Unlabeled Data – 4th Place Solution | Panagiota Moraiti et.al. | 2503.15697 | link |
2025-03-19 | Federated Continual 3D Segmentation With Single-round Communication | Can Peng et.al. | 2503.15414 | null |
2025-03-19 | World Models in Artificial Intelligence: Sensing, Learning, and Reasoning Like a Child | Javier Del Ser et.al. | 2503.15168 | null |
2025-03-19 | Continual Contrastive Learning on Tabular Data with Out of Distribution | Achmad Ginanjar et.al. | 2503.15089 | null |
2025-03-19 | Continual Multimodal Contrastive Learning | Xiaohao Liu et.al. | 2503.14963 | null |
2025-03-19 | H2ST: Hierarchical Two-Sample Tests for Continual Out-of-Distribution Detection | Yuhang Liu et.al. | 2503.14832 | null |
2025-03-18 | FeNeC: Enhancing Continual Learning via Feature Clustering with Neighbor- or Logit-Based Classification | Kamil Książek et.al. | 2503.14301 | null |
2025-03-18 | Robust3D-CIL: Robust Class-Incremental Learning for 3D Perception | Jinge Ma et.al. | 2503.13869 | null |
2025-03-17 | Analytic Subspace Routing: How Recursive Least Squares Works in Continual Learning of Large Language Model | Kai Tong et.al. | 2503.13575 | null |
2025-03-17 | ProtoDepth: Unsupervised Continual Depth Completion with Prototypes | Patrick Rim et.al. | 2503.12745 | null |
2025-03-16 | A Continual Learning-driven Model for Accurate and Generalizable Segmentation of Clinically Comprehensive and Fine-grained Whole-body Anatomies in CT | Dazhou Guo et.al. | 2503.12698 | null |
2025-03-16 | Hybrid Learners Do Not Forget: A Brain-Inspired Neuro-Symbolic Approach to Continual Learning | Amin Banayeeanzade et.al. | 2503.12635 | null |
2025-03-15 | Ferret: An Efficient Online Continual Learning Framework under Varying Memory Constraints | Yuhao Zhou et.al. | 2503.12053 | null |
2025-03-13 | Safe Continual Domain Adaptation after Sim2Real Transfer of Reinforcement Learning Policies in Robotics | Josip Josifovski et.al. | 2503.10949 | null |
2025-03-12 | Enhanced Continual Learning of Vision-Language Models with Model Fusion | Haoyuan Gao et.al. | 2503.10705 | null |
2025-03-13 | Kolmogorov-Arnold Attention: Is Learnable Attention Better For Vision Transformers? | Subhajit Maity et.al. | 2503.10632 | null |
2025-03-13 | Sample Compression for Continual Learning | Jacob Comeau et.al. | 2503.10503 | null |
2025-03-13 | SCOOP: A Framework for Proactive Collaboration and Social Continual Learning through Natural Language Interaction and Causal Reasoning | Dimitri Ognibene et.al. | 2503.10241 | null |
2025-03-13 | StableFusion: Continual Video Retrieval via Frame Adaptation | Zecheng Zhao et.al. | 2503.10111 | link |
2025-03-13 | Semantic Synergy: Unlocking Policy Insights and Learning Pathways Through Advanced Skill Mapping | Phoebe Koundouri et.al. | 2503.10094 | null |
2025-03-12 | Freeze and Cluster: A Simple Baseline for Rehearsal-Free Continual Category Discovery | Chuyu Zhang et.al. | 2503.09106 | null |
2025-03-11 | SoTCKGE: Continual Knowledge Graph Embedding Based on Spatial Offset Transformation | Xinyan Wang et.al. | 2503.08189 | null |
2025-03-11 | Continual Learning for Multiple Modalities | Hyundong Jin et.al. | 2503.08064 | null |
2025-03-09 | WECAR: An End-Edge Collaborative Inference and Training Framework for WiFi-Based Continuous Human Activity Recognition | Rong Li et.al. | 2503.07669 | null |
2025-03-08 | Merge then Realign: Simple and Effective Modality-Incremental Continual Learning for Multimodal LLMs | Dingkun Zhang et.al. | 2503.07663 | null |
2025-03-10 | PTMs-TSCIL Pre-Trained Models Based Class-Incremental Learning | Yuanlong Wu et.al. | 2503.07153 | null |
2025-03-10 | A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications | Siyuan Mu et.al. | 2503.07137 | link |
2025-03-10 | Sequential Function-Space Variational Inference via Gaussian Mixture Approximation | Menghao Waiyan William Zhu et.al. | 2503.07114 | link |
2025-03-10 | Towards Experience Replay for Class-Incremental Learning in Fully-Binary Networks | Yanis Basso-Bert et.al. | 2503.07107 | null |
2025-03-09 | Continuous Online Adaptation Driven by User Interaction for Medical Image Segmentation | Wentian Xu et.al. | 2503.06717 | null |
2025-03-09 | A Good Start Matters: Enhancing Continual Learning with Data-Driven Weight Initialization | Md Yousuf Harun et.al. | 2503.06385 | null |
2025-03-08 | Dynamically evolving segment anything model with continuous learning for medical image segmentation | Zhaori Liu et.al. | 2503.06236 | null |
2025-03-08 | Lifelong Learning with Task-Specific Adaptation: Addressing the Stability-Plasticity Dilemma | Ruiyu Wang et.al. | 2503.06213 | null |
2025-03-08 | Minion Gated Recurrent Unit for Continual Learning | Abdullah M. Zyarah et.al. | 2503.06175 | null |
2025-03-12 | Facilitating Daily Practice in Intangible Cultural Heritage through Virtual Reality: A Case Study of Traditional Chinese Flower Arrangement | Yingna Wang et.al. | 2503.06122 | null |
2025-03-08 | STAR: A Foundation Model-driven Framework for Robust Task Planning and Failure Recovery in Robotic Systems | Md Sadman Sakib et.al. | 2503.06060 | null |
2025-03-08 | Pathological Prior-Guided Multiple Instance Learning For Mitigating Catastrophic Forgetting in Breast Cancer Whole Slide Image Classification | Weixi Zheng et.al. | 2503.06056 | null |
2025-03-07 | Enhancing Reasoning with Collaboration and Memory | Julie Michelman et.al. | 2503.05944 | null |
2025-03-06 | Teach YOLO to Remember: A Self-Distillation Approach for Continual Object Detection | Riccardo De Monte et.al. | 2503.04688 | null |
2025-03-06 | CLDyB: Towards Dynamic Benchmarking for Continual Learning with Pre-trained Models | Shengzhuang Chen et.al. | 2503.04655 | link |
2025-03-07 | No Forgetting Learning: Memory-free Continual Learning | Mohammad Ali Vahedifar et.al. | 2503.04638 | null |
2025-03-06 | Knowledge Retention for Continual Model-Based Reinforcement Learning | Yixiang Sun et.al. | 2503.04256 | null |
2025-03-06 | Synthetic Data is an Elegant GIFT for Continual Vision-Language Models | Bin Wu et.al. | 2503.04229 | null |
2025-03-04 | Memory Efficient Continual Learning for Edge-Based Visual Anomaly Detection | Manuel Barusco et.al. | 2503.02691 | null |
2025-03-04 | A Theory of Initialisation’s Impact on Specialisation | Devon Jarvis et.al. | 2503.02526 | null |
2025-03-04 | Continual Multi-Robot Learning from Black-Box Visual Place Recognition Models | Kenta Tsukahara et.al. | 2503.02256 | null |
2025-03-03 | Parabolic Continual Learning | Haoming Yang et.al. | 2503.02117 | null |
2025-02-28 | Continual Learning-Aided Super-Resolution Scheme for Channel Reconstruction and Generalization in OFDM Systems | Jianqiao Chen et.al. | 2503.01897 | null |
2025-02-27 | When Continue Learning Meets Multimodal Large Language Model: A Survey | Yukang Huo et.al. | 2503.01887 | null |
2025-03-03 | An Efficient Continual Learning Framework for Multivariate Time Series Prediction Tasks with Application to Vehicle State Estimation | Arvin Hosseinzadeh et.al. | 2503.01669 | null |
2025-03-03 | STAR: Stability-Inducing Weight Perturbation for Continual Learning | Masih Eskandar et.al. | 2503.01595 | null |
2025-03-03 | A Selective Learning Method for Temporal Graph Continual Learning | Hanmo Liu et.al. | 2503.01580 | null |
2025-03-03 | Alchemist: Towards the Design of Efficient Online Continual Learning System | Yuyang Huang et.al. | 2503.01066 | null |
2025-03-02 | Advancing Prompt-Based Methods for Replay-Independent General Continual Learning | Zhiqi Kang et.al. | 2503.00677 | link |
2025-03-01 | Efficient Prompting for Continual Adaptation to Missing Modalities | Zirun Guo et.al. | 2503.00528 | null |
2025-03-01 | CL-MoE: Enhancing Multimodal Large Language Model with Dual Momentum Mixture-of-Experts for Continual Visual Question Answering | Tianyu Huai et.al. | 2503.00413 | null |
2025-03-01 | Reservoir Network with Structural Plasticity for Human Activity Recognition | Abdullah M. Zyarah et.al. | 2503.00393 | null |
2025-02-28 | Same accuracy, twice as fast: continuous training surpasses retraining from scratch | Eli Verwimp et.al. | 2502.21147 | null |
2025-02-28 | Towards Specialized Wireless Networks Using an ML-Driven Radio Interface | Kamil Szczech et.al. | 2502.20996 | link |
2025-02-28 | Improving Open-world Continual Learning under the Constraints of Scarce Labeled Data | Yujie Li et.al. | 2502.20974 | null |
2025-02-27 | Exploring Open-world Continual Learning with Knowns-Unknowns Knowledge Transfer | Yujie Li et.al. | 2502.20124 | link |
2025-02-27 | Order-Robust Class Incremental Learning: Graph-Driven Dynamic Similarity Grouping | Guannan Lai et.al. | 2502.20032 | link |
2025-02-27 | One-for-More: Continual Diffusion Model for Anomaly Detection | Xiaofan Li et.al. | 2502.19848 | link |
2025-02-26 | PCL: Prompt-based Continual Learning for User Modeling in Recommender Systems | Mingdai Yang et.al. | 2502.19628 | null |
2025-02-26 | Online Prototypes and Class-Wise Hypergradients for Online Continual Learning with Pre-Trained Models | Nicolas Michel et.al. | 2502.18762 | null |
2025-02-27 | SECURA: Sigmoid-Enhanced CUR Decomposition with Uninterrupted Retention and Low-Rank Adaptation in Large Language Models | Yuxuan Zhang et.al. | 2502.18168 | null |
2025-02-25 | C-LoRA: Continual Low-Rank Adaptation for Pre-trained Models | Xin Zhang et.al. | 2502.17920 | null |
2025-02-25 | PVBF: A Framework for Mitigating Parameter Variation Imbalance in Online Continual Learning | Zelin Tao et.al. | 2502.17794 | null |
2025-02-25 | On-device edge learning for IoT data streams: a survey | Afonso Lourenço et.al. | 2502.17788 | null |
2025-02-22 | Recurrent Knowledge Identification and Fusion for Language Model Continual Learning | Yujie Feng et.al. | 2502.17510 | null |
2025-02-24 | CLIMB-3D: Continual Learning for Imbalanced 3D Instance Segmentation | Vishal Thengane et.al. | 2502.17429 | link |
2025-02-24 | A Novel Multiple Access Scheme for Heterogeneous Wireless Communications using Symmetry-aware Continual Deep Reinforcement Learning | Hamidreza Mazandarani et.al. | 2502.17167 | null |
2025-02-24 | Thus Spake Long-Context Large Language Model | Xiaoran Liu et.al. | 2502.17129 | null |
2025-02-23 | Few-shot Continual Relation Extraction via Open Information Extraction | Thiem Nguyen et.al. | 2502.16648 | null |
2025-02-23 | Efficient 4D Gaussian Stream with Low Rank Adaptation | Zhenhuan Liu et.al. | 2502.16575 | null |
2025-02-22 | An Autonomous Network Orchestration Framework Integrating Large Language Models with Continual Reinforcement Learning | Masoud Shokrnezhad et.al. | 2502.16198 | null |
2025-02-21 | On the Design of Safe Continual RL Methods for Control of Nonlinear Systems | Austin Coursey et.al. | 2502.15922 | link |
2025-02-19 | Stock Price Prediction Using a Hybrid LSTM-GNN Model: Integrating Time-Series and Graph-Based Analysis | Meet Satishbhai Sonani et.al. | 2502.15813 | null |
2025-02-21 | MoMa: A Modular Deep Learning Framework for Material Property Prediction | Botian Wang et.al. | 2502.15483 | null |
2025-02-20 | Online hand gesture recognition using Continual Graph Transformers | Rim Slama et.al. | 2502.14939 | null |
2025-02-20 | From RAG to Memory: Non-Parametric Continual Learning for Large Language Models | Bernal Jiménez Gutiérrez et.al. | 2502.14802 | link |
2025-02-20 | μRL: Discovering Transient Execution Vulnerabilities Using Reinforcement Learning | M. Caner Tol et.al. | 2502.14307 | null |
2025-02-20 | Accurate Forgetting for Heterogeneous Federated Continual Learning | Abudukelimu Wuerkaixi et.al. | 2502.14205 | link |
2025-02-19 | CND-IDS: Continual Novelty Detection for Intrusion Detection Systems | Sean Fuhrman et.al. | 2502.14094 | null |
2025-02-19 | Continually Learning Structured Visual Representations via Network Refinement with Rerelation | Zeki Doruk Erden et.al. | 2502.13935 | null |
2025-02-18 | A Survey of Text Classification Under Class Distribution Shift | Adriana Valentina Costache et.al. | 2502.12965 | null |
2025-02-18 | Continuous Learning Conversational AI: A Personalized Agent Framework via A2C Reinforcement Learning | Nandakishor M et.al. | 2502.12876 | null |
2025-02-18 | Cross-Domain Continual Learning for Edge Intelligence in Wireless ISAC Networks | Jingzhi Hu et.al. | 2502.12736 | null |
2025-02-18 | Bring Your Own Knowledge: A Survey of Methods for LLM Knowledge Expansion | Mingyang Wang et.al. | 2502.12598 | null |
2025-02-17 | Achieving Upper Bound Accuracy of Joint Training in Continual Learning | Saleh Momeni et.al. | 2502.12388 | null |
2025-02-14 | Ten Challenging Problems in Federated Foundation Models | Tao Fan et.al. | 2502.12176 | null |
2025-02-17 | Continual Learning Should Move Beyond Incremental Classification | Rupert Mitchell et.al. | 2502.11927 | null |
2025-02-17 | On the Computation of the Fisher Information in Continual Learning | Gido M. van de Ven et.al. | 2502.11756 | link |
2025-02-17 | Mitigating Visual Knowledge Forgetting in MLLM Instruction-tuning via Modality-decoupled Gradient Descent | Junda Wu et.al. | 2502.11740 | null |
2025-02-17 | Exploiting Task Relationships for Continual Learning Using Transferability-Aware Task Embeddings | Yanru Wu et.al. | 2502.11609 | null |
2025-02-17 | DATA: Decomposed Attention-based Task Adaptation for Rehearsal-Free Continual Learning | Huanxuan Liao et.al. | 2502.11482 | link |
2025-02-16 | Non-Uniform Memory Sampling in Experience Replay | Andrii Krutsylo et.al. | 2502.11305 | link |
2025-02-16 | Unlocking the Power of Function Vectors for Characterizing and Mitigating Catastrophic Forgetting in Continual Instruction Tuning | Gangwei Jiang et.al. | 2502.11019 | null |
2025-02-16 | Neural Networks Remember More: The Power of Parameter Isolation and Combination | Biqing Zeng et.al. | 2502.10966 | null |
2025-02-15 | ReReLRP – Remembering and Recognizing Tasks with LRP | Karolina Bogacka et.al. | 2502.10789 | link |
2025-02-14 | Adaptive Neural Networks for Intelligent Data-Driven Development | Youssef Shoeb et.al. | 2502.10603 | null |
2025-02-11 | Analysis of Overparameterization in Continual Learning under a Linear Model | Daniel Goldfarb et.al. | 2502.10442 | null |
2025-02-13 | Vertical Federated Continual Learning via Evolving Prototype Knowledge | Shuo Wang et.al. | 2502.09152 | null |
2025-02-13 | Feature-based Graph Attention Networks Improve Online Continual Learning | Adjovi Sim et.al. | 2502.09143 | null |
2025-02-13 | Replay-free Online Continual Learning with Self-Supervised MultiPatches | Giacomo Cignoni et.al. | 2502.09140 | link |
2025-02-13 | A Hybrid Transformer Model for Fake News Detection: Leveraging Bayesian Optimization and Bidirectional Recurrent Unit | Tianyi Huang et.al. | 2502.09097 | null |
2025-02-12 | Latest Advancements Towards Catastrophic Forgetting under Data Scarcity: A Comprehensive Survey on Few-Shot Class Incremental Learning | M. Anwar Ma’sum et.al. | 2502.08181 | null |
2025-02-11 | SymbioSim: Human-in-the-loop Simulation Platform for Bidirectional Continuing Learning in Human-Robot Interaction | Haoran Chen et.al. | 2502.07358 | null |
2025-02-11 | Cost-Efficient Continual Learning with Sufficient Exemplar Memory | Dongkyu Cho et.al. | 2502.07274 | null |
2025-02-10 | Federated Continual Learning: Concepts, Challenges, and Solutions | Parisa Hamedi et.al. | 2502.07059 | null |
2025-02-10 | Position: Episodic Memory is the Missing Piece for Long-Term LLM Agents | Mathis Pink et.al. | 2502.06975 | null |
2025-02-10 | Sequence Transferability and Task Order Selection in Continual Learning | Thinh Nguyen et.al. | 2502.06544 | null |
2025-02-10 | Prompt-Driven Continual Graph Learning | Qi Wang et.al. | 2502.06327 | link |
2025-02-10 | Position: Continual Learning Benefits from An Evolving Population over An Unified Model | Aojun Lu et.al. | 2502.06210 | null |
2025-02-09 | Sustainable Adaptation for Autonomous Driving with the Mixture of Progressive Experts Network | Yixin Cui et.al. | 2502.05943 | null |
2025-02-09 | MADAR: Efficient Continual Learning for Malware Analysis with Diversity-Aware Replay | Mohammad Saidur Rahman et.al. | 2502.05760 | link |
2025-02-06 | No Images, No Problem: Retaining Knowledge in Continual VQA with Questions-Only Memory | Imad Eddine Marouf et.al. | 2502.04469 | link |
2025-02-05 | Enhancing Reasoning to Adapt Large Language Models for Domain-Specific Applications | Bo Wen et.al. | 2502.04384 | link |
2025-02-06 | Cognitive AI framework: advances in the simulation of human thought | Rommel Salas-Guerra et.al. | 2502.04259 | null |
2025-02-07 | Efficient Few-Shot Continual Learning in Vision-Language Models | Aristeidis Panos et.al. | 2502.04098 | null |
2025-02-05 | Optimal Task Order for Continual Learning of Multiple Tasks | Ziyan Li et.al. | 2502.03350 | null |
2025-02-05 | SPARC: Subspace-Aware Prompt Adaptation for Robust Continual Learning in LLMs | Dinithi Jayasuriya et.al. | 2502.02909 | null |
2025-02-04 | Activation-Informed Merging of Large Language Models | Amin Heyrani Nobari et.al. | 2502.02421 | link |
2025-02-04 | MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning | Shengbo Gu et.al. | 2502.02372 | null |
2025-02-03 | Online Curvature-Aware Replay: Leveraging $\mathbf{2^{nd}}$ Order Information for Online Continual Learning | Edoardo Urettini et.al. | 2502.01866 | null |
2025-02-03 | Structural features of the fly olfactory circuit mitigate the stability-plasticity dilemma in continual learning | Heming Zou et.al. | 2502.01427 | null |
2025-02-03 | Activation by Interval-wise Dropout: A Simple Way to Prevent Neural Networks from Plasticity Loss | Sangyeon Park et.al. | 2502.01342 | null |
2025-02-02 | VLM-Assisted Continual learning for Visual Question Answering in Self-Driving | Yuxin Lin et.al. | 2502.00843 | null |
2025-02-02 | Lipschitz Lifelong Monte Carlo Tree Search for Mastering Non-Stationary Tasks | Zuyuan Zhang et.al. | 2502.00633 | null |
2025-02-02 | DesCLIP: Robust Continual Adaptation via General Attribute Descriptions for Pretrained Vision-Language Models | Chiyuan He et.al. | 2502.00618 | null |
2025-01-28 | Agential AI for Integrated Continual Learning, Deliberative Behavior, and Comprehensible Models | Zeki Doruk Erden et.al. | 2501.16922 | null |
2025-01-26 | Random Walk Guided Hyperbolic Graph Distillation | Yunbo Long et.al. | 2501.15696 | null |
2025-01-28 | An Empirical Study on Decision-Making Aspects in Responsible Software Engineering for AI | Lekshmi Murali Rani et.al. | 2501.15691 | null |
2025-01-24 | Low-rank Prompt Interaction for Continual Vision-Language Retrieval | Weicai Yan et.al. | 2501.14369 | link |
2025-01-24 | Active Learning for Continual Learning: Keeping the Past Alive in the Present | Jaehyun Park et.al. | 2501.14278 | null |
2025-01-24 | Top Ten Challenges Towards Agentic Neural Graph Databases | Jiaxin Bai et.al. | 2501.14224 | null |
2025-01-24 | Serving Long-Context LLMs at the Mobile Edge: Test-Time Reinforcement Learning-based Model Caching and Inference Offloading | Minrui Xu et.al. | 2501.14205 | null |
2025-01-23 | Spurious Forgetting in Continual Learning of Language Models | Junhao Zheng et.al. | 2501.13453 | link |
2025-01-23 | Beyond Task Diversity: Provable Representation Transfer for Sequential Multi-Task Linear Bandits | Thang Duong et.al. | 2501.13390 | link |
2025-01-30 | S-LoRA: Scalable Low-Rank Adaptation for Class Incremental Learning | Yichen Wu et.al. | 2501.13198 | link |
2025-01-26 | Multiple Queries with Multiple Keys: A Precise Prompt Matching Paradigm for Prompt-based Continual Learning | Dunwei Tu et.al. | 2501.12635 | null |
2025-01-21 | UI-TARS: Pioneering Automated GUI Interaction with Native Agents | Yujia Qin et.al. | 2501.12326 | link |
2025-01-21 | Benchmarking Image Perturbations for Testing Automated Driving Assistance Systems | Stefano Carlo Lambertenghi et.al. | 2501.12269 | link |
2025-01-21 | Memory Storyboard: Leveraging Temporal Segmentation for Streaming Self-Supervised Learning from Egocentric Videos | Yanlai Yang et.al. | 2501.12254 | null |
2025-01-21 | A margin-based replacement for cross-entropy loss | Michael W. Spratling et.al. | 2501.12191 | null |
2025-01-28 | Optimally-Weighted Maximum Mean Discrepancy Framework for Continual Learning | KaiHui Huang et.al. | 2501.12121 | null |
2025-01-19 | CLOFAI: A Dataset of Real And Fake Image Classification Tasks for Continual Learning | William Doherty et.al. | 2501.11140 | link |
2025-01-18 | Dynamic Continual Learning: Harnessing Parameter Uncertainty for Improved Network Adaptation | Christopher Angelini et.al. | 2501.10861 | null |
2025-01-16 | Interoceptive Robots for Convergent Shared Control in Collaborative Construction Work | Xiaoshan Zhou et.al. | 2501.09290 | link |
2025-01-21 | Redefining Affordance via Computational Rationality | Yi-Chi Liao et.al. | 2501.09233 | null |
2025-01-14 | Adaptive Cybersecurity: Dynamically Retrainable Firewalls for Real-Time Network Protection | Sina Ahmadi et.al. | 2501.09033 | null |
2025-01-15 | Incrementally Learning Multiple Diverse Data Domains via Multi-Source Dynamic Expansion Model | Runqing Wu et.al. | 2501.08878 | null |
2025-01-15 | Resource-Constrained Federated Continual Learning: What Does Matter? | Yichen Li et.al. | 2501.08737 | null |
2025-01-15 | ANSR-DT: An Adaptive Neuro-Symbolic Learning and Reasoning Framework for Digital Twins | Safayat Bin Hakim et.al. | 2501.08561 | link |
2025-01-14 | Continual Deep Active Learning for Medical Imaging: Replay-Base Architecture for Context Adaptation | Rui Daniel et.al. | 2501.08245 | link |
2025-01-14 | LeapVAD: A Leap in Autonomous Driving via Cognitive Perception and Dual-Process Thinking | Yukai Ma et.al. | 2501.08168 | null |
2025-01-14 | Continual Learning with Embedding Layer Surgery and Task-wise Beam Search using Whisper | Chin Yuen Kwok et.al. | 2501.07875 | null |
2025-01-13 | Dynamic Prototype Rehearsal for Continual Learning in ECG Arrhythmia Detection | Sana Rahmani et.al. | 2501.07555 | null |
2025-01-13 | TiEBe: A Benchmark for Assessing the Current Knowledge of Large Language Models | Thales Sales Almeida et.al. | 2501.07482 | link |
2025-01-13 | Information-Theoretic Dual Memory System for Continual Learning | RunQing Wu et.al. | 2501.07382 | null |
2025-01-13 | Lifelong Learning of Large Language Model based Agents: A Roadmap | Junhao Zheng et.al. | 2501.07278 | link |
Model Cooperation
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-27 | Adversarial Threats in Quantum Machine Learning: A Survey of Attacks and Defenses | Archisman Ghosh et.al. | 2506.21842 | null |
2025-06-25 | AeroLite-MDNet: Lightweight Multi-task Deviation Detection Network for UAV Landing | Haiping Yang et.al. | 2506.21635 | null |
2025-06-26 | Real-time Terrain Analysis for Off-road Autonomous Vehicles | Edwina Lewis et.al. | 2506.21347 | null |
2025-06-26 | FedSC: Federated Learning with Semantic-Aware Collaboration | Huan Wang et.al. | 2506.21012 | null |
2025-06-25 | Differential Transformer-driven 6G Physical Layer for Collaborative Perception Enhancement | Soheyb Ribouh et.al. | 2506.20597 | null |
2025-06-25 | A digital twin of atomic ensemble quantum memories | Elizabeth Robertson et.al. | 2506.20403 | null |
2025-06-25 | A Novel Large Vision Foundation Model (LVFM)-based Approach for Generating High-Resolution Canopy Height Maps in Plantations for Precision Forestry Management | Shen Tan et.al. | 2506.20388 | null |
2025-06-23 | Dynamic Hybrid Modeling: Incremental Identification and Model Predictive Control | Adrian Caspari et.al. | 2506.18344 | null |
2025-06-21 | Machine Learning Model Integration with Open World Temporal Logic for Process Automation | Dyuman Aditya et.al. | 2506.17776 | null |
2025-06-17 | Fine-Scale Soil Mapping in Alaska with Multimodal Machine Learning | Yijun Lin et.al. | 2506.17302 | null |
2025-06-20 | Bayesian Joint Model of Multi-Sensor and Failure Event Data for Multi-Mode Failure Prediction | Sina Aghaee Dabaghan Fard et.al. | 2506.17036 | null |
2025-06-20 | PET Tracer Separation Using Conditional Diffusion Transformer with Multi-latent Space Learning | Bin Huang et.al. | 2506.16934 | null |
2025-06-20 | Hybrid-Sep: Language-queried audio source separation via pre-trained Model Fusion and Adversarial Diffusion Training | Jianyuan Feng et.al. | 2506.16833 | null |
2025-06-19 | From LLM-anation to LLM-orchestrator: Coordinating Small Models for Data Labeling | Yao Lu et.al. | 2506.16393 | null |
2025-06-18 | Heterogeneous Federated Reinforcement Learning Using Wasserstein Barycenters | Luiz Pereira et.al. | 2506.15825 | null |
2025-06-18 | A Machine Learning Framework for Modeling Ensemble Properties of Atomically Disordered Materials | Zhenyao Fang et.al. | 2506.15652 | null |
2025-06-18 | Research on Graph-Retrieval Augmented Generation Based on Historical Text Knowledge Graphs | Yang Fan et.al. | 2506.15241 | null |
2025-06-17 | Digital twin for virtual sensing of ferry quays via a Gaussian Process Latent Force Model | Luigi Sibille et.al. | 2506.14925 | null |
2025-06-17 | A Model-Mediated Stacked Ensemble Approach for Depression Prediction Among Professionals | Md. Mortuza Ahmmed et.al. | 2506.14459 | null |
2025-06-17 | DGG-XNet: A Hybrid Deep Learning Framework for Multi-Class Brain Disease Classification with Explainable AI | Sumshun Nahar Eity et.al. | 2506.14367 | null |
2025-06-16 | Digging deeper: deep joint species distribution modeling reveals environmental drivers of Earthworm Communities | Sara Si-moussi et.al. | 2506.13568 | null |
2025-06-16 | The Butterfly Effect: Neural Network Training Trajectories Are Highly Sensitive to Initial Conditions | Devin Kwok et.al. | 2506.13234 | null |
2025-06-15 | Spatial Optimization of Autonomous Vehicle Assignment Based on Distance-Driven Demand and Customer Patience | Niloufar Mirzavand Boroujeni et.al. | 2506.12671 | null |
2025-06-14 | TagRouter: Learning Route to LLMs through Tags for Open-Domain Text Generation Tasks | Zhou Chen et.al. | 2506.12473 | null |
2025-06-13 | CLIP Meets Diffusion: A Synergistic Approach to Anomaly Detection | Byeongchan Lee et.al. | 2506.11772 | null |
2025-06-13 | Collaborative LLM Inference via Planning for Efficient Reasoning | Byeongchan Lee et.al. | 2506.11578 | null |
2025-06-12 | WaveFormer: A Lightweight Transformer Model for sEMG-based Gesture Recognition | Yanlong Chen et.al. | 2506.11168 | null |
2025-06-16 | A Bayesian Multisource Fusion Model for Spatiotemporal PM2.5 in an Urban Setting | Abi I. Riley et.al. | 2506.10688 | link |
2025-06-12 | Unsupervised Protoform Reconstruction through Parsimonious Rule-guided Heuristics and Evolutionary Search | Promise Dodzi Kpoglu et.al. | 2506.10614 | link |
2025-06-11 | Learning to Collaborate Over Graphs: A Selective Federated Multi-Task Learning Approach | Ahmed Elbakary et.al. | 2506.10102 | link |
2025-06-11 | Engineering Cryogenic FETs: Addressing SCEs and Impact of Interface Traps Down to 2 K Temperature | Nilesh Pandey et.al. | 2506.09356 | null |
2025-06-13 | Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model | Ailin Huang et.al. | 2506.08967 | null |
2025-06-10 | A multi-physics model for dislocation driven spontaneous grain nucleation and microstructure evolution in polycrystals | Izzet Tarik Tandogan et.al. | 2506.08843 | null |
2025-06-10 | Data-Efficient Challenges in Visual Inductive Priors: A Retrospective | Robert-Jan Bruintjes et.al. | 2506.08612 | null |
2025-06-10 | DEKC: Data-Enable Control for Tethered Space Robot Deployment in the Presence of Uncertainty via Koopman Operator Theory | Ao Jin et.al. | 2506.08319 | link |
2025-06-09 | Efficient Seismic Data Interpolation via Sparse Attention Transformer and Diffusion Model | Xiaoli Wei et.al. | 2506.07923 | null |
2025-06-09 | Language Embedding Meets Dynamic Graph: A New Exploration for Neural Architecture Representation Learning | Haizhao Jing et.al. | 2506.07735 | null |
2025-06-09 | FuXi-Air: Urban Air Quality Forecasting Based on Emission-Meteorology-Pollutant multimodal Machine Learning | Zhixin Geng et.al. | 2506.07616 | null |
2025-06-09 | CBAM-STN-TPS-YOLO: Enhancing Agricultural Object Detection through Spatially Adaptive Attention Mechanisms | Satvik Praveen et.al. | 2506.07357 | null |
2025-06-08 | SDE-SQL: Enhancing Text-to-SQL Generation in Large Language Models via Self-Driven Exploration with SQL Probes | Wenxuan Xie et.al. | 2506.07245 | null |
2025-06-07 | Hybrid Extractive Abstractive Summarization for Multilingual Sentiment Analysis | Mikhail Krasitskii et.al. | 2506.06929 | null |
2025-06-07 | Graph Neural Networks in Modern AI-aided Drug Discovery | Odin Zhang et.al. | 2506.06915 | null |
2025-06-06 | Rapid training of Hamiltonian graph networks without gradient descent | Atamert Rahma et.al. | 2506.06558 | null |
2025-06-06 | Bridging Audio and Vision: Zero-Shot Audiovisual Segmentation by Connecting Pretrained Models | Seung-jae Lee et.al. | 2506.06537 | null |
2025-06-06 | Near-real-time ship grounding damage assessment using Bayesian networks | Dimitris G. Georgiadis et.al. | 2506.06493 | null |
2025-06-04 | A Novel Data Augmentation Approach for Automatic Speaking Assessment on Opinion Expressions | Chung-Chun Wang et.al. | 2506.04077 | null |
2025-06-04 | Advancements in Artificial Intelligence Applications for Cardiovascular Disease Research | Yuanlin Mo et.al. | 2506.03698 | null |
2025-06-04 | A Threat Intelligence Event Extraction Conceptual Model for Cyber Threat Intelligence Feeds | Jamal H. Al-Yasiri et.al. | 2506.03551 | null |
2025-06-03 | Low-EFFourth: A computational framework for generating and studying multilevel model ensembles in low-dimensional systems | Francisco de Melo Viríssimo et.al. | 2506.03313 | null |
2025-06-03 | BadReward: Clean-Label Poisoning of Reward Models in Text-to-Image RLHF | Kaiwen Duan et.al. | 2506.03234 | null |
2025-06-03 | SemVink: Advancing VLMs’ Semantic Understanding of Optical Illusions via Visual Global Thinking | Sifan Li et.al. | 2506.02803 | null |
2025-06-03 | PhysGaia: A Physics-Aware Dataset of Multi-Body Interactions for Dynamic Novel View Synthesis | Mijeong Kim et.al. | 2506.02794 | link |
2025-06-02 | Understanding Overadaptation in Supervised Fine-Tuning: The Role of Ensemble Methods | Yifan Hao et.al. | 2506.01901 | null |
2025-06-02 | Human-Centric Evaluation for Foundation Models | Yijin Guo et.al. | 2506.01793 | null |
2025-06-02 | Confidence intervals for forced alignment boundaries using model ensembles | Matthew C. Kelley et.al. | 2506.01256 | link |
2025-06-01 | One for All: Update Parameterized Knowledge Across Multiple Models | Weitao Ma et.al. | 2506.00817 | null |
2025-05-31 | Not Just $N_e$ $N_e$-more: New Applications for SMC from Ecology to Phylogenies | David Peede et.al. | 2506.00692 | null |
2025-05-31 | Performance Analysis of Few-Shot Learning Approaches for Bangla Handwritten Character and Digit Recognition | Mehedi Ahamed et.al. | 2506.00447 | null |
2025-06-03 | A Systematic Review of Metaheuristics-Based and Machine Learning-Driven Intrusion Detection Systems in IoT | Mohammad Shamim Ahsan et.al. | 2506.00377 | null |
2025-05-30 | Sleep Brain and Cardiac Activity Predict Cognitive Flexibility and Conceptual Reasoning Using Deep Learning | Boshra Khajehpiri et.al. | 2506.00279 | null |
2025-05-30 | A DNA Methylation Classification Model Predicts Organ and Disease Site | Keng-Jung Lee et.al. | 2506.00146 | null |
2025-05-30 | Assessing Future Wind Energy Potential under Climate Change: The Critical Role of Multi-Model Ensembles in Robustness Assessment | Andrea Lira-Loarca et.al. | 2505.24463 | null |
2025-05-30 | Towards Unified Modeling in Federated Multi-Task Learning via Subspace Decoupling | Yipan Wei et.al. | 2505.24185 | null |
2025-05-30 | Autoregressive regularized score-based diffusion models for multi-scenarios fluid flow prediction | Wilfried Genuist et.al. | 2505.24145 | null |
2025-05-28 | CADRE: Customizable Assurance of Data Readiness in Privacy-Preserving Federated Learning | Kaveen Hiniduma et.al. | 2505.23849 | null |
2025-05-27 | DP-RTFL: Differentially Private Resilient Temporal Federated Learning for Trustworthy AI in Regulated Industries | Abhijit Talluri et.al. | 2505.23813 | link |
2025-05-26 | Detection of Suicidal Risk on Social Media: A Hybrid Model | Zaihan Yang et.al. | 2505.23797 | null |
2025-05-29 | Position Paper: Metadata Enrichment Model: Integrating Neural Networks and Semantic Knowledge Graphs for Cultural Heritage Applications | Jan Ignatowicz et.al. | 2505.23543 | null |
2025-05-29 | Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis | Hengyuan Cao et.al. | 2505.23325 | null |
2025-05-29 | Second Opinion Matters: Towards Adaptive Clinical AI via the Consensus of Expert Model Ensemble | Amit Kumthekar et.al. | 2505.23075 | null |
2025-05-28 | Structured Memory Mechanisms for Stable Context Representation in Large Language Models | Yue Xing et.al. | 2505.22921 | null |
2025-05-30 | Stochastic Chameleons: Irrelevant Context Hallucinations Reveal Class-Based (Mis)Generalization in LLMs | Ziling Cheng et.al. | 2505.22630 | null |
2025-05-28 | EnsemW2S: Enhancing Weak-to-Strong Generalization with Large Language Model Ensembles | Aakriti Agrawal et.al. | 2505.21959 | null |
2025-05-28 | Revisiting Bayesian Model Averaging in the Era of Foundation Models | Mijung Park et.al. | 2505.21857 | null |
2025-05-27 | RLJP: Legal Judgment Prediction via First-Order Logic Rule-enhanced with Large Language Models | Yue Zhang et.al. | 2505.21281 | null |
2025-05-28 | HTMNet: A Hybrid Network with Transformer-Mamba Bottleneck Multimodal Fusion for Transparent and Reflective Objects Depth Completion | Guanghu Xie et.al. | 2505.20904 | null |
2025-05-29 | Avoid Forgetting by Preserving Global Knowledge Gradients in Federated Learning with Non-IID Data | Abhijit Chunduru et.al. | 2505.20485 | null |
2025-05-26 | An Empirical Study on Strong-Weak Model Collaboration for Repo-level Code Generation | Shubham Gandhi et.al. | 2505.20182 | link |
2025-05-25 | Cellular Traffic Prediction via Byzantine-robust Asynchronous Federated Learning | Hui Ma et.al. | 2505.19263 | link |
2025-05-25 | Deformable Attentive Visual Enhancement for Referring Segmentation Using Vision-Language Model | Alaa Dalaq et.al. | 2505.19242 | null |
2025-05-24 | Learning without Isolation: Pathway Protection for Continual Learning | Zhikang Chen et.al. | 2505.18568 | link |
2025-05-24 | Knowledge Grafting of Large Language Models | Guodong Du et.al. | 2505.18502 | link |
2025-05-24 | Breaking Silos: Adaptive Model Fusion Unlocks Better Time Series Forecasting | Zhining Liu et.al. | 2505.18442 | link |
2025-05-23 | A Robust PPO-optimized Tabular Transformer Framework for Intrusion Detection in Industrial IoT Systems | Yuanya She et.al. | 2505.18234 | link |
2025-05-23 | NeuroTrails: Training with Dynamic Sparse Heads as the Key to Effective Ensembling | Bram Grooten et.al. | 2505.17909 | null |
2025-05-23 | Get Experience from Practice: LLM Agents with Record & Replay | Erhu Feng et.al. | 2505.17716 | null |
2025-05-23 | JALMBench: Benchmarking Jailbreak Vulnerabilities in Audio Language Models | Zifan Peng et.al. | 2505.17568 | link |
2025-05-23 | AI-Augmented LLMs Achieve Therapist-Level Responses in Motivational Interviewing | Yinghui Huang et.al. | 2505.17380 | null |
2025-05-22 | Bayesian Optimization for Enhanced Language Models: Optimizing Acquisition Functions | Zishuo Bao et.al. | 2505.17151 | null |
2025-05-18 | Improving LLM Outputs Against Jailbreak Attacks with Expert Model Integration | Tatia Tsmindashvili et.al. | 2505.17066 | null |
2025-05-22 | Four Eyes Are Better Than Two: Harnessing the Collaborative Potential of Large Models via Differentiated Thinking and Complementary Ensembles | Jun Xie et.al. | 2505.16784 | null |
2025-05-21 | Domain Adaptive Skin Lesion Classification via Conformal Ensemble of Vision Transformers | Mehran Zoravar et.al. | 2505.15997 | null |
2025-05-21 | Model Merging is Secretly Certifiable: Non-Vacuous Generalisation Bounds for Low-Shot Learning | Taehoon Kim et.al. | 2505.15798 | null |
2025-05-21 | From Grounding to Manipulation: Case Studies of Foundation Model Integration in Embodied Robotic Systems | Xiuchao Sui et.al. | 2505.15685 | link |
2025-05-21 | Federated Learning-Enhanced Blockchain Framework for Privacy-Preserving Intrusion Detection in Industrial IoT | Anas Ali et.al. | 2505.15376 | null |
2025-05-21 | Multiple Weaks Win Single Strong: Large Language Models Ensemble Weak Reinforcement Learning Agents into a Supreme One | Yiwen Song et.al. | 2505.15306 | null |
2025-05-21 | Multi-horizon optimization for domestic renewable energy system design under uncertainty | Giovanni Micheli et.al. | 2505.15167 | null |
2025-05-20 | ReservoirTTA: Prolonged Test-time Adaptation for Evolving and Recurring Domains | Guillaume Vray et.al. | 2505.14511 | null |
2025-05-20 | InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion | Yuanyi Wang et.al. | 2505.13893 | link |
2025-05-20 | InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models | Yanggan Gu et.al. | 2505.13878 | link |
2025-05-20 | Articulatory Feature Prediction from Surface EMG during Speech Production | Jihwan Lee et.al. | 2505.13814 | null |
2025-05-22 | Gaze-Enhanced Multimodal Turn-Taking Prediction in Triadic Conversations | Seongsil Heo et.al. | 2505.13688 | null |
2025-05-17 | Data Balancing Strategies: A Survey of Resampling and Augmentation Methods | Behnam Yousefimehr et.al. | 2505.13518 | null |
2025-05-22 | Feedback-Driven Dynamical Model for Axonal Extension on Parallel Micropatterns | Kyle Cheng et.al. | 2505.13361 | null |
2025-05-19 | HydraInfer: Hybrid Disaggregated Scheduling for Multimodal Large Language Model Serving | Xianzhe Dong et.al. | 2505.12658 | null |
2025-05-18 | Introspective Growth: Automatically Advancing LLM Expertise in Technology Judgment | Siyang Wu et.al. | 2505.12452 | null |
2025-05-17 | MedVKAN: Efficient Feature Extraction with Mamba and KAN for Medical Image Segmentation | Hancan Zhu et.al. | 2505.11797 | link |
2025-05-15 | GSPRec: Temporal-Aware Graph Spectral Filtering for Recommendation | Ahmad Bin Rabiah et.al. | 2505.11552 | null |
2025-05-16 | Driving Mechanisms and Forecasting of China’s Pet Population-An ARIMA-RF-HW Hybrid Approach | Shengjia Chang et.al. | 2505.11269 | null |
2025-05-16 | A High-Performance Thermal Infrared Object Detection Framework with Centralized Regulation | Jinke Li et.al. | 2505.10825 | null |
2025-05-14 | WhatsAI: Transforming Meta Ray-Bans into an Extensible Generative AI Platform for Accessibility | Nasif Zaman et.al. | 2505.09823 | null |
2025-05-13 | Towards Adaptive Meta-Gradient Adversarial Examples for Visual Tracking | Wei-Long Tian et.al. | 2505.08999 | link |
2025-05-13 | LM-Scout: Analyzing the Security of Language Model Integration in Android Apps | Muhammad Ibrahim et.al. | 2505.08204 | null |
2025-05-12 | LAMM-ViT: AI Face Detection via Layer-Aware Modulation of Region-Guided Attention | Jiangling Zhang et.al. | 2505.07734 | null |
2025-05-12 | Hybrid Spiking Vision Transformer for Object Detection with Event Cameras | Qi Xu et.al. | 2505.07715 | null |
2025-05-12 | A Survey on Collaborative Mechanisms Between Large and Small Language Models | Yi Chen et.al. | 2505.07460 | null |
2025-05-12 | REMEDI: Relative Feature Enhanced Meta-Learning with Distillation for Imbalanced Prediction | Fei Liu et.al. | 2505.07245 | null |
2025-05-11 | NeuRN: Neuro-inspired Domain Generalization for Image Classification | Hamd Jalil et.al. | 2505.06881 | null |
2025-05-10 | Improving Generalization of Medical Image Registration Foundation Model | Jing Hu et.al. | 2505.06527 | link |
2025-05-17 | A Preliminary Study for GPT-4o on Image Restoration | Hao Yang et.al. | 2505.05621 | link |
2025-05-08 | Optimal Microgrid Sizing of Offshore Renewable Energy Sources for Offshore Platforms and Coastal Communities | Ann Mary Toms et.al. | 2505.05305 | null |
2025-05-08 | ReAlign: Bilingual Text-to-Motion Generation via Step-Aware Reward-Guided Alignment | Wanjiang Weng et.al. | 2505.04974 | null |
2025-05-08 | Building-Guided Pseudo-Label Learning for Cross-Modal Building Damage Mapping | Jiepan Li et.al. | 2505.04941 | link |
2025-05-12 | Particle Gibbs without the Gibbs bit | Adrien Corenflos et.al. | 2505.04611 | link |
2025-05-06 | LogiDebrief: A Signal-Temporal Logic based Automated Debriefing Approach with Large Language Models Integration | Zirong Chen et.al. | 2505.03985 | null |
2025-05-06 | Edge Large AI Models: Collaborative Deployment and IoT Applications | Zixin Wang et.al. | 2505.03139 | null |
2025-05-05 | Logits-Constrained Framework with RoBERTa for Ancient Chinese NER | Wenjie Hua et.al. | 2505.02983 | null |
2025-05-05 | LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis | Qingkai Fang et.al. | 2505.02625 | link |
2025-05-05 | ReeM: Ensemble Building Thermodynamics Model for Efficient HVAC Control via Hierarchical Reinforcement Learning | Yang Deng et.al. | 2505.02439 | null |
2025-05-04 | Electrospray Thruster Plume Dynamics: Insights from Precise PP Coulomb Field Simulation | Zhe Liu et.al. | 2505.01981 | null |
2025-05-03 | Parameter Sensitivity Analysis in Zinc-Ion Batteries: A Study on Ionic Conductivity, Current Density, and Electrode Properties | Roya Rajabi et.al. | 2505.01887 | null |
2025-05-02 | Harmonizing Intra-coherence and Inter-divergence in Ensemble Attacks for Adversarial Transferability | Zhaoyang Ma et.al. | 2505.01168 | null |
2025-05-02 | Dual-Forecaster: A Multimodal Time Series Model Integrating Descriptive and Predictive Texts | Wenfa Wu et.al. | 2505.01135 | null |
2025-05-02 | CoCoAFusE: Beyond Mixtures of Experts via Model Fusion | Aurelio Raffa Ugolini et.al. | 2505.01105 | null |
2025-05-02 | Forecasting the solar cycle using variational data assimilation: validation on cycles 22 to 25 | L. Jouve et.al. | 2505.01053 | null |
2025-05-01 | The Comparability of Model Fusion to Measured Data in Confuser Rejection | Conor Flynn et.al. | 2505.00836 | null |
2025-05-01 | KnowEEG: Explainable Knowledge Driven EEG Classification | Amarpal Sahota et.al. | 2505.00541 | null |
2025-05-03 | Improving Phishing Email Detection Performance of Small Large Language Models | Zijie Lin et.al. | 2505.00034 | null |
2025-04-30 | Polka-dotted Stars: a Hierarchical Model for Mapping Stellar Surfaces Using Occultation Light Curves and the Case of TOI-3884 | Sabina Sagynbayeva et.al. | 2504.21852 | null |
2025-04-30 | MAGNET: an open-source library for mesh agglomeration by Graph Neural Networks | Paola F. Antonietti et.al. | 2504.21780 | link |
2025-04-30 | LSTM+Geo with xgBoost Filtering: A Novel Approach for Race and Ethnicity Imputation with Reduced Bias | S. Chalavadi et.al. | 2504.21259 | null |
2025-04-28 | Hybrid Approach Combining Ultrasound and Blood Test Analysis with a Voting Classifier for Accurate Liver Fibrosis and Cirrhosis Assessment | Kapil Kashyap et.al. | 2504.19755 | null |
2025-04-28 | SynergyAmodal: Deocclude Anything with Text Control | Xinyang Li et.al. | 2504.19506 | null |
2025-04-29 | Unified Multi-Task Learning & Model Fusion for Efficient Language Model Guardrailing | James O’ Neill et.al. | 2504.19333 | null |
2025-04-27 | Anyprefer: An Agentic Framework for Preference Data Synthesis | Yiyang Zhou et.al. | 2504.19276 | null |
2025-04-27 | Vessel Length Estimation from Magnetic Wake Signature: A Physics-Informed Residual Neural Network Approach | Mohammad Amir Fallah et.al. | 2504.19112 | null |
2025-04-27 | Efficient Reasoning for LLMs through Speculative Chain-of-Thought | Jikai Wang et.al. | 2504.19095 | link |
2025-04-25 | Proof-of-TBI – Fine-Tuned Vision Language Model Consortium and OpenAI-o3 Reasoning LLM-Based Medical Diagnosis Support System for Mild Traumatic Brain Injury (TBI) Prediction | Ross Gore et.al. | 2504.18671 | null |
2025-04-25 | Enhancing Visual Interpretability and Explainability in Functional Survival Trees and Forests | Giuseppe Loffredo et.al. | 2504.18498 | null |
2025-04-25 | ThreMoLIA: Threat Modeling of Large Language Model-Integrated Applications | Felix Viktor Jedrzejewski et.al. | 2504.18369 | null |
2025-04-25 | Auto-Regressive Standard Precipitation Index: A Bayesian Approach for Drought Characterization | Soham Ghosh et.al. | 2504.18197 | null |
2025-04-25 | Modular integration of neural connectomics, dynamics and biomechanics for identification of behavioral sensorimotor pathways in Caenorhabditis elegans | Jimin Kim et.al. | 2504.18073 | null |
2025-04-24 | RAGAT-Mind: A Multi-Granular Modeling Approach for Rumor Detection Based on MindSpore | Zhenkai Qin et.al. | 2504.17574 | null |
2025-04-24 | Towards Harnessing the Collaborative Power of Large and Small Models for Domain Tasks | Yang Liu et.al. | 2504.17421 | null |
2025-04-23 | CLPSTNet: A Progressive Multi-Scale Convolutional Steganography Model Integrating Curriculum Learning | Fengchun Liu et.al. | 2504.16364 | link |
2025-04-22 | Few-shot Hate Speech Detection Based on the MindSpore Framework | Zhenkai Qin et.al. | 2504.15987 | null |
2025-04-24 | Synergizing RAG and Reasoning: A Systematic Review | Yunfan Gao et.al. | 2504.15909 | null |
2025-04-22 | Identifying eclipsing binary stars with TESS data based on a new hybrid deep learning model | Ying Shan et.al. | 2504.15875 | link |
2025-04-21 | DistilQwen2.5: Industrial Practices of Training Distilled Open Lightweight Language Models | Chengyu Wang et.al. | 2504.15027 | null |
2025-04-21 | Is Intelligence the Right Direction in New OS Scheduling for Multiple Resources in Cloud Environments? | Xinglei Dou et.al. | 2504.15021 | null |
2025-04-21 | Multimodal Non-Semantic Feature Fusion for Predicting Segment Access Frequency in Lecture Archives | Ruozhu Sheng et.al. | 2504.14927 | null |
2025-04-19 | Quantum-Enhanced Reinforcement Learning for Power Grid Security Assessment | Benjamin M. Peter et.al. | 2504.14412 | null |
2025-04-19 | Mathematical Programming Models for Exact and Interpretable Formulation of Neural Networks | Masoud Ataei et.al. | 2504.14356 | null |
2025-04-19 | Dusty stellar sources classification by implementing machine learning methods based on spectroscopic observations in the Magellanic Clouds | Sepideh Ghaziasgar et.al. | 2504.14332 | null |
2025-04-19 | End-Edge Model Collaboration: Bandwidth Allocation for Data Upload and Model Transmission | Dailin Yang et.al. | 2504.14310 | null |
2025-04-18 | SatelliteCalculator: A Multi-Task Vision Foundation Model for Quantitative Remote Sensing Inversion | Zhenyu Yu et.al. | 2504.13442 | null |
2025-04-14 | Investigating cybersecurity incidents using large language models in latest-generation wireless networks | Leonid Legashev et.al. | 2504.13196 | null |
2025-04-17 | Bayesian model-data comparison incorporating theoretical uncertainties | Sunil Jaiswal et.al. | 2504.13144 | null |
2025-04-17 | NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement: Methods and Results | Xin Li et.al. | 2504.13131 | link |
2025-04-18 | The Athenian Academy: A Seven-Layer Architecture Model for Multi-Agent Systems | Lidong Zhai et.al. | 2504.12735 | null |
2025-04-16 | Decision-based AI Visual Navigation for Cardiac Ultrasounds | Andy Dimnaku et.al. | 2504.12535 | null |
2025-04-15 | Mamba-Based Ensemble learning for White Blood Cell Classification | Lewis Clifton et.al. | 2504.11438 | link |
2025-04-15 | 3D Wavelet Convolutions with Extended Receptive Fields for Hyperspectral Image Classification | Guandong Li et.al. | 2504.10795 | null |
2025-04-15 | Collaborative Bayesian Optimization via Wasserstein Barycenters | Donglin Zhan et.al. | 2504.10770 | null |
2025-04-14 | Keyword Extraction, and Aspect Classification in Sinhala, English, and Code-Mixed Content | F. A. Rizvi et.al. | 2504.10679 | null |
2025-04-14 | Refining Financial Consumer Complaints through Multi-Scale Model Interaction | Bo-Wei Chen et.al. | 2504.09903 | null |
2025-04-12 | IMPACT: Behavioral Intention-aware Multimodal Trajectory Prediction with Adaptive Context Trimming | Jiawei Sun et.al. | 2504.09103 | null |
2025-04-12 | Large Language Models integration in Smart Grids | Seyyedreza Madani et.al. | 2504.09059 | null |
2025-04-11 | CMS RPC Non-Physics Event Data Automation Ideology | A. Dimitrov et.al. | 2504.08991 | null |
2025-04-11 | Preserving Privacy Without Compromising Accuracy: Machine Unlearning for Handwritten Text Recognition | Lei Kang et.al. | 2504.08616 | null |
2025-04-10 | STEI-PCN: an efficient pure convolutional network for traffic prediction via spatial-temporal encoding and inferring | Kai Hu et.al. | 2504.08061 | null |
2025-04-10 | Nonlocal Retinex-Based Variational Model and its Deep Unfolding Twin for Low-Light Image Enhancement | Daniel Torres et.al. | 2504.07810 | null |
2025-04-09 | A Multi-Phase Analysis of Blood Culture Stewardship: Machine Learning Prediction, Expert Recommendation Assessment, and LLM Automation | Fatemeh Amrollahi et.al. | 2504.07278 | null |
2025-04-09 | Unifying Search and Recommendation: A Generative Paradigm Inspired by Information Theory | Jujia Zhao et.al. | 2504.06714 | null |
2025-04-09 | SEE: Continual Fine-tuning with Sequential Ensemble of Experts | Zhilin Wang et.al. | 2504.06664 | link |
2025-04-17 | FuseRL: Dense Preference Optimization for Heterogeneous Model Fusion | Longguang Zhong et.al. | 2504.06562 | null |
2025-04-07 | Going beyond explainability in multi-modal stroke outcome prediction models | Jonas Brändli et.al. | 2504.06299 | null |
2025-04-08 | A Lightweight Large Vision-language Model for Multimodal Medical Images | Belal Alsinglawi et.al. | 2504.05575 | null |
2025-04-07 | Content-Aware Transformer for All-in-one Image Restoration | Gang Wu et.al. | 2504.04869 | link |
2025-04-07 | Enhancing Trust in AI Marketplaces: Evaluating On-Chain Verification of Personalized AI models using zk-SNARKs | Nishant Jagannath et.al. | 2504.04794 | null |
2025-04-07 | Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs | Will Cai et.al. | 2504.04715 | link |
2025-04-05 | Foundation Models for Environmental Science: A Survey of Emerging Frontiers | Runlong Yu et.al. | 2504.04280 | null |
2025-04-08 | Scalable Robust Bayesian Co-Clustering with Compositional ELBOs | Ashwin Vinod et.al. | 2504.04079 | null |
2025-04-04 | NucleiML: A machine learning framework of ground-state properties of finite nuclei for accelerated Bayesian exploration | Anagh Venneti et.al. | 2504.03333 | null |
2025-04-03 | From Questions to Insights: Exploring XAI Challenges Reported on Stack Overflow Questions | Saumendu Roy et.al. | 2504.03085 | null |
2025-04-03 | Improving Counterfactual Truthfulness for Molecular Property Prediction through Uncertainty Quantification | Jonas Teufel et.al. | 2504.02606 | null |
2025-04-03 | Impact of Global Warming on Extreme Rainfall in Taiwan | Cheng-Ching Lin et.al. | 2504.02470 | null |
2025-04-03 | FT-Transformer: Resilient and Reliable Transformer with End-to-End Fault Tolerant Attention | Huangliang Dai et.al. | 2504.02211 | null |
2025-04-02 | Hessian-aware Training for Enhancing DNNs Resilience to Parameter Corruptions | Tahmid Hasan Prato et.al. | 2504.01933 | null |
2025-04-02 | A Two-Timescale Approach for Wireless Federated Learning with Parameter Freezing and Power Control | Jinhao Ouyang et.al. | 2504.01752 | null |
2025-04-02 | Design and Experimental Validation of an Urban Microclimate Tool Integrating Indoor-Outdoor Detailed Longwave Radiative Fluxes at District Scale | Marie-Hélène Azam et.al. | 2504.01736 | null |
2025-04-01 | Cooper: A Library for Constrained Optimization in Deep Learning | Jose Gallego-Posada et.al. | 2504.01212 | link |
2025-04-01 | The role of ethical consumption in promoting democratic sustainability: revisiting neoclassical economics through Kantian ethics | Pascal Stiefenhofer et.al. | 2504.01138 | null |
2025-04-01 | SViQA: A Unified Speech-Vision Multimodal Model for Textless Visual Question Answering | Bingxin Li et.al. | 2504.01049 | null |
2025-04-01 | Galaxy Morphology Classification via Deep Semi-Supervised Learning with Limited Labeled Data | Zhijian Luo et.al. | 2504.00500 | null |
2025-04-01 | Hawkeye: Efficient Reasoning with Model Collaboration | Jianshu She et.al. | 2504.00424 | null |
2025-04-01 | Collaborative LLM Numerical Reasoning with Local Data Protection | Min Zhang et.al. | 2504.00299 | null |
2025-03-31 | VIDEX: A Disaggregated and Extensible Virtual Index for the Cloud and AI Era | Rong Kang et.al. | 2503.23776 | link |
2025-03-30 | FedCAPrivacy: Privacy-Preserving Heterogeneous Federated Learning with Anonymous Adaptive Clustering | Yunan Wei et.al. | 2503.23292 | null |
2025-03-29 | ShiftLIC: Lightweight Learned Image Compression with Spatial-Channel Shift Operations | Youneng Bao et.al. | 2503.23052 | link |
2025-03-28 | Beyond Vanilla Fine-Tuning: Leveraging Multistage, Multilingual, and Domain-Specific Methods for Low-Resource Machine Translation | Sarubi Thillainathan et.al. | 2503.22582 | null |
2025-03-27 | Fusion of Graph Neural Networks via Optimal Transport | Weronika Ormaniec et.al. | 2503.21579 | null |
2025-03-27 | Federated Intelligence: When Large AI Models Meet Federated Fine-Tuning and Collaborative Reasoning at the Network Edge | Wanli Ni et.al. | 2503.21412 | null |
2025-03-26 | Toward Sustainable Polymer Design: A Molecular Dynamics-Informed Machine Learning Approach for Vitrimers | Yiwen Zheng et.al. | 2503.20956 | link |
2025-03-26 | Robust Federated Learning Against Poisoning Attacks: A GAN-Based Defense Framework | Usama Zafar et.al. | 2503.20884 | link |
2025-03-26 | GAIA-2: A Controllable Multi-View Generative World Model for Autonomous Driving | Lloyd Russell et.al. | 2503.20523 | null |
2025-03-26 | How Secure is Forgetting? Linking Machine Unlearning to Machine Learning Attacks | Muhammed Shafi K. P. et.al. | 2503.20257 | null |
2025-03-24 | Distributionally Robust Federated Learning: An ADMM Algorithm | Wen Bai et.al. | 2503.18436 | null |
2025-03-23 | Efficient Deep Learning Approaches for Processing Ultra-Widefield Retinal Imaging | Siwon Kim et.al. | 2503.18151 | null |
2025-03-22 | Feather-SQL: A Lightweight NL2SQL Framework with Dual-Model Collaboration Paradigm for Small Language Models | Wenqi Pei et.al. | 2503.17811 | null |
2025-03-22 | Sense4FL: Vehicular Crowdsensing Enhanced Federated Learning for Autonomous Driving | Yanan Ma et.al. | 2503.17697 | null |
2025-03-21 | A Two-Stage Stochastic Model for Road-Rail Intermodal Freight Transportation Under Demand and Capacity Uncertainty | Jeremiah Gbadegoye et.al. | 2503.17510 | null |
2025-03-16 | State Fourier Diffusion Language Model (SFDLM): A Scalable, Novel Iterative Approach to Language Modeling | Andrew Kiruluta et.al. | 2503.17382 | null |
2025-03-21 | A New Statistical Model of Star Speckles for Learning to Detect and Characterize Exoplanets in Direct Imaging Observations | Théo Bodrito et.al. | 2503.17117 | link |
2025-03-20 | Allocation Multiplicity: Evaluating the Promises of the Rashomon Set | Shomik Jain et.al. | 2503.16621 | null |
2025-03-20 | Video-VoT-R1: An efficient video inference model integrating image packing and AoE architecture | Cheng Li et.al. | 2503.15807 | null |
2025-03-20 | Disentangling Uncertainties by Learning Compressed Data Representation | Zhiyu An et.al. | 2503.15801 | link |
2025-03-19 | MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration | David Wan et.al. | 2503.15272 | null |
2025-03-17 | Redefining non-IID Data in Federated Learning for Computer Vision Tasks: Migrating from Labels to Embeddings for Task-Specific Data Distributions | Kasra Borazjani et.al. | 2503.14553 | link |
2025-03-18 | FlexVLN: Flexible Adaptation for Diverse Vision-and-Language Navigation Tasks | Siqi Zhang et.al. | 2503.13966 | null |
2025-03-18 | SNAKE: A Sustainable and Multi-functional Traffic Analysis System utilizing Specialized Large-Scale Models with a Mixture of Experts Architecture | Tian Qin et.al. | 2503.13808 | null |
2025-03-17 | Mitigating Spectral Bias in Neural Operators via High-Frequency Scaling for Physical Systems | Siavash Khodakarami et.al. | 2503.13695 | link |
2025-03-15 | Adaptive Fault Tolerance Mechanisms of Large Language Models in Cloud Computing Environments | Yihong Jin et.al. | 2503.12228 | null |
2025-03-15 | A Speech-to-Video Synthesis Approach Using Spatio-Temporal Diffusion for Vocal Tract MRI | Paula Andrea Pérez-Toro et.al. | 2503.12102 | null |
2025-03-15 | Generative Modeling of Adversarial Lane-Change Scenario | Chuancheng Zhang et.al. | 2503.12055 | null |
2025-03-14 | Robust Model Predictive Control of Fast Lithium-ion Battery Pretreatment for Safe Recycling | Meng Yuan et.al. | 2503.11857 | null |
2025-03-21 | Enhanced Continual Learning of Vision-Language Models with Model Fusion | Haoyuan Gao et.al. | 2503.10705 | null |
2025-03-08 | LimTopic: LLM-based Topic Modeling and Text Summarization for Analyzing Scientific Articles limitations | Ibrahim Al Azhar et.al. | 2503.10658 | link |
2025-03-14 | AI-assisted Early Detection of Pancreatic Ductal Adenocarcinoma on Contrast-enhanced CT | Han Liu et.al. | 2503.10068 | link |
2025-03-12 | SCOPE-DTI: Semi-Inductive Dataset Construction and Framework Optimization for Practical Usability Enhancement in Deep Learning-Based Drug Target Interaction Prediction | Yigang Chen et.al. | 2503.09251 | link |
2025-03-11 | CL-MVSNet: Unsupervised Multi-view Stereo with Dual-level Contrastive Learning | Kaiqiang Xiong et.al. | 2503.08219 | null |
2025-03-11 | Physics-based AI methodology for Material Parameter Extraction from Optical Data | M. Koumans et.al. | 2503.08183 | null |
2025-03-11 | Counterfactual Explanations for Model Ensembles Using Entropic Risk Measures | Erfaun Noorani et.al. | 2503.07934 | null |
2025-03-10 | Runtime Detection of Adversarial Attacks in AI Accelerators Using Performance Counters | Habibur Rahaman et.al. | 2503.07568 | null |
2025-03-11 | Mobility-Aware Decentralized Federated Learning with Joint Optimization of Local Iteration and Leader Selection for Vehicular Networks | Dongyu Chen et.al. | 2503.06443 | null |
2025-03-08 | MANDARIN: Mixture-of-Experts Framework for Dynamic Delirium and Coma Prediction in ICU Patients: Development and Validation of an Acute Brain Dysfunction Prediction Model | Miguel Contreras et.al. | 2503.06059 | null |
2025-03-08 | HealthiVert-GAN: A Novel Framework of Pseudo-Healthy Vertebral Image Synthesis for Interpretable Compression Fracture Grading | Qi Zhang et.al. | 2503.05990 | link |
2025-03-07 | Disentangling Task Interference within Neurons: Model Merging in Alignment with Neuronal Mechanisms | Zitao Fang et.al. | 2503.05320 | null |
2025-03-06 | Multiscale Analysis of Woven Composites Using Hierarchical Physically Recurrent Neural Networks | Ehsan Ghane et.al. | 2503.04901 | null |
2025-03-10 | PanguIR Technical Report for NTCIR-18 AEOLLM Task | Lang Mei et.al. | 2503.04809 | null |
2025-03-06 | Grid-Aware Islanding and Resynchronisation of AC/DC Microgrids | Willem Lambrichts et.al. | 2503.04597 | null |
2025-03-06 | Privacy Preserving and Robust Aggregation for Cross-Silo Federated Learning in Non-IID Settings | Marco Arazzi et.al. | 2503.04451 | null |
2025-03-06 | FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion | Ziyi Yang et.al. | 2503.04222 | link |
2025-03-06 | The Impact Analysis of Delays in Asynchronous Federated Learning with Data Heterogeneity for Edge Intelligence | Ziruo Hao et.al. | 2503.04052 | null |
2025-03-05 | A Survey of Foundation Models for Environmental Science | Runlong Yu et.al. | 2503.03142 | null |
2025-03-01 | Adaptive Entanglement Routing with Deep Q-Networks in Quantum Networks | Lamarana Jallow et.al. | 2503.02895 | null |
2025-03-04 | Resonance-Driven Mechanisms of Ion Transport and Selectivity | Ronald L. Westra et.al. | 2503.02617 | null |
2025-03-04 | Accelerating Vision-Language-Action Model Integrated with Action Chunking via Parallel Decoding | Wenxuan Song et.al. | 2503.02310 | null |
2025-03-04 | Hybrid Quantum Physics-informed Neural Network: Towards Efficient Learning of High-speed Flows | Fong Yew Leong et.al. | 2503.02202 | null |
2025-03-03 | HI-Series Algorithms A Hybrid of Substance Diffusion Algorithm and Collaborative Filtering | Yu Peng et.al. | 2503.01305 | null |
2025-03-03 | Multi-Level Collaboration in Model Merging | Qi Li et.al. | 2503.01268 | null |
2025-03-01 | A Stimulus-Response Model for Explaining When Students Decide to Engage in a Physics Task: The TSMS-Model | Eva Cauet et.al. | 2503.00490 | null |
2025-03-01 | LNUCB-TA: Linear-nonlinear Hybrid Bandit Learning with Temporal Attention | Hamed Khosravi et.al. | 2503.00387 | null |
2025-02-28 | Goldilocks and the bootstrap | David Berenstein et.al. | 2503.00104 | null |
2025-02-28 | Token-level Ensembling of Models with Different Vocabularies | Rachel Wicks et.al. | 2502.21265 | null |
2025-02-28 | Adaptive Illumination-Invariant Synergistic Feature Integration in a Stratified Granular Framework for Visible-Infrared Re-Identification | Yuheng Jia et.al. | 2502.21163 | null |
2025-02-28 | A Fused Gromov-Wasserstein Approach to Subgraph Contrastive Learning | Amadou S. Sangare et.al. | 2502.20885 | null |
2025-02-28 | Collective Reasoning Among LLMs A Framework for Answer Validation Without Ground Truth | Seyed Pouyan Mousavi Davoudi et.al. | 2502.20758 | null |
2025-02-26 | 3D Nephrographic Image Synthesis in CT Urography with the Diffusion Model and Swin Transformer | Hongkun Yu et.al. | 2502.19623 | null |
2025-02-26 | Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems | Hao Peng et.al. | 2502.19328 | link |
2025-02-26 | Dynamic Classification: Leveraging Self-Supervised Classification to Enhance Prediction Performance | Ziyuan Zhong et.al. | 2502.18891 | null |
2025-02-25 | When Benchmarks Talk: Re-Evaluating Code LLMs with Interactive Feedback | Jane Pan et.al. | 2502.18413 | link |
2025-02-25 | Uncertainty Modeling in Multimodal Speech Analysis Across the Psychosis Spectrum | Morteza Rohanian et.al. | 2502.18285 | null |
2025-02-22 | Recurrent Knowledge Identification and Fusion for Language Model Continual Learning | Yujie Feng et.al. | 2502.17510 | null |
2025-02-24 | Multi-modal and Metadata Capture Model for Micro Video Popularity Prediction | Jiacheng Lu et.al. | 2502.17038 | null |
2025-02-24 | FedBM: Stealing Knowledge from Pre-trained Language Models for Heterogeneous Federated Learning | Meilu Zhu et.al. | 2502.16832 | link |
2025-02-27 | A Survey of Graph Transformers: Architectures, Theories and Applications | Chaohao Yuan et.al. | 2502.16533 | null |
2025-02-22 | DUPRE: Data Utility Prediction for Efficient Data Valuation | Kieu Thao Nguyen Pham et.al. | 2502.16152 | null |
2025-02-26 | Explainable Artificial Intelligence Model for Evaluating Shear Strength Parameters of Municipal Solid Waste Across Diverse Compositional Profiles | Parichat Suknark et.al. | 2502.15827 | null |
2025-02-19 | Stock Price Prediction Using a Hybrid LSTM-GNN Model: Integrating Time-Series and Graph-Based Analysis | Meet Satishbhai Sonani et.al. | 2502.15813 | null |
2025-02-21 | Enhancing Vehicle Make and Model Recognition with 3D Attention Modules | Narges Semiromizadeh et.al. | 2502.15398 | null |
2025-02-21 | Graph-Based Deep Learning on Stereo EEG for Predicting Seizure Freedom in Epilepsy Patients | Artur Agaronyan et.al. | 2502.15198 | null |
2025-02-20 | Towards Physics-Guided Foundation Models | Majid Farhadloo et.al. | 2502.15013 | null |
2025-02-20 | A Collaborative Jade Recognition System for Mobile Devices Based on Lightweight and Large Models | Zhenyu Wang et.al. | 2502.14332 | null |
2025-02-20 | Bayesian Parameter Inference and Uncertainty Quantification for a Computational Pulmonary Hemodynamics Model Using Gaussian Processes | Amirreza Kachabi et.al. | 2502.14251 | null |
2025-02-20 | SleepGMUformer: A gated multimodal temporal neural network for sleep staging | Chenjun Zhao et.al. | 2502.14227 | null |
2025-02-20 | QUAD-LLM-MLTC: Large Language Models Ensemble Learning for Healthcare Text Multi-Label Classification | Hajar Sakai et.al. | 2502.14189 | null |
2025-02-19 | Explainable Distributed Constraint Optimization Problems | Ben Rachmut et.al. | 2502.14102 | null |
2025-02-18 | Transferable Machine Learning Potential X-MACE for Excited States using Integrated DeepSets | Rhyan Barrett et.al. | 2502.12870 | link |
2025-02-18 | IPSR Model: Misinformation Intervention through Prebunking in Social Networks | Robert Rai et.al. | 2502.12740 | null |
2025-02-19 | Linear Diffusion Networks: Harnessing Diffusion Processes for Global Interactions | Jacob Fein-Ashley et.al. | 2502.12381 | null |
2025-02-17 | A survey about perceptions of mobility to inform an agent-based simulator of subjective modal choice | Carole Adam et.al. | 2502.12058 | null |
2025-02-16 | Leveraging Multimodal-LLMs Assisted by Instance Segmentation for Intelligent Traffic Monitoring | Murat Arda Onsu et.al. | 2502.11304 | null |
2025-02-15 | MET-Bench: Multimodal Entity Tracking for Evaluating the Limitations of Vision-Language and Reasoning Models | Vanya Cohen et.al. | 2502.10886 | null |
2025-02-14 | Efficient Hierarchical Contrastive Self-supervising Learning for Time Series Classification via Importance-aware Resolution Selection | Kevin Garcia et.al. | 2502.10567 | link |
2025-02-14 | Reinforcement Learning in Strategy-Based and Atari Games: A Review of Google DeepMinds Innovations | Abdelrhman Shaheen et.al. | 2502.10303 | null |
2025-02-19 | MonoForce: Learnable Image-conditioned Physics Engine | Ruslan Agishev et.al. | 2502.10156 | link |
2025-02-13 | New Models of Jupiter’s Magnetopause and Bow Shock through the Juno Prime Mission: Probabilistic Location, Shape, and Internally-driven Variation | M. J. Rutala et.al. | 2502.09186 | link |
2025-02-11 | The establishment of static digital humans and the integration with spinal models | Fujiao Ju et.al. | 2502.07844 | null |
2025-02-11 | RoboBERT: An End-to-end Multimodal Robotic Manipulation Model | Sicheng Wang et.al. | 2502.07837 | link |
2025-02-11 | Integrating Physics and Data-Driven Approaches: An Explainable and Uncertainty-Aware Hybrid Model for Wind Turbine Power Prediction | Alfonso Gijón et.al. | 2502.07344 | link |
2025-02-07 | Model Fusion via Neuron Transplantation | Muhammed Öz et.al. | 2502.06849 | link |
2025-02-09 | Propagation of Chaos for Mean-Field Langevin Dynamics and its Application to Model Ensemble | Atsushi Nitanda et.al. | 2502.05784 | null |
2025-02-09 | Target Speaker Lipreading by Audio-Visual Self-Distillation Pretraining and Speaker Adaptation | Jing-Xuan Zhang et.al. | 2502.05758 | null |
2025-02-08 | Constitutive Kolmogorov-Arnold Networks (CKANs): Combining Accuracy and Interpretability in Data-Driven Material Modeling | Kian P. Abdolazizi et.al. | 2502.05682 | null |
2025-02-14 | Federated Learning with Reservoir State Analysis for Time Series Anomaly Detection | Keigo Nogami et.al. | 2502.05679 | link |
2025-02-08 | Analyzing public sentiment to gauge key stock events and determine volatility in conjunction with time and options premiums | SriVarsha Mulakala et.al. | 2502.05403 | null |
2025-02-04 | Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs | Hao Sun et.al. | 2502.04357 | null |
2025-02-06 | Advanced Object Detection and Pose Estimation with Hybrid Task Cascade and High-Resolution Networks | Yuhui Jin et.al. | 2502.03877 | null |
2025-02-04 | Development and validation of a high-fidelity full-spectrum Monte Carlo model for the Swiss airborne gamma-ray spectrometry system | David Breitenmoser et.al. | 2502.02102 | null |
2025-02-02 | Agent-Based Uncertainty Awareness Improves Automated Radiology Report Labeling with an Open-Source Large Language Model | Hadas Ben-Atya et.al. | 2502.01691 | null |
2025-02-01 | Speculative Ensemble: Fast Large Language Model Ensemble via Speculation | Jiale Fu et.al. | 2502.01662 | link |
2025-02-03 | Modeling of photonic integrated resonators using advanced scattering matrix methods | David J. Moss et.al. | 2502.01552 | null |
2025-02-03 | Navigating pollution: A multimodal approach to traffic and exposure management | Yueqi Liu et.al. | 2502.01324 | null |
2025-02-03 | A Single Model Ensemble Framework for Neural Machine Translation using Pivot Translation | Seokjin Oh et.al. | 2502.01182 | null |
2025-02-08 | ASAP: Aligning Simulation and Real-World Physics for Learning Agile Humanoid Whole-Body Skills | Tairan He et.al. | 2502.01143 | link |
2025-02-02 | Decision-informed Neural Networks with Large Language Model Integration for Portfolio Optimization | Yoontae Hwang et.al. | 2502.00828 | null |
2025-02-01 | Robust Knowledge Distillation in Federated Learning: Counteracting Backdoor Attacks | Ebtisaam Alharbi et.al. | 2502.00587 | link |
2025-02-01 | How Do Model Export Formats Impact the Development of ML-Enabled Systems? A Case Study on Model Integration | Shreyas Kumar Parida et.al. | 2502.00429 | null |
2025-02-01 | A Study on the Performance of U-Net Modifications in Retroperitoneal Tumor Segmentation | Moein Heidari et.al. | 2502.00314 | link |
2025-02-01 | Beyond the Permutation Symmetry of Transformers: The Role of Rotation for Model Fusion | Binchi Zhang et.al. | 2502.00264 | link |
2025-01-30 | Transfer Learning of Surrogate Models: Integrating Domain Warping and Affine Transformations | Shuaiqun Pan et.al. | 2501.18344 | null |
2025-01-14 | Application of Machine Learning Models for Carbon Monoxide and Nitrogen Oxides Emission Prediction in Gas Turbines | Kamyar Zeinalipour et.al. | 2501.17865 | null |
2025-02-01 | Lightweight Weighted Average Ensemble Model for Pneumonia Detection in Chest X-Ray Images | Suresh Babu Nettur et.al. | 2501.16249 | null |
2025-01-27 | Optimizing Sentence Embedding with Pseudo-Labeling and Model Ensembles: A Hierarchical Framework for Enhanced NLP Tasks | Ziwei Liu et.al. | 2501.15876 | null |
2025-01-26 | Refined climatologies of future precipitation over High Mountain Asia using probabilistic ensemble learning | Kenza Tazi et.al. | 2501.15690 | link |
2025-01-25 | Deep Multimodal Learning for Real-Time DDoS Attacks Detection in Internet of Vehicles | Mohamed Ababsa et.al. | 2501.15252 | link |
2025-01-24 | Additive Manufacturing Processes Protocol Prediction by Artificial Intelligence using X-ray Computed Tomography data | Sunita Khod et.al. | 2501.14306 | null |
2025-01-23 | Towards Real-World Validation of a Physics-Based Ship Motion Prediction Model | Michail Mathioudakis et.al. | 2501.13804 | null |
2025-01-22 | A Functional Software Reference Architecture for LLM-Integrated Systems | Alessio Bucaioni et.al. | 2501.12904 | null |
2025-01-22 | SoundSpring: Loss-Resilient Audio Transceiver with Dual-Functional Masked Language Modeling | Shengshi Yao et.al. | 2501.12696 | null |
2025-01-22 | Toward Model-centric Heterogeneous Federated Graph Learning: A Knowledge-driven Approach | Huilin Lai et.al. | 2501.12624 | null |
2025-01-21 | SVGS-DSGAT: An IoT-Enabled Innovation in Underwater Robotic Object Detection Technology | Dongli Wu et.al. | 2501.12169 | null |
2025-01-17 | Michscan: Black-Box Neural Network Integrity Checking at Runtime Through Power Analysis | Robi Paul et.al. | 2501.10174 | null |
2025-01-17 | Classifier Ensemble for Efficient Uncertainty Calibration of Deep Neural Networks for Image Classification | Michael Schulze et.al. | 2501.10089 | null |
2025-01-21 | Intra-day Solar and Power Forecast for Optimization of Intraday Market Participation | Nelson Salazar-Pena et.al. | 2501.09551 | null |
2025-01-15 | Beyond Speaker Identity: Text Guided Target Speech Extraction | Mingyue Huo et.al. | 2501.09169 | link |
2025-01-15 | A Semi-Parametric Bayesian Spatial Model for Rainfall Events in Geographically Complex Domains | Paolo Onorati et.al. | 2501.08748 | null |
2025-01-15 | Mitigating Domain Shift in Federated Learning via Intra- and Inter-Domain Prototypes | Huy Q. Le et.al. | 2501.08521 | null |
2025-01-13 | A Preliminary Survey of Semantic Descriptive Model for Images | Chengxi Yan et.al. | 2501.08352 | null |
2025-01-14 | Executable Multi-Layered Software | Lukas Radosky et.al. | 2501.08186 | link |
2025-01-13 | AdaCS: Adaptive Normalization for Enhanced Code-Switching ASR | The Chuong Chu et.al. | 2501.07102 | link |
2025-01-11 | CoreNet: Conflict Resolution Network for Point-Pixel Misalignment and Sub-Task Suppression of 3D LiDAR-Camera Object Detection | Yiheng Li et.al. | 2501.06550 | link |
2025-01-10 | MinMo: A Multimodal Large Language Model for Seamless Voice Interaction | Qian Chen et.al. | 2501.06282 | null |
2025-01-10 | An Attention-Guided Deep Learning Approach for Classifying 39 Skin Lesion Types | Sauda Adiv Hanum et.al. | 2501.05991 | link |
2025-01-10 | How to Enable Effective Cooperation Between Humans and NLP Models: A Survey of Principles, Formalizations, and Beyond | Chen Huang et.al. | 2501.05714 | null |
2025-01-09 | A Machine Learning Model for Crowd Density Classification in Hajj Video Frames | Afnan A. Shah et.al. | 2501.04911 | null |
2025-01-07 | Vision Language Models as Values Detectors | Giulio Antonio Abbo et.al. | 2501.03957 | null |
2025-01-08 | Rethinking Byzantine Robustness in Federated Recommendation from Sparse Aggregation Perspective | Zhongjian Zhang et.al. | 2501.03301 | link |
2025-01-04 | BADTV: Unveiling Backdoor Threats in Third-Party Task Vectors | Chia-Yi Hsu et.al. | 2501.02373 | null |
2025-01-10 | MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning | Pu Yang et.al. | 2501.01834 | null |
2025-01-02 | ROME: Robust Model Ensembling for Semantic Communication Against Semantic Jamming Attacks | Kequan Zhou et.al. | 2501.01172 | null |