This research proposes a neural network method incorporating topological data analysis to improve flood detection accuracy and interpretability. Using the SEN12-FLOOD dataset, topological features extracted from images are integrated into neural networks, demonstrating that topological descriptors carry meaningful flood signals independently and enhance the robustness and interpretability of existing networks.
Flood detection is critical for emergency response, but cloud cover often hinders optical satellite observations.
Existing deep models are black boxes lacking interpretability.
Modern machine learning systems have evolved into complex socio-technical architectures that actively mediate human opportunity. The field of algorithmic fairness addresses how models optimized for predictive accuracy can systematically disadvantage marginalized groups. This thesis (arXiv:2606.26200) identifies two fundamental limitations: reliance on deterministic point estimates for auditing and treatment of individuals as isolated entities devoid of structural context.
Machine learning systems now act as socio-technical mediators of opportunity, embedding structural inequalities.
Early fairness mitigation strategies rely on fragile simplifications that limit real-world effectiveness.
A new FHPLF model integrates hash learning with federated learning, using binary gradient matrices, projected Hamming distance, and a privacy-enhanced upload strategy to improve accuracy, efficiency, and privacy.
FHPLF replaces real-valued gradients with binary matrices to reduce communication and privacy risks.
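As a rough illustration of why binary gradient matrices shrink uploads, the sketch below binarizes a gradient by sign, bit-packs it, and compares two binarized gradients with a plain Hamming distance. The sign rule, the function names, and the toy gradients are assumptions for illustration only. The summary does not specify FHPLF's binarization or its projected Hamming distance, so neither is reproduced here.

```python
import numpy as np

def binarize(grad):
    """Map a real-valued gradient to a {0, 1} matrix by sign (assumed rule)."""
    return (np.asarray(grad) >= 0).astype(np.uint8)

def hamming_distance(a, b):
    """Count the positions where two equally shaped binary matrices differ."""
    if a.shape != b.shape:
        raise ValueError("shapes must match")
    return int(np.count_nonzero(a != b))

def upload_bytes(grad):
    """Return (float32 payload size, bit-packed binary payload size) in bytes."""
    grad = np.asarray(grad, dtype=np.float32)
    packed = np.packbits(binarize(grad))
    return grad.nbytes, packed.nbytes

# Toy gradients from two hypothetical clients.
g1 = np.array([[0.5, -1.2, 0.0, 3.1], [-0.1, 0.2, -0.3, 0.4]])
g2 = np.array([[0.4, 1.0, -0.2, 2.0], [-0.5, 0.1, 0.3, 0.2]])

dist = hamming_distance(binarize(g1), binarize(g2))
full_size, packed_size = upload_bytes(g1)
print(dist, full_size, packed_size)
```

In this toy case each float32 entry costs 32 bits while its binarized counterpart costs one, which is the kind of communication saving the summary attributes to binary matrices.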
This paper introduces Clue-Guided Group Discovery (CGGD), a paradigm that progressively recovers money laundering groups from initial clues through analyst interaction. The proposed Clue2Group framework constructs a compact local investigation context, uses a multi-semantic local-temporal GNN to estimate risk fields, and integrates evidence to recover criminal groups. Experiments on large-scale AML benchmarks demonstrate its practical potential for real-world investigations.
Proposes the CGGD paradigm, which mimics real AML investigation workflows.
The Clue2Group framework combines local context construction with a multi-semantic local-temporal GNN (LST-GNN).
This paper challenges the assumption that LLM-as-judge grading is deterministic once temperature is set to 0. It shows that verdict flips arise from two sources: the default temperature, when it is left unchanged, and residual non-reproducibility that persists even under greedy decoding.
The assumption that temperature=0 makes grading deterministic is false.
Default temperature of 1.0 leads to pass/fail flips for borderline items.
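A toy model can show why sampling at temperature 1.0 flips borderline verdicts. The sketch below treats a judge's verdict as a two-way softmax over a pass logit and a fail logit, then computes how often two independent gradings disagree. The logit values and function names are hypothetical, and the model covers only sampling randomness. It does not capture the residual non-reproducibility under greedy decoding that the paper reports.

```python
import math

def pass_probability(logit_pass, logit_fail, temperature):
    """Probability of a 'pass' verdict under a two-way softmax at a given temperature."""
    if temperature == 0:
        # Greedy decoding: always take the larger logit (ties resolved as 'fail' here).
        return 1.0 if logit_pass > logit_fail else 0.0
    z = (logit_pass - logit_fail) / temperature
    return 1.0 / (1.0 + math.exp(-z))

def flip_probability(p):
    """Chance that two independent gradings of the same item disagree."""
    return 2.0 * p * (1.0 - p)

# A borderline item: the judge barely prefers 'pass' (hypothetical logits).
borderline = (0.2, 0.0)
flip_greedy = flip_probability(pass_probability(*borderline, temperature=0))
flip_default = flip_probability(pass_probability(*borderline, temperature=1.0))
print(flip_greedy, round(flip_default, 3))
```

Under these assumptions the idealized greedy judge never flips, while the temperature-1.0 judge disagrees with itself on nearly half of repeated gradings of the borderline item.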
KG-TRACE is a novel neuro-symbolic framework that integrates the WHO mutation knowledge graph as a structured biological constraint into a neural genomic model, dynamically weighting neural evidence against symbolic biological knowledge via a learned epistemic trust gate. Evaluated on the CRyPTIC M. tuberculosis cohort, it achieves an AUROC of 0.9760 for isoniazid and introduces the Biological Grounding Ratio (BGR) metric to quantify alignment between neural attributions and established biology.
KG-TRACE integrates the WHO mutation knowledge graph as a structural constraint into a neural model for neuro-symbolic fusion.
Achieves 0.9760 AUROC for isoniazid on the CRyPTIC M. tuberculosis cohort, with its primary value lying in symbolic grounding rather than predictive uplift.
This paper provides a comprehensive review of Neural Architecture Search (NAS) methods applied to Generative Adversarial Networks (GANs), comparing search strategies, evaluation metrics, and performance outcomes. It highlights the superiority of evolutionary and gradient-based methods in certain contexts, the need for robust evaluation metrics beyond IS and FID, and the importance of diverse datasets.
NAS automates GAN architecture optimization, improving performance, stability, and efficiency.
Evolutionary algorithms and gradient-based methods outperform others in specific contexts.
This study reframes phototaxis in unicellular algae as an information-driven sensorimotor process, linking a POMDP with biochemical reaction dynamics via CRN-ODEs. Using inverse reinforcement learning on 30 Chlamydomonas trajectories, it infers behavioral objectives and shows that run–tumble alternation emerges as an information-acquisition strategy, demonstrating how intracellular biochemical networks support adaptive information-seeking behavior.
Reframes phototaxis as curiosity-driven exploration rather than mere stimulus-response.
Builds a framework connecting POMDP with biochemical reaction dynamics using CRN-ODEs.
A GPU-native population optimizer, χ-sao (Convergence-Halt-Invert-Stick-And-Oscillate), exploits a deliberate convergence-anticonvergence oscillation cycle to escape local traps while freezing confirmed modes. On all 42 functions of the Simon Fraser University optimization benchmark suite across dimensions d ∈ {2,4,8,16,32,64}, χ-sao achieves 100% mode recovery where all CPU baselines collapse at d ≥ 8 on the hardest multimodal functions, with speedups up to 34× over basin-hopping on Michalewicz d=64 and up to 39× on Rotated Hyper-Ellipsoid d=64. Under substantial likelihood noise (σ_noise up to 1.0), mode detection remains 100% reliable. The algorithm is available as an open-source Python package on PyPI.
χ-sao is the first GPU-native parallel optimizer using convergence-anticonvergence oscillation to run an entire sample batch simultaneously.
It achieves 100% mode recovery on all 42 SFU benchmark functions where CPU baselines fail for d ≥ 8 on hard multimodal functions.
Researchers propose an attention-based, physics-guided convolutional neural network as a surrogate model to predict microstructural evolution in systems governed by the Cahn-Hilliard equation. The model accurately forecasts phase separation in binary mixtures over long times, preserves composition, and aligns with the Lifshitz-Slyozov domain-growth law.
Proposes a physics-guided CNN for predicting phase separation dynamics.
The model remains stable and accurate over long-time rollouts.
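For reference, the physics the surrogate is expected to respect can be written compactly. The block below gives the standard textbook form of the Cahn-Hilliard equation, the composition-conservation property, and the Lifshitz-Slyozov growth law. The notation is conventional rather than taken from the paper: c is the composition field, M the mobility, μ the chemical potential, f the bulk free-energy density, κ the gradient-energy coefficient, and L(t) the characteristic domain size. Conservation is stated for periodic or no-flux boundaries.

```latex
\begin{align}
\frac{\partial c}{\partial t} &= \nabla \cdot \bigl( M \, \nabla \mu \bigr), &
\mu &= f'(c) - \kappa \, \nabla^{2} c, \\
\frac{d}{dt} \int_{\Omega} c \, d\mathbf{x} &= 0, &
L(t) &\sim t^{1/3}.
\end{align}
```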
MacroLens is a new multi-task benchmark covering 4,416 U.S. small- and micro-cap equities over 2021-2026, integrating prices, accounting data, macroeconomic series, SEC filings, and news. It addresses four key assumption violations in financial time-series evaluation, includes seven tasks and 1,130 macroeconomic events, and evaluates 19 methods with a five-step feature-context ablation. The benchmark is publicly available on Hugging Face.
The first public benchmark to jointly handle prices, fundamentals, macro, and text signals.
Covers 4,416 U.S. small- and micro-cap equities with 46.8M XBRL facts, 53 macro series, and 215,882 news articles.
A study finds that holographic memory models fail at zero-shot compositional queries in knowledge graphs due to capacity and interference effects, not the binding algebra.
Holographic Reduced Representations (HRR) and Fourier HRR (FHRR) are competitive at single-hop retrieval but fail at zero-shot composition.
The failure is mechanistically localized: even with correct intermediate entities, composition fails because facts in compositional chains are intrinsically harder to retrieve.
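To make the capacity and interference argument concrete, here is a minimal sketch of classic HRR binding (circular convolution) and approximate unbinding (circular correlation), with retrieval quality measured as more facts are superposed in one memory trace. The dimension, seed, and random key-value facts are illustrative assumptions, not the study's setup. FHRR and multi-hop composition are not covered.

```python
import numpy as np

def bind(a, b):
    """HRR binding: circular convolution, computed in the Fourier domain."""
    n = a.shape[-1]
    return np.fft.irfft(np.fft.rfft(a) * np.fft.rfft(b), n=n)

def unbind(trace, cue):
    """Approximate unbinding: circular correlation of the trace with the cue."""
    n = trace.shape[-1]
    return np.fft.irfft(np.fft.rfft(trace) * np.conj(np.fft.rfft(cue)), n=n)

def cosine(u, v):
    """Cosine similarity between two vectors."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

def retrieval_similarity(n_facts, dim=1024, seed=0):
    """Store n_facts key-value bindings in one trace and decode the first value."""
    rng = np.random.default_rng(seed)
    keys = rng.normal(0.0, 1.0 / np.sqrt(dim), size=(n_facts, dim))
    values = rng.normal(0.0, 1.0 / np.sqrt(dim), size=(n_facts, dim))
    memory = sum(bind(k, v) for k, v in zip(keys, values))
    decoded = unbind(memory, keys[0])
    return cosine(decoded, values[0])

sim_few = retrieval_similarity(2)
sim_many = retrieval_similarity(64)
print(round(sim_few, 3), round(sim_many, 3))
```

The decoded vector stays close to the stored value when the trace holds few facts and drifts toward noise as the trace fills up, which is the interference effect the summary refers to.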
A new paper proposes a Supervised Reinforcement Learning (SRL) framework for coordinating distributed energy resources (DERs). The approach pre-trains policies on demonstration data, then fine-tunes with offline and online RL, outperforming benchmarks even with low-quality data.
SRL framework combines supervised pre-training on demonstration data with reinforcement learning fine-tuning.
Two-step fine-tuning: offline for performance, then online for real-world adaptation.
Learned world models are useful only over horizons on which their rollout error remains controlled. This paper studies trust-horizon certification for latent world models with known group symmetries. Using split-conformal calibration, the authors show that exact equivariance transports the calibrated trust-horizon curve over the group orbit, making rollout errors and trust horizons orbit-constant. Experiments on 2D and 3D tasks demonstrate that equivariant models achieve safe and non-vacuous orbit-valid certificates from a single calibration sector, while non-equivariant baselines incur additional costs. The certificate is a conservative distributional audit, not a global reachability guarantee.
Proposes a trust-horizon certification method based on split-conformal prediction for equivariant latent world models.
Key theoretical result: exact equivariance transports calibrated trust-horizon curves along group orbits.
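A generic split-conformal construction gives a feel for what a calibrated trust horizon could look like. The sketch below computes a per-step conformal upper bound on rollout error from held-out calibration rollouts and reports the longest horizon over which that bound stays within a tolerance. The tolerance, the toy error table, and the function names are assumptions. This is not the authors' exact procedure, and the orbit-transport result for equivariant models is not modeled.

```python
import math
import numpy as np

def conformal_quantile(scores, alpha):
    """Split-conformal bound: the ceil((n + 1)(1 - alpha))-th smallest score."""
    scores = np.sort(np.asarray(scores, dtype=float))
    n = scores.size
    rank = math.ceil((n + 1) * (1 - alpha))
    if rank > n:
        return math.inf  # too few calibration samples for this alpha
    return float(scores[rank - 1])

def trust_horizon(cal_errors, alpha, tolerance):
    """Largest h such that the calibrated bound is within tolerance at every step up to h."""
    horizon = 0
    for step in range(cal_errors.shape[1]):
        if conformal_quantile(cal_errors[:, step], alpha) > tolerance:
            break
        horizon = step + 1
    return horizon

# Toy calibration table: 19 rollouts x 5 steps, with error growing along the rollout.
cal_errors = np.array(
    [[(i + 1) * 0.01 * (h + 1) for h in range(5)] for i in range(19)]
)
horizon = trust_horizon(cal_errors, alpha=0.1, tolerance=0.5)
print(horizon)
```

Like the certificate described in the summary, a bound of this kind is a distributional statement about rollouts drawn like the calibration set, not a worst-case reachability guarantee.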
This research explores when conservation laws remain certifiable after a physical world model learns a latent representation. The authors introduce 'certified horizons' that bound how many rollout steps provably stay on an invariant's level set. Instead of certifying learned latent Hamiltonians, they certify decoded physical invariants. The framework decomposes certification budgets into representation, readout, and latent-dynamics defects, using a monotone alignment bridge for soft witnesses. Results show hard symplectic structure provides long horizons in known coordinates but fails across learned charts, while controlled-Lipschitz soft invariants survive representation learning. Pixel certification is recovered on readout-stable sub-tubes.
Introduces 'certified horizons' for guaranteeing conservation law preservation in latent world models.
Certifies decoded physical invariants rather than learned latent quantities.
This paper introduces a saturation index to determine when to stop collecting labeled data in binary few-shot classification. Computed from support features alone, it measures the effective rank of the within-class covariance relative to shot count. Empirical evaluation on 246 observations from 17 tasks shows strong correlation with marginal accuracy gain (median Spearman ρ=0.811) and identifies three phases: exploration, transition, saturation. As a stopping rule, it achieves AUC 0.752. A low saturation index with low accuracy indicates representational inadequacy.
The saturation index S(K) = erank(Σ_W^(K))/K falls below the threshold when the covariance estimator has concentrated.
Correlation with marginal accuracy gain is positive in 16 of 17 tasks, with a median Spearman ρ=0.811.
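The formula S(K) = erank(Σ_W^(K))/K can be sketched directly from support features. The code below assumes that erank is the entropy-based effective rank (the exponential of the Shannon entropy of the normalized eigenvalues) and that Σ_W is the pooled within-class covariance. The summary does not confirm either definition, and the Gaussian toy features are purely illustrative.

```python
import numpy as np

def effective_rank(cov, eps=1e-12):
    """Entropy-based effective rank: exp of the entropy of normalized eigenvalues."""
    eigvals = np.clip(np.linalg.eigvalsh(cov), 0.0, None)
    total = eigvals.sum()
    if total <= eps:
        return 0.0
    p = eigvals / total
    p = p[p > eps]
    return float(np.exp(-(p * np.log(p)).sum()))

def within_class_covariance(features, labels):
    """Pooled covariance of features after subtracting each class mean."""
    features = np.asarray(features, dtype=float)
    labels = np.asarray(labels)
    centered = []
    for c in np.unique(labels):
        block = features[labels == c]
        centered.append(block - block.mean(axis=0))
    centered = np.concatenate(centered)
    return centered.T @ centered / len(features)

def saturation_index(features, labels, shots):
    """S(K) = erank(within-class covariance) / K, with K shots per class."""
    return effective_rank(within_class_covariance(features, labels)) / shots

rng = np.random.default_rng(0)

def toy_support(shots, dim=8):
    """Two-class support set with isotropic Gaussian features (illustrative only)."""
    return rng.normal(size=(2 * shots, dim)), np.repeat([0, 1], shots)

s_small = saturation_index(*toy_support(5), shots=5)
s_large = saturation_index(*toy_support(50), shots=50)
print(round(s_small, 3), round(s_large, 3))
```

Because the effective rank is capped by the feature dimension, the index shrinks as K grows once the covariance estimate stops gaining rank, which matches the saturation behavior the summary describes.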
This survey reformulates industrial continual learning for LLMs as a closed-loop update-and-release problem in a versioned ecosystem. It identifies three core challenges (plasticity erosion, capability inheritance breakage, sustainability constraints) and proposes five lifecycle design principles. The paper evaluates maturity of each principle and outlines a deployment blueprint.
Industrial LLMs require continuous updates, not retraining from scratch.
The survey proposes a versioned ecosystem view for continual learning.
This paper proposes a lightweight neural architecture search performed directly on deployment devices for near-sensor computing, enabling adaptation to individual users in human-machine interfaces. Validated on Italian Sign Language and CWRU datasets, the method reduces RAM usage by 0.44–0.63× and improves accuracy by 0.2–5.96 percentage points on a Raspberry Pi 4.
New approach performs NAS directly on edge devices for near-sensor computing.
Adapts to individual differences by redesigning networks for each user.
A new paper presents a case study of human-AI collaboration transforming a vague research intuition into concrete mathematical discoveries, specifically sign-embedding quantum algorithms for matrix equations and functions. The AI system AIM played a key role in expanding the intuition, comparing candidate formulations, and connecting known identities, while humans retained final scientific judgments such as selecting routes, rejecting invalid approximations, and refining implementations. The authors argue that in human-AI co-discovery workflows, AI systems are most valuable as research partners, not standalone theorem provers.
Human intuition that rational approximation is effective for sign functions led to sign-embedding quantum algorithms, aided by AI exploration.
AIM system helped connect matrix-sign identities to broader classes of matrix equations and drafted proofs.
A new method called Degeneracy Distillery automatically detects and resolves degenerate parameter combinations using Fisher information estimation and flattening, without requiring real data observations. It reduces simulation budgets for neural posterior estimation by up to 10x.
Proposes Degeneracy Distillery to automatically detect and resolve degenerate parameter combinations from parameter-data pairs.
Uses estimation and flattening of the Fisher information matrix, exploring information geometry of the likelihood.
A deep learning approach using a multivariate time series graph neural network (MTGNN) reconstructs GRACE-like terrestrial water storage anomalies back to 1940 by learning from ERA5 meteorological data. The model achieves a basin-mean correlation of 0.94 and reproduces El Niño/La Niña events, using fewer predictors than existing methods.
MTGNN is adapted from urban traffic forecasting to satellite geodesy, extending TWS records back to 1940.
Hybrid adjacency matrix encodes geodesic proximity and lagged climate correlations.
Molecular surfaces encode the geometric and physicochemical patterns that determine antibody-antigen recognition, central to epitope prediction. However, existing methods rely on sequences or backbone structures and struggle to capture discontinuous, surface-driven epitopes. This study presents SurfBind, a surface-centric learning framework for epitope prediction that operates directly on molecular surface representations. SurfBind integrates geometric and physicochemical cues through a Transformer-based architecture with patch-level surface modeling, binder-aware cross-attention, and a hierarchical coarse-to-fine prediction paradigm. Experiments on challenging epitope identification benchmarks, including SAbDab and DB5.5, demonstrate that SurfBind achieves state-of-the-art performance and strong generalization across unseen antibodies and conformational states, highlighting the value of interaction-aware surface modeling for understanding the crucial mechanisms of protein-protein interactions.
Existing methods for epitope prediction rely on sequences or backbone structures and struggle with discontinuous epitopes.
SurfBind directly uses molecular surface representations with a Transformer-based architecture.
A new study conducts a uniform re-evaluation of causal direction methods on the Tuebingen dataset, introducing a parameter-free compression baseline that achieves 74.7% accuracy and ties with top methods, revealing mechanisms that inflate published figures.
Researchers re-evaluated multiple causal inference methods on the Tuebingen cause-effect pairs using an identical protocol.
The newly introduced sorted-conditional compression baseline, which has no tuned parameters, achieves 74.7% weighted accuracy.
This paper proposes a novel meta-learning strategy called MEDIC that considers implicit gradient matching for both inter-domain and inter-class task splits simultaneously. It addresses the imbalance issue in one-vs-all classifiers for open set domain generalization, achieving better decision boundaries and outperforming prior methods while maintaining competitive closed set generalization.
Open set domain generalization aims to recognize unseen classes in unseen domains, but naive one-vs-all classifiers suffer from sample imbalance.
MEDIC uses dualistic meta-learning with joint domain-class matching to find balanced boundaries.
A gray-box workflow, PC-MCMC-CIGP, integrates spike-and-slab topology sampling, hard conservation and thermodynamic screening, and a Chemical-Informed Gaussian Process (CIGP) residual model for extracting interpretable governing equations from sparse noisy chemical time-series data. On the H2+Br2 benchmark, it distinguishes elementary radical pathways; on styrene epoxidation, it improves yield by 12.5% over the baseline. A 10-seed acquisition study reveals trade-offs: PC-EI reduces low-yield BO suggestions, while EI-style criteria yield the strongest final performance.
PC-MCMC-CIGP combines physically constrained MCMC with chemical-informed GPs for reaction network discovery.
Successfully distinguished elementary radical pathways in H2+Br2 benchmark.
A new approach to physical neural networks places trainable nonlinear functions on connections rather than using scalar weights, achieving low-power continuous control tasks with far fewer nodes. The design, implemented on analogue arrays, shows task-dependent benefits and projects to 30 microwatts in CMOS.
Inspired by Kolmogorov-Arnold networks, trainable nonlinear functions are placed on connections.
The networks excel on smooth continuous targets like robotics and PV tracking, but not on classification boundaries.
This paper presents a comprehensive survey on federated causal discovery and inference, proposing multi-dimensional taxonomies and highlighting the integration of causal structure learning and effect estimation in a unified pipeline, addressing challenges like privacy and data heterogeneity.
The paper systematically reviews federated causal discovery (FCD) and inference (FCI) using multi-dimensional taxonomies.
It organizes FCD along three axes: methodological paradigm, federation topology, and structural scope.
This paper investigates whether six offline RL losses (SFT, RFT, DFT, RIFT, Offline GRPO, DPO) are mechanistically distinct in weight-space geometry when used for reasoning distillation. Using identical math rollouts from Qwen3-4B, the authors find that SFT, RFT, and RIFT have nearly collinear deltas; DFT diverges; Offline GRPO adds orthogonal components; and DPO lies in a near-orthogonal subspace, with the highest accuracy but a mode-connectivity barrier.
SFT, RFT, and RIFT have cosine similarity >= 0.97 and comparable GSM8K accuracy (~87-88%).
DFT's update direction diverges more than any reward-weighted method.
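The weight-space comparison rests on cosine similarity between fine-tuning deltas. A minimal sketch, assuming each checkpoint is a dictionary of NumPy arrays keyed by parameter name, flattens the difference from the base weights and compares directions. The tiny parameter dictionaries and the names used here are hypothetical stand-ins, not the paper's checkpoints or code.

```python
import numpy as np

def flatten_delta(base, tuned):
    """Concatenate per-tensor differences (tuned - base) into one vector."""
    return np.concatenate(
        [(tuned[name] - base[name]).ravel() for name in sorted(base)]
    )

def cosine_similarity(u, v):
    """Cosine of the angle between two update vectors (0.0 if either is zero)."""
    denom = np.linalg.norm(u) * np.linalg.norm(v)
    return float(np.dot(u, v) / denom) if denom > 0 else 0.0

# Hypothetical base model and three fine-tuned variants.
base = {"w": np.zeros((2, 2)), "b": np.zeros(2)}
direction = {"w": np.array([[1.0, 0.0], [0.0, 1.0]]), "b": np.array([1.0, 0.0])}
run_a = {k: base[k] + direction[k] for k in base}        # update along `direction`
run_b = {k: base[k] + 2.0 * direction[k] for k in base}  # same direction, larger step
run_c = {"w": np.array([[0.0, 1.0], [1.0, 0.0]]), "b": np.array([0.0, 1.0])}  # orthogonal

delta_a = flatten_delta(base, run_a)
delta_b = flatten_delta(base, run_b)
delta_c = flatten_delta(base, run_c)
print(cosine_similarity(delta_a, delta_b), cosine_similarity(delta_a, delta_c))
```

A cosine near 1 means two losses move the weights along the same direction regardless of step size, which is the sense in which deltas are called nearly collinear. A cosine near 0 corresponds to a near-orthogonal subspace.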
This paper presents an automated large-scale search pipeline for heterogeneous 4-Expert Mixture-of-Experts (MoE4) architectures within the LEMUR neural network dataset ecosystem. Over 28 days on an NVIDIA RTX 4090, the pipeline generated 4,463 candidate models and evaluated 1,021. A critical coverage bias was discovered: due to alphabetical enumeration, the search space was anchored to the AirNet family. Within this scope, ShuffleNet and MobileNetV3 ensembles achieved highest average accuracy of 0.632, while FractalNet and MNASNet were identified as low-yield families.
Automated pipeline generated 4,463 MoE4 candidates and evaluated 1,021.
Coverage bias found due to alphabetical enumeration anchoring to AirNet family.
Ventricular tachycardia (VT) is a life-threatening arrhythmia, and pace-mapping is used to identify ablation targets. cAPM uses continual learning to transfer knowledge across VTs, reducing the number of pacing sites needed. In silico, it achieved 81% localization accuracy with 4.5 pacing sites, versus 38% with 13.7 sites for prior methods.
cAPM is a new AI system for pace-mapping that learns continuously across multiple ventricular tachycardia targets.
It uses a task-agnostic surrogate neural network and active learning to select the most informative pacing sites.