Accepted Research Papers for ICDE 2026
-
01Querying Historical k-Dense Subgraphs On Temporal graphs
-
02FaScalSQL: A Fast and Scalable GPU-Accelerated SQL Query Engine for Out-of-Memory Tables
-
03GeminiSketch: An Accurate and Efficient Sketch for Summarizing Temporal Graph Streams with Rolling-out Elimination
-
04Fast Discovery of Functional Dependencies via Bayesian Network Learning
-
05Approximate Butterfly Counting in Sublinear Time
-
06Exqutor: Extended Query Optimizer for Vector-augmented Analytical Queries
-
07RoarChain: A Robust Sharding Blockchain System for Enterprise Consortium
-
08Towards the Distributed Large-scale k-NN Graph Construction by Graph Merge
-
09CSD-CoKV: Host-CSD Collaborative Offloading for High-Performance LSM-tree based KV Stores
-
10On Graph Rewiring with Motifs: a Find-and-Replace Approach
-
11Conflict Resolution for Improving ML Accuracy
-
12MVGPT: Generative Materialized View Forecasting
-
13Accurate Table Question Answering with Accessible LLMs
-
14AutoHFormer: Efficient Hierarchical Autoregressive Transformer for Time Series Prediction
-
15CausalPre: Scalable and Effective Data Pre-processing for Causal Fairness
-
16Evolving Sketch: Time-Decaying Frequency Estimation for Evolving Streams
-
17Contextual Pattern Mining and Counting
-
18Compressing High-Frequency Time Series Through Multiple Models and Stealing from Residuals
-
19A Unified Framework for Compressed and Encrypted Text Direct Processing
-
20Spatiotemporal Sketch Disaggregation: Streaming Analytics with Heterogeneous Resources
-
21L4G: Two-hop Label Management for Group Steiner Tree Search on Graphs
-
22Revisiting Single-Table Retrieval: An Open Problem Under 360° Stress Tests [Experiment, Analysis, and Benchmark]
-
23MICRO: A Lightweight Middleware for Optimizing Cross-store Cross-model Graph-Relation Joins
-
24SwitchDelta: Asynchronous Metadata Updating for Distributed Storage with In-Network Data Visibility
-
25DIndex: An Efficient On-Disk Learned Index for Memory-Constrained Environments
-
26Federated Retrieval over Embedding-Heterogeneous Vector Databases
-
27LogDelta: Differential Encoding for Log Data
-
28Secure Query Processing with Linear Online Cost
-
29VisPoison: An Effective Backdoor Attack Framework for Tabular Data Visualization Models
-
30PRO-HNSW: Proactive Repair and Optimization for High-Performance Dynamic HNSW Indexes
-
31Promoting Fairness in Information Access within Social Networks
-
32Keyword-Aware Skyline Community Search on Semantics and Structure
-
33BS-tree: A gapped data-parallel B-tree
-
34Zero-Knowledge Verifiable Graph Query Evaluation via Expansion-Centric Operator Decomposition
-
35cuFHEDB: GPU-Accelerated Fully Homomorphic Encryption Database
-
36Contemp: Instance Caching Based on Container Temperature in Serverless Environment
-
37TopFGL: A Topology-Aware and Distribution-Agnostic Federated Learning Framework Tackling Topological Heterogeneity on Graph Data
-
38OsmT: Bridging OpenStreetMap Queries and Natural Language with Open-source Tag-aware Language Models
-
39Knapsack Optimization-based Schema Linking for LLM-based Text-to-SQL Generation
-
40FLASH Viterbi: Fast and Adaptive Viterbi Decoding for Modern Data Systems
-
41Sorting Compressed Time Series
-
42Reverse k Nearest Neighbor Query in Large Road Networks: A Tree Decomposition based Approach
-
43Benchmarking RL-Enhanced Spatial Indices Against Traditional, Advanced, and Learned Counterparts [Experiment, Analysis, and Benchmark]
-
44PROGQL: A Provenance Graph Query System for Cyber Attack Investigation
-
45Efficient Hypergraph Pattern Matching via Match-and-Filter and Intersection Constraint
-
46Trading Vector Data in Vector Databases
-
47Updatable Balanced Index for Fast On-device Search with Auto-selection Model
-
48Banknote-Chain: Achieving User-Incentivized Parallelism in Blockchain via a Banknote-Inspired Transaction Model
-
49Efficient Zero-shot and Label-free Log Anomaly Detection for Resource-constrained Systems
-
50Energy-Efficient Autonomous Driving with Adaptive Perception and Robust Decision
-
51Mitigating Dual Load Imbalance via Dynamic Cooperative Scheduling in Distributed Key-Value Stores
-
52MOCHI: Motif-based Community Search over Large Heterogeneous Information Networks
-
53Reconfiguring Scalable Hashing with Persistent CPU Caches
-
54TemplateQO: Template-aware and Scalable Query Optimization with Data-efficient Learning
-
55Query-Driven Data Exploration with Heterogeneous Treatment Effects
-
56SOLAR: Scalable Distributed Spatial Joins through Learning-based Optimization
-
57Interpreting Graph Inference with Skyline Explanations
-
58Chase Anonymisation: Privacy-Preserving Knowledge Graphs with Logical Reasoning
-
59SSC-Join: an Efficient Syntactic-Semantic Collaboration based Set Semantic Similarity Join Algorithm
-
60Deferred Flushing for Out-of-Order Arrivals in Apache IoTDB
-
61Efficient Traffic Forecasting on Large-Scale Road Network by Regularized Adaptive Graph Convolution
-
62SaSPartitioner: A Self-adaptive Streaming Partitioner using Deep Reinforcement Learning
-
63AdaFedRec: Adaptive Heterogeneous Federated Recommender Systems across Multi-Device Users
-
64MatKV: Trading Compute for Flash Storage in LLM Inference
-
65LUCID: an Updatable and Concurrent Learned Index for Larger-than-Memory Data Management
-
66CoLSE: A Lightweight and Robust Hybrid Learned Model for Single-Table Cardinality Estimation using Joint CDF
-
67An Encode-then-Decompose Approach to Unsupervised Time Series Anomaly Detection on Contaminated Training Data
-
68GRACE: Alleviating Reconstruction Cost in Dynamic Graph Processing Systems
-
69High-Fidelity Task Assignment in Spatial Crowdsourcing via Implicit Human Feedback
-
70LAMP: A Dual-Mode Framework for Database Workload Memory Prediction
-
71Efficient Model-Agnostic Continual Learning for Next POI Recommendation
-
72GeoLayer: Towards Low-Latency and Cost-Efficient Geo-Distributed Graph Stores with Layered Graph
-
73Robust Index Benefit Estimation via Hierarchical and Two-dimensional Feature Representation
-
74MTC: Scalable Transaction Commit for Multi-Master Cloud Databases
-
75EC-RAG: Towards Efficient Edge-Cloud Retrieval-Augmented Generation Systems
-
76TabLoft: Tabular Data Generation Based on LLM with Ordered Features
-
77CYANSQL: Unlock the Power of NL2SQL via Clustering-based Test-Time Scaling
-
78Effective Fairest Community Search over Heterogeneous Information Networks
-
79IIT-Tree: An efficient index to support interval-based query on large temporal graphs
-
80PORCA: Root Cause Analysis with Partially Observed Data
-
81Efficient and Scalable Search for Statistics
-
82Reconstructing TensorLog for Scalable End-to-end Rule Learning
-
83HL-index: Fast Reachability Query in Hypergraphs
-
84Astraea: Efficient Pipelined Micro-batch Stream Processing with Non-hash Differentiated Partitioning
-
85Trajectory–User Linking via Heterogeneous Preference Graph and Dual-Encoder Mutual Distillation
-
86PC-PS: A Multi-Dimensional Point-Cloud Data Publish/Subscribe System
-
87TransLGX: A Self-contained Model to Predict the Entire Lifecycle and complete state of Logistics Package Trajectories
-
88Community-level Personalized Recommendation by Exploiting Evolving User-Item Micro-clusters
-
89SHMemora: Protective Key-Value Store on Distributed Shared Memory
-
90More Than Pivot for Maximal Clique Enumeration
-
91ProvSQL: A General System for Keeping Track of the Provenance and Probability of Data
-
92Data Guard: A Fine-grained Purpose-based Access Control System for Large Data Warehouses
-
93Fine-grained Manipulation Attacks to Local Differential Privacy Protocols for Range Query
-
94RL-Paxos: Relieving the Leader's Burden with Efficient Task Offloading in Distributed Consensus
-
95GPU-Accelerated OLTP: An In-Depth Analysis of Concurrency Control Schemes [Experiment, Analysis, and Benchmark]
-
96Semantic Publish/Subscribe over Evolving Topics
-
97FusAD: Time-Frequency Fusion with Adaptive Denoising for General Time Series Analysis
-
98Damba-ST: Domain-Adaptive Mamba for Efficient Urban Spatio-Temporal Prediction
-
99Beyond Traditional Diagnostics: Transforming Patient-Side Information into Predictive Insights with Knowledge Graphs and Prototypes
-
100SSFusion: Tensor Fusion with Selective Sparsification for Efficient Distributed DNN Training
-
101MojoFrame: Dataframe Library in Mojo Language
-
102Text2VectorSQL: Towards a Unified Interface for Vector Search and SQL Queries
-
103AGRAG: Advanced Graph-based Retrieval-Augmented Generation for LLMs
-
104Rethinking Flexible Graph Similarity Computation: One-step Alignment with Global Guidance
-
105Beyond Homophily: Community Search on Heterophilic Graphs
-
106Efficient Cloud-edge Collaborative Approaches to SPARQL Queries over Large RDF graphs
-
107Generalizable Address-aware Semantic Prefetching for Scalable Transactional and Analytical Workloads
-
108AlignSketch: A Framework for Aligning Theoretical and Practical Estimation Errors
-
109FedCurrMM: A Federated Map Matching Framework with Curriculum-aware Client Selection
-
110Enabling Homomorphic Analytical Operations on Compressed Scientific Data with Multi-stage Decompression
-
111DNA: A Distribution-and-Aggregation Solution for Spatiotemporal K-function-based Analysis
-
112Chubby: Robust Smart Contract Execution Against Dependency Over-declaration
-
113PLAN: Fast and Approximate Gaussian Kernel Density Visualization in Road Networks
-
114MINOR: Multivariate Time Series Iterative Cleaning Algorithm
-
115From Single to Multiple Attributes: Experimental Insights on Sampling-Based Distinct Combination Estimation in GROUP-BY Queries [Experiment, Analysis, and Benchmark]
-
116Explaining GNN Negatives Globally and Locally
-
117Scaling Subsequence Similarity Join Based on Dynamic Time Warping
-
118Overcoming the Sync-Compute Dilemma in Parallel Graph-Based Vector Retrieval
-
119\textsf{PROCore}: Robust Core-set Selection via Pareto Multi-dimensional Optimization from Noisy Data
-
120NebulaStream: An Adaptive and Efficient Multi-query Stream Processing Engine
-
121An LLM-Guided Query-Aware Inference System for GNN Models on Large Knowledge Graphs
-
122Tetris: Lightweight Hyperparameter Auto-Tuning for Mitigating Performance Spikes in LSM-KVS
-
123An End-to-End Re-Evaluation of Table Entity-Linking Systems
-
124Effective Dataset Distillation for Spatio-Temporal Forecasting with Bi-dimensional Compression
-
125COLE+ Towards Practical Column-based Learned Storage for Blockchain Systems
-
126Efficient Meta-path Constrained Reachability Query on Heterogeneous Information Networks
-
127Hexgen-Flow: Optimizing LLM Inference Request Scheduling for Agentic Text-to-SQL
-
128Krone: Hierarchical and Modular Log Anomaly Detection
-
129MINT: Multi-Vector Search Index Tuning
-
130LLM4Hint: Leveraging Large Language Models for Hint Recommendation in Offline Query Optimization
-
131CFDGraph: Privacy-Preserving Graph Processing for Large-Scale Collaborative Fraud Detection
-
132SNI-GNN: SmartNIC-Assisted Full-Graph GNN Training with In-Network Embedding Prediction
-
133A Robust and Globally Accurate Hierarchical Hub Labeling Index for SP-Distance Queries in Dynamic Road Networks
-
134TRADER: Real-Time Arbitrage Detection via Negative Cycles on Dynamic Graphs
-
135Decomposition-Driven Multi-Table Retrieval and Reasoning for Numerical Question Answering
-
136HaS: Accelerating RAG through Homology-Aware Speculative Retrieval
-
137SaCal: An Efficient Saliency-Guided Causal Framework for Interpretable Healthcare Analytics
-
138BOND: A Co-Designed Framework for LLM-Powered Analytics Over Relational Data
-
139Efficient Top-k Nearest Neighbors Search in Dynamic Road Networks
-
140An Efficient and Scalable Approach for Path Queries on Public Transportation Networks
-
141Truth ≠ Frequency: Leveraging Dependencies for Subset Repair
-
142iKSP: A Path Enumeration Index in Road Networks
-
143Novel Table Search
-
144Query-Guided Analysis and Mitigation of Data Verification Errors
-
145ImmortalChopper: Real-Time and Resilient Distributed Transactions in the Edge-Cloud
-
146SpendableStore: A UTXO-based Decentralized Data Store
-
147DIFFCOM: Conditional Discrete Diffusion Model for Community Search
-
148Geco: A Confidentiality-Preserving and High-Performance Permissioned Blockchain Framework for General Smart Contracts
-
149Distance Comparison Operations Are Not Silver Bullets in Vector Similarity Search: A Benchmark Study on Their Merits and Limits [Experiment, Analysis, and Benchmark]
-
150Efficient Community Search on Attributed Public-Private Graphs
-
151Batcher: Learning to Construct Cost-Efficient Batches of Small Queries in Big Data Processing Platforms
-
152Improving GPU Tensor Query Processing for Resource-Constrained Environments
-
153UTune: Towards Uncertainty-Aware Online Index Tuning
-
154RAMSeS: Robust and Adaptive Model Selection for Time-Series Anomaly Detection Algorithms
-
155CARROT: A Learned Cost-Constrained Retrieval Optimization System for RAG
-
156SLGParser: Practical and Efficient Label-Free Log Parsing Using Large Language Models
-
157Unifying Graph Traversals and Time Series Joins in Hybrid Graphs
-
158RFOD: Random Forest-based Outlier Detection for Mixed-Type Tabular Data
-
159CAMMSR: Category-Guided Attentive Mixture of Experts for Multimodal Sequential Recommendation
-
160CactusDB: Unlock Co-Optimization Opportunities for SQL Queries and AI/ML Model Inferences
-
161Time-varying Vector Field Compression with Preserved Critical Point Trajectories
-
162DistVec: Efficient Distributed Machine Learning in Parallel Database Systems
-
163Geography-Aware Large Language Model for Next POI Recommendation
-
164Label-Constrained Column Annotation with Language Models and Graph Neural Networks
-
165When Complex Event Recognition Meets Cloud-Native Architectures
-
166RISK: Efficiently processing rich spatial-keyword queries on encrypted geo-textual data
-
167ABC: Numerical Data Collection under Local Differential Privacy without Prior Knowledge
-
168REMON: Remote External Memory Over the Network
-
169Incremental GNN Embedding Computation on Streaming Graphs
-
170Lightweight 2-Hop Labels for Reachability Queries on Large-Scale Graphs
-
171WikiDBGraph: A Data Management Benchmark Suite for Collaborative Learning over Database Silos [Experiment, Analysis, and Benchmark]
-
172Clue-RAG: Towards Accurate and Cost-Efficient Graph-based RAG via Multi-Partite Graph-based Index
-
173Vireo: Human-in-the-Loop DBMS Fuzzing with Visualization and LLM Support
-
174LEAF-SQL: Level-wise Exploration with Adaptive Fine-graining for Text-to-SQL Skeleton Prediction
-
175An Efficient and Effective Evaluator for Text2SQL Models on Unseen and Unlabeled Data
-
176Boosting Small Language Models for Text-to-SQL with Fine-Grained Execution Feedback and Cost-Efficient Rewards
-
177PAT: Towards Transaction Routing with Page Affinity in Shared-Cache Databases
-
178Density Decomposition of Multilayer Graphs
-
179AOEH: An Efficient Extendable Hashing to Reduce Read/Write Amplification for Persistent Memory
-
180Nezha: A Key-Value Separated Distributed Store with Optimized Raft Integration
-
181Process Faster, Pay Less: Functional Isolation for Stream Processing
-
182SQAC: Scalable Querying of Attribute-Constrained (α, β)-Cores over Large Bipartite Graphs
-
183Listing Minimal Cores in Large Real-World Graphs
-
184Time-Frequency Conditioned Diffusion for Multivariate Time Series Imputation
-
185Low-Latency Stateful Stream Processing through Timely and Accurate Prefetching
-
186HYDRA: Breaking the Global Ordering Barrier in Multi-BFT Consensus
-
187Query-Driven LSM Compactions
-
188F5: A Robust SIMD-Accelerated MSD Radix Sort
-
189SQLMorph: Query Mutation and Fine-Grained Metrics for Text-to-SQL Evaluation [Experiment, Analysis, and Benchmark]
-
190Telescope: A Learned What-If Call for Column Store Selection in HTAP Databases
-
191TORepair: Diffusion-based Task-Oriented Error Repair via Differentiable Bi-Level Optimization
-
192SPARQ: A Cost-Efficient Framework for Offline Table Question Answering via Adaptive Routing
-
193BEACON: A Benchmark for Efficient and Accurate Counting of Subgraphs [Experiment, Analysis, and Benchmark]
-
194XRAG: eXamining the Core - Benchmarking Foundational Components in Advanced Retrieval-Augmented Generation [Experiment, Analysis, and Benchmark]
-
195A-Scan: Efficient Scale-up Analytics via Throughput-Guided Data Movement
-
196C²TC: A Training-Free Framework for Efficient Tabular Data Condensation
-
197ShareFlow: An Efficient Framework for Multi-Query Continuous Subgraph Matching
-
198L³C: Leaf-Centric Continuous Codes for Natural Language-Driven Table Discovery
-
199APEX: Adaptive Variable-wise Parallel Execution for Worst-Case Optimal Joins on Graph Queries
-
200HistCore: Efficient k-Core Decomposition on GPUs with Locality-Aware Computation
-
201SINDI: An Efficient Index for Sparse Vector Approximate Maximum Inner Product Search
-
202Information Leakage from Prices in Query-based Data Markets
-
203Maximum Balanced Clique Search on Large Directed Graphs
-
204Online Multi-Modal Spatio-Temporal Prediction: A Reinforcement Learning and Dynamic Contrastive Framework
-
205Mirror Asymmetry Perfect Hashing: A Memory-Efficient and Load-Intensive-Optimized Hashing Index on Hybrid DRAM-PMem Architecture
-
206Elena: An Explainability-aided Online Query Optimization Framework
-
207PRIME: Efficient Algorithm for Token Graph Routing Problem
-
208Fast and Accurate Element-Level Streaming CP Decomposition for Higher-Order Tensors
-
209Answering Federated Range Queries with Local Differential Privacy
-
210GoCache: Accelerating Out-of-Core Graph Queries with Pattern-Driven Caching
-
211TS3D: A Temporal Multimodal Dataset for Distributed Database System Analysis [Experiment, Analysis, and Benchmark]
-
212SkyNet: Solving Skyline Queries with Neural Networks
-
213LLMIA: An Out-of-the-Box Index Advisor via In-Context Learning with LLMs
-
214Approximate Diverse k-nearest Neighbor Search in Vector Database
-
215Balancing Competition for Fairness-aware Task Assignment in Spatial Crowdsourcing
-
216RaSE-KGC: A Relation-Aware Segment Encoding Approach for Knowledge Graph Completion
-
217A Set-Theoretic Approach to Detecting Logic Bugs in DBMS Inner Join Optimizations
-
218C2graph: A Compression-Collaboration Algorithm for CPU-GPU Hybrid Weighted Graph Traversals
-
219Accelerating Metadata Management of DFS via Speculative Permission Checking
-
220Prompt-Guided Community Search under Extreme Few-Shot Supervision
-
221Robust Single-message Shuffle Differential Privacy Protocol for Accurate Distribution Estimation
-
222City-wide Origin-destination Matrix Generation via Cascaded Graph Denoising Diffusion
-
223Beyond Imputation: A Semantic Unification Framework for Data and Its Missingness in Multimodal Healthcare Analytics
-
224VisiFold: Long-Term Traffic Forecasting via Temporal Folding Graph and Node Visibility
-
225Resystance: Unleashing Hidden Performance of Compaction in LSM-trees via eBPF
-
226EDDI: Explainable Data Drift Monitoring using Influence
-
227Subtree Mode and Applications
-
228HCT-QA: A Benchmark for Question Answering on Human-Centric Tables [Experiment, Analysis, and Benchmark]
-
229MM2SQL: A Benchmark and Method for Visually-Grounded SQL Generation
-
230Doux: Decoupling Values from Keys for Real-Time Analytics
-
231Analysis of Candidate Keys in Relational Databases
-
232Lequa: A Learning-Based Query-Aware Framework for Selective Query Optimization
-
233Robust Spatial-Temporal Similar Trajectory Search via Structure-Enhanced Domain-Invariant Learning
-
234LLMSQLMUTATOR: LLM-Powered Test Case Generation for Database Using Bug Reports
-
235Toward scalable Tucker decomposition: skew-aware multi-level partitioning with GPU–storage co-processing
-
236Mitigating GenAI-powered Evidence Pollution for Out-of-Context Misinformation Detection
-
237Efficient Query Rewrite Rule Discovery via Standardized Enumeration and Learning-to-Rank
-
238Text2SQL-Flow: A Robust SQL-Aware Data Augmentation Framework for Text-to-SQL
-
239Efficient Size Constraint Community Search over Heterogeneous Information Networks
-
240SOLAR: Efficient Spatial Queries on Real-time LSM-based Storage
-
241SQLVec: SQL-Based Vector Similarity Search
-
242Data-Segmentation Prompt based Continual Learning Framework for Online Spatio-Temporal Prediction
-
243Efficient Graph Matching with Pattern Reduction
-
244EDITOR: Multi-Resolution Cleaning of Multivariate Time Series via Detect-Localize-Repair
-
245GLIDE: GPU-Accelerated ANN Graph Index Construction via Data Locality
-
246Fast k-means via Data-Aware Grouping and Gap-Optimized Lower Bound
-
247BAMG: A Block-Aware Monotonic Graph Index for Disk-Based Approximate Nearest Neighbor Search
-
248Representative Functional Dependencies
-
249Unveiling Semantically Cohesive Structures: Maximal Meta-Path Clique Enumeration in Heterogeneous Graphs
-
250ARCADE: A Real-Time Data System for Hybrid and Continuous Query Processing across Diverse Data Modalities
-
251Fast Content-Aware Influence Maximization Query Answering by labeling Index
-
252Systematic Evaluation of Plan-based Adaptive Query Processing [Experiment, Analysis, and Benchmark]
-
253One Size Does NOT Fit All: On the Importance of Physical Representations for Datalog Evaluation [Experiment, Analysis, and Benchmark]
-
254Semantic Compression for Sound and Complete Query Answering over Knowledge Graphs
-
255FlashEKGR: Fast Embedding-Based Knowledge Graph Reasoning Models Training
-
256OMNIA: Closing the Loop by Leveraging LLMs for Knowledge Graphs Completion
-
257Revisiting Locally Differentially Private Protocols: Towards Better Trade-offs in Privacy, Utility, and Attack Resistance [Experiment, Analysis, and Benchmark]
-
258MISFEAT: Feature Selection for Subgroups with Mutual Information Estimation
-
259Improving Data Imputation through a Tuned Strategy for Dependency Discovery
-
260QPAD: Quantile-Preserving Approximate Dimension Reduction for Nearest Neighbors Preservation in High-Dimensional Vector Search
-
261ZTab: Domain-based Zero-shot Annotation for Table Columns
