Accepted Industry and Application Papers
-
01Democratizing Tabular Data Access with an Open-Source Synthetic-Data SDK
-
02DLRover-LM: LLM Pre-Training Framework with Thousands of Accelerators in AntGroup
-
03StreamShield: A Production-Proven Resiliency Solution for Apache Flink at ByteDance
-
04Tackling Workload Forecasting Challenges with an Offline-Online Dynamic Framework
-
05OceanBase Mercury: Building a Distributed Real-time Analytical Processing Database System
-
06High-Fidelity and Complex Test Data Generation for Google SQL Code Generation Services
-
07Taxon: Hierarchical Tax Code Prediction with Semantically Aligned LLM Expert Guidance
-
08OceanBase CDC: A Log-Based Distributed CDC System for High Availability and Scalability
-
09TAT: Temporal-Aligned Transformer for Multi-Horizon Peak Demand Forecasting
-
10Hierarchical Industrial Demand Forecasting with Temporal and Uncertainty Explanations
-
11Bala-Join: An Adaptive Hash Join for Balancing Communication and Computation in Geo-Distributed SQL Databases
-
12Automatic Parameter Tuning for Compaction in LSM-Tree based Databases
-
13REVISION: Reflective Intent Mining and Online Reasoning Auxiliary for E-commerce Visual Search System Optimization
-
14DBdoctor: A Fine-grained and Non-intrusive Performance Diagnosis Platform for Databases
-
15CCD–Level and Load-Aware Thread Orchestration for In-Memory Vector ANNS on Multi-Core CPUs
-
16REG4Rec: Reasoning-Enhanced Generative Model for Large-Scale Recommendation Systems
-
17Graph Query Generation with Constraint-guided Large Language Agents
-
18JITPrune: An Efficient Online Feature Pruning Framework for Embedding-based DLRM Training
-
19D2SQA: An Edge–Cloud Collaborative Slow Query Analysis Framework Deployed at DBAPPSecurity
-
20GALA: Generative Aligned Learning for Adaptive Multimodal Representation in the Eleme Recommender System
-
21KScaNN: Scalable Approximate Nearest Neighbor Search on Kunpeng
-
22Efficient Data Processing using On-the-Fly Host-PIM Interactions in a Commodity PIM System
-
23RedParrot: Accelerating NL-to-DSL for Business Analytics via Query Semantic Caching
-
24CoLIBRi: Supporting quotation through multi-modal retrieval and conversational search on manufacturing drawings
-
25Building and Benchmarking Large Language Models for Machine Translation in Social Network Services
-
26FELA: A Multi-Agent Evolutionary System for Feature Engineering of Industrial Event Log Data
-
27Cascading Relevance-driven Recommendation Network for CTR Prediction in Trigger-Introduced Recommendation
-
28Accurate and Efficient Multi-channel Time Series Forecasting via Sparse Attention Mechanism
-
29From Benchmarks to Production: Transferring Time Series Anomaly Detection Methods for Electricity Production Monitoring
-
30OpenZL: Using Graphs to Compress Smaller and Faster
-
31GaV: Guess and Verification of Column Semantics
-
32User-Adaptive Meta-Learning for Cold-Start Medication Recommendation with Uncertainty Filtering
-
33On Efficient Materialization in Data Lakes
-
34Billion-scale Fintech Analytics: Scalable Data Management and Anomaly Detection at NPCI
-
35Decoupled Multimodal Fusion for User Interest Modeling in Click-Through Rate Prediction
-
36GalaxyRAG: Graph Retrieval-Augmented Generation for Enterprise Knowledge Systems
-
37OEPO: Online Experience-based Preference Optimization for CTR Prediction
-
38DM-RAG: Enhancing User Support in Dameng Databases with Retrieval-Augmented Generation
-
39Relevance Matters: A Multi-Task and Multi-Stage Large Language Model Approach for E-commerce Query Rewriting
