Papers highlighted in yellow are papers that are likely of interest but we do not currently have access to the paper.
Papers highlighted in red are taken.
Session 1: Streams
Sampling Algorithms in a Stream Operator
Ted Johnson
(AT&T Labs),
Fault-Tolerance in the Borealis Distributed Stream Processing System
Holistic Aggregates in a Networked World: Distributed Tracking of Approximate Quantiles
Graham Cormode (
Session 2: Anonymity
and Nondisclosure
Deriving
Private Information from Randomized Data
Zhengli
Huang, Wenliang Du, Biao Chen (
Incognito
- Efficient Full-Domain K-Anonymity
Kristen
LeFevre, David DeWitt, Raghu Ramakrishnan (
To Do or Not To Do - The Dilemma of Disclosing Anonymized Data
Laks Lakshmanan, Raymond Ng, Ganesh Ramesh (
Session 3: Personal
Information Spaces
Constrained Optimalities in Query Personalization
Georgia Koutrika, Yannis Ioannidis (
Reference
Reconciliation in Complex Information Spaces
Xin Luna
Dong, Alon Halevy, Jayant Madhavan (
http://www.cs.washington.edu/homes/lunadong/publication/reconciliation_sigmod.pdf
Magnet:
Supporting Navigation in Semistructured Data Environments
Vineet
Sinha, David Karger (MIT)
http://haystack.lcs.mit.edu/papers/magnet-sigmod2005.pdf
Session 4: Query
Optimization
Proactive Re-optimization
Shivnath Babu (
Towards a Robust Query Optimizer: A Principled and Practical Approach
Brian Babcock (
RankSQL: Query Algebra and Optimization for Relational Top-k Queries
Chengkai Li (Univ. of Illinois), Kevin Chang (Univ. of Illinois), Ihab Ilyas (Univ. of Waterloo), Sumin Song (Univ. of Illinois)
Session 5: Data
Cleaning and Mapping
A Cost-Based Model and Effective Heuristic for Repairing Constraints by Value Modification
Philip Bohannon (Bell Labs), Wenfei Fan (Univ. of Edinburgh and Bell Labs), Michael Flaster (Bell Labs), Rajeev Rastogi (Bell Labs India)
ConQuer: Efficient Management of Inconsistent Databases
Ariel Fuxman, Elham Fazli, Renee J. Miller (Univ. of Toronto)
Supporting
Executable Mappings in Model Management
Sergey
Melnik (Microsoft Research), Philip A. Bernstein (Microsoft Research), Alon
Halevy (Univ. of Washington), Erhard Rahm (Univ. of Leipzig)
http://www.cs.washington.edu/homes/alon/files/sigmod05-mm.pdf
Session 6: Query
Processing Techniques
Stacked Indexed Views in Microsoft SQL Server
David DeHaan (Univ. of Waterloo), Per-Ake Larson (Microsoft Research), Jingren Zhou (Microsoft Research)
A Nested Relational Approach to Processing SQL Subqueries
Bin Cao, Antonio Badia (Univ. of Louisville)
Stratified Computation of Skylines with Partially-Ordered Domains
Chee-Yong Chan, Pin-Kwang Eng, Kian-Lee Tan (National Univ. of Singapore)
Session 7: Adaptive,
Automatic, Autonomic Systems
AGILE - Adaptive Indexing for Context-Aware Information Filters Jens-Peter Dittrich, Peter M. Fischer, Donald Kossmann
Jens-Peter Dittrich, Peter M. Fischer, Donald Kossmann (ETH Zurich)
Automatic Physical Database Tuning - A Relaxation-based Approach
Nicolas Bruno, Surajit Chaudhuri (Microsoft Research)
Goals
and Benchmarks for Autonomic Configuration Recommenders
Mariano
Consens (
Session 8: OLAP
Privacy Preserving OLAP
Rakesh Agrawal (IBM Almaden), Ramakrishnan Srikant (IBM Almaden), Dilys Thomas (Stanford Univ.)
Efficient Computation of Multiple Group By Queries
Zhimin Chen, Vivek Narasayya (Microsoft Research)
SHIFT-SPLIT: I/O Efficient Maintenance of Wavelet-Transformed Multidimensional Data
Mehrdad Jahangiri, Dimitris Sacharidis, Cyrus Shahabi (USC)
Session 9: Stream
Aggregation
Tributaries and Deltas: Efficient and Robust Aggregation in Sensor
Network Streams
Amit Manjhi (Carnegie Mellon Univ.), Suman Nath (Carnegie
Mellon Univ.), Phillip Gibbons (Intel Research Pittsburgh)
Multiple Aggregations Over Data Streams
Rui Zhang (
Semantics and Evaluation Techniques for Window Aggregates in Data Streams
Jin Li (Portland State Univ.), David Maier (Portland State Univ.), Kristin Tufte (Portland State Univ.), Vassilis Papadimos (Portland State Univ.), Peter Tucker (Whitworth College)
Session 10: Storage,
Indexing and System Architecture
Guaranteeing
Correctness and Availability in P2P Range Indices
Prakash
Linga, Adina Crainiceanu, Johannes Gehrke, Jayavel Shanmugasundaram (
http://www.cs.cornell.edu/People/jai/papers/P2PCorrectness.pdf
Online B-tree Merging
Xiaowei Sun (Northeastern Univ.), Rui Wang (Northeastern Univ.), Betty Salzberg (Northeastern Univ.), Chendong Zou (IBM)
System
RX: One Part Relational, One Part XML
Kevin Beyer, Roberta J. Cochrane, Vanja Josifovski, Jim Kleewein, George Lapis, Guy Lohman, Bob Lyle, Fatma Ozcan, Hamid Pirahesh, Normen Seemann, Tuong Truong, Bert Van der Linden, Brian Vickery, Chun Zhang (IBM Almaden and Silicon Valley Labs)
Session 11: Streams
and Pipelined Processing
On Joining and Caching Stochastic Streams
Junyi Xie, Jun Yang, Yuguo Chen (
RPJ: Producing Fast Join Results on Streams through Rate-based Optimization
Yufei Tao (Univ. of Hong Kong), Man Lung Yiu (Univ. of Hong Kong), Dimitris Papadias (HKUST), Marios Hadjieleftheriou (UC Riverside), Nikos Mamoulis (Univ. of Hong Kong)
QPipe: A
Simultaneously Pipelined Relational Query Engine
Stavros
Harizopoulos, Vladislav Shkapenyuk, Anastassia Ailamaki (Carnegie Mellon Univ.)
http://www-2.cs.cmu.edu/~stavros/app/qpipe.pdf
Session 12: Correctness
and Trust
Fossilized
Index: The Linchpin of Trustworthy Non-Alterable Electronic Records
Qingbo Zhu
(
http://opera.cs.uiuc.edu/~qzhu1/papers/SIGMOD05.pdf
Verifying Completeness of Relational Query Results in Data Publishing
HweeHwa Pang (Inst. for Infocomm Research), Arpit Jain (IIT
Middleware based Data Replication providing Snapshot Isolation
Yi Lin (
Session 13: XML
Processing
DogmatiX
Tracks down Duplicates in XML
Melanie
Weis, Felix Naumann (Humboldt-Universitaet zu
Incremental Maintenance of Path Expression Views
Arsany Sawires, Junichi Tatemura, Oliver Po, Divyakant Agrawal, K. Selcuk Candan (NEC Labs)
On Boosting Holism in XML Twig Pattern Matching using Structural Indexing Techniques
Ting Chen, Jiaheng Lu, Tok Wang Ling (
Session 14: Spatial
and High-Dimensional Data
CURLER: Finding and Visualizing Nonlinear Correlated
Clusters
Anthony K. H. Tung, Xin Xu, Beng Chin Ooi (
A Generic Framework for Monitoring Continuous Spatial Queries over Moving Objects
Haibo Hu (HKUST), Jianliang Xu (HKBU), Dik Lee (HKUST)
Robust and Fast Similarity Search for Moving Object Trajectories
Lei Chen (
Session 15: XML
Query, Update and Search
Extending
XQuery for Analytics
Kevin Beyer (IBM Almaden), Don Chamberlin (IBM Almaden), Latha Colby (IBM Almaden), Fatma Ozcan (IBM Almaden), Hamid Pirahesh (IBM Almaden), Yu Xu (UC San Diego)
Lazy XML
Updates: Laziness as a Virtue of Update and Structural Join Efficiency
Barbara
Catania (Univ. of Genoa), Beng Chin Ooi (National Univ. of Singapore), Wenqiang
Wang (National Univ. of Singapore), Xiaoling Wang (Fudan Univ.)
http://www.comp.nus.edu.sg/~ooibc/sigmod386.pdf
Efficient
Keyword Search for Smallest LCAs in XML Databases
Yu Xu, Yannis
Papakonstantinou (UC
http://www.db.ucsd.edu:8080/root/pubsFileFolder/232.pdf
Session 16: Web
A Verifier
for Interactive, Data-Driven Web Applications
Alin
Deutsch, Monica Marcus, Liying Sui, Victor Vianu, Dayou Zhou (UC
http://opera.cs.uiuc.edu/~qzhu1/papers/SIGMOD05.pdf
Page
Quality: In Search of an Unbiased Web Ranking
Junghoo
Cho, Sourashis Roy, Robert Adams (UCLA)
http://rose.cs.ucla.edu/~cho/papers/cho-quality-long.pdf
Session 17:
Estimation and Approximation
A Disk-Based Join With Probabilistic Guarantees
Christopher Jermaine, Alin Dobra, Subramanian Arumugam,
Shantanu Joshi, Abhijit Pol (
When Can We Trust Progress Estimators for SQL Queries?
Surajit Chaudhuri, Raghav Kaushik, Ravishankar Ramamurthy (Microsoft Research)
Relational Confidence Bounds Are Easy With The Bootstrap
Abhijit Pol, Christopher Jermaine (
Session 18: Stream
and Sequence Mining
BRAID: Stream Mining through Group Lag Correlations
Yasushi Sakurai (NTT), Spiros Papadimitriou (Carnegie Mellon Univ.), Christos Faloutsos (Carnegie Mellon Univ.)
Fast and Approximate Stream Mining of Quantiles and Frequencies Using Graphics Processors
Naga Govindaraju, Nikunj Raghuvanshi, Dinesh Manocha (UNC
Mining Periodic Patterns with Gap Requirement from Sequences
Minghua Zhang, Ben Kao, David Cheung, Kevin Yip (
Session 19:
Continuous Queries
Conceptual Partitioning: An Efficient Method for Continuous Nearest Neighbor Monitoring
Kyriakos Mouratidis (HKUST), Marios Hadjieleftheriou (UC Riverside), Dimitris Papadias (HKUST)
Matthew Denny, Michael Franklin (UC Berkeley)
Update-Pattern-Aware Modeling and Processing of Continuous Queries
Lukasz Golab, M. Tamer Ozsu (
Session 20: Mining
Biological and Medical Data
Mining Top-k Covering Rule Groups for Gene Expression Data
Gao Cong (Univ. of Edinburgh), Kian-Lee Tan (National Univ. of Singapore), Anthony K. H. Tung (National Univ. of Singapore), Xin Xu (National Univ. of Singapore)
Subsequence Matching on Structured Time Series Data
Huanmei Wu (Northeastern Univ.), Betty Salzberg (Northeastern Univ.), Gregory Sharp (Harvard Medical School), Steve Jiang (Harvard Medical School), Hiroki Shirato (Hokkaido Univ.), David Kaeli (Northeastern Univ.)
TriCluster: An Effective Algorithm for Mining Coherent
Clusters in 3D Microarray Data
Lizhuang Zhao, Mohammed Zaki (RPI)
Session 21: Spatial
and Multimedia Data
Query-Sensitive Embeddings
Vassilis Athitsos (
STRG-Index: Spatio-Temporal Region Graph Indexing for Large Video Databases
JeongKyu Lee, JungHwan Oh, Sae Hwang (
Towards Effective Indexing for Very Large Video Sequence Database
Heng Tao Shen (
Session 22: Graph and
Tree-Structured Data
Cost-Sensitive Reordering of Navigational Primitives
Carl-Christian Kanne, Matthias Brantner, Guido Moerkotte (
Similarity Evaluation on Tree-structured Data
Rui Yang, Panos Kalnis, Anthony K. H. Tung (
Substructure Similarity Search in Graph Databases
Xifeng Yan (
Other:
Alon Y. Halevy (Editor), Naveen Ashish, Dina Bitton, Michael Carey, Denise Draper, Jeff Pollock, Arnon Rosenthal, Vishal Sikkay
**********************************************************************
Pods
Session 1: Querying
XML & Semistructured Data/Query Languages
XML Data
Exchange: Consistency and Query Answering - BEST PAPER
Marcelo
Arenas (U of
http://www.cs.toronto.edu/~libkin/papers/pods05.ps.gz
XPath Satisfiability in the Presence of DTDs
Michael Benedikt (
Deciding Well-Definedness of XQuery Fragments
Stijn Vansummeren (Limburgs Univsersitair Centrum)
Views
and Queries: Determinacy and Rewriting
Luc
Segoufin (INRIA, France ), Victor Vianu (UC
Session 2: Complexity
& Performance Evaluation
On the Complexity of Division and Set Joins in the Relational Algebra - BEST PAPER
Dirk Leinders (Limburgs Universitair Centrum), Jan Van den Bussche (Limburgs Universitair Centrum)
On the Complexity of Nonrecursive XQuery and Functional Query Languages on Complex Values
Christoph Koch (Technical
An Incremental Algorithm for Computing Ranked Full Disjunctions
Sara Cohen (Technion - Israel Institute of Technology),
Yehoshua Sagiv (The
Session 3: Security
& Privacy
Security
Analysis of Cryptographically Controlled Access to XML Documents - BEST
NEWCOMER PAPER
Martin
Abadi (
http://www.cs.ucsd.edu/~bogdan/pdf/xml.pdf
Simulatable Auditing
Krishnaram Kenthapadi (
Practical
Privacy: The SuLQ Framework
Avrim
Blum (Carnegie Mellon), Cynthia Dwork (Microsoft Research), Frank McSherry
(Microsoft Research), Kobbi Nissim (
Privacy-Enhancing k-Anonymization of Customer Data
Sheng Zhong (Stevens Institute of Technology), Zhiqiang Yang (Stevens Institute of Technology), Rebecca Wright (Stevens Institute of Technology)
Session 4: Data
Integration & Interoperability
Computing Cores for Data Exchange: New Algorithms and Practical Solutions
Georg Gottlob (TU Wien, Inst f. Informationssysteme)
Peer
Data Exchange
Ariel Fuxman (University of Toronto), Phokion Kolaitis (IBM Almaden Research Center), Renee J. Miller (University of Toronto), Wang-Chiew Tan (University of California at Santa Cruz)
Composition
of Mappings Given by Embedded Dependencies
Alan
Nash (
Session 5: Data
Mining / Transaction Management
Multi-Structural Databases
Ronald Fagin (IBM Almaden Research Center), R Guha (IBM), Ravi Kumar (IBM), Jasmine Novak (IBM), D Sivakumar (IBM), Andrew Tomkins (IBM)
A Divide-and-Merge Methodology for Clustering
David Cheng (MIT),
Allocating Isolation Levels to Transactions
Alan Fekete (
Session 6: Complexity
& Performance Evaluation / Data Stream Management
Buffering in Query Evaluation over XML Streams
Ziv Bar-Yossef (Technion), Marcus Fontoura (IBM Almaden), Vanja Josifovski (IBM Almaden)
Histograms Revisited: When are histograms the best approximation method for aggregates over joins?
Alin Dobra (
Lower Bounds for Sorting with Few Random Accesses to External Memory
Martin Grohe (Humboldt-Universitaet), Nicole Schweikardt (
Session 7: Data
Stream Management
Operator Placement for In-Network Stream Query Processing
Utkarsh Srivastava (
Join-Distinct Aggregate Estimation over Update Streams
Minos Garofalakis (Bell Labs), Sumit Ganguly (IIT Kanpur), Amit Kumar (IIT Delhi), Rajeev Rastogi (Bell Labs)
Space Efficient Mining of Multigraph Streams
Graham Cormode (Bell Labs), S. Muthukrishan (Rutgers University)
Session 8:
Information Processing on the Web
XML Type Checking with Macro Tree Transducers
Sebastian Maneth (Ecole Polytechnique Federal de Lausanne), Thomas Perst (Technische Universität München), Alexandru Berlea (TU München), Helmut Seidl (Technische Universität München)
Regular and Unambiguous Rewritings for Active XML
Serge Abiteboul (INRIA),
Tova Milo(
Determining Source Contribution in Information Integration Systems
Alin deutsch (UC San Diego), Yannis Katsis (UC San Diego),
Yannis Papakonstantinou (
Session 9: Databases
& Information Retrieval / Data Mining
Estimating arbitrary subset sums with few probes
Noga Alon (
FTW: Fast Similarity Search under the Time Warping Distance
Yasushi Sakurai (NTT), Masatoshi Yoshikawa (
Space Complexity of Hierarchical Heavy Hitters in Multi-Dimensional Data Streams
Nisheeth Shrivastava (University Of California, Santa Barbara), John Hershberger (Mentor Graphics Corp), Subhash Suri (University of California, Santa Barbara), Csaba Toth (Massachusetts Institute of Technology)
Session 10: Logic in
Databases
Differential Constraints
Bassem Sayrafi (
Diagnosis of Asynchronous Discrete Event Systems - Datalog to the Rescue!
Serge Abiteboul (INRIA), Zoe Abrams (
Relative Risk and Odds Ratio: A Data Mining Perspective
Haiquan Li (institute for infocomm research), Jinyan Li (institute for infocomm research), Limsoon Wong (institute for infocomm research), Mengling Feng (Nanyang Technological University), Yap Peng Tan (Nanyang Technological University)