Papers highlighted in green are papers that are likely of interest and we have access to the paper.

Papers highlighted in yellow are papers that are likely of interest but we do not currently have access to the paper.

Papers highlighted in red are taken.

SIGMOD

 

Session 1: Streams

 

Sampling Algorithms in a Stream Operator

Ted Johnson (AT&T Labs), S. Muthukrishnan (Rutgers Univ.), Irina Rozenbaum (Rutgers Univ.)

 

Fault-Tolerance in the Borealis Distributed Stream Processing System

Magdalena Balazinska, Hari Balakrishnan, Sam Madden, Michael Stonebraker (MIT)

 

Holistic Aggregates in a Networked World: Distributed Tracking of Approximate Quantiles

Graham Cormode (Bell Labs), Minos Garofalakis (Bell Labs), S. Muthukrishnan (Rutgers Univ.), Rajeev Rastogi (Bell Labs)

 

Session 2: Anonymity and Nondisclosure

Deriving Private Information from Randomized Data

Zhengli Huang, Wenliang Du, Biao Chen (Syracuse Univ.)

 

Incognito - Efficient Full-Domain K-Anonymity

Kristen LeFevre, David DeWitt, Raghu Ramakrishnan (Univ. of Wisconsin)

 

To Do or Not To Do - The Dilemma of Disclosing Anonymized Data

Laks Lakshmanan, Raymond Ng, Ganesh Ramesh (Univ. of British Columbia)

 

Session 3: Personal Information Spaces

Constrained Optimalities in Query Personalization

Georgia Koutrika, Yannis Ioannidis (Univ. of Athens)

 

Reference Reconciliation in Complex Information Spaces

Xin Luna Dong, Alon Halevy, Jayant Madhavan (Univ. of Washington)

http://www.cs.washington.edu/homes/lunadong/publication/reconciliation_sigmod.pdf

 

Magnet: Supporting Navigation in Semistructured Data Environments

Vineet Sinha, David Karger (MIT)

http://haystack.lcs.mit.edu/papers/magnet-sigmod2005.pdf

 

Session 4: Query Optimization

Proactive Re-optimization

Shivnath Babu (Stanford Univ.), Pedro Bizarro (Univ. of Wisconsin), David DeWitt (Univ. of Wisconsin)

 

Towards a Robust Query Optimizer: A Principled and Practical Approach

Brian Babcock (Stanford Univ.), Surajit Chaudhuri (Microsoft Research)

 

RankSQL: Query Algebra and Optimization for Relational Top-k Queries

Chengkai Li (Univ. of Illinois), Kevin Chang (Univ. of Illinois), Ihab Ilyas (Univ. of Waterloo), Sumin Song (Univ. of Illinois)

 

Session 5: Data Cleaning and Mapping

A Cost-Based Model and Effective Heuristic for Repairing Constraints by Value Modification

Philip Bohannon (Bell Labs), Wenfei Fan (Univ. of Edinburgh and Bell Labs), Michael Flaster (Bell Labs), Rajeev Rastogi (Bell Labs India)

 

ConQuer: Efficient Management of Inconsistent Databases

Ariel Fuxman, Elham Fazli, Renee J. Miller (Univ. of Toronto)

 

Supporting Executable Mappings in Model Management

Sergey Melnik (Microsoft Research), Philip A. Bernstein (Microsoft Research), Alon Halevy (Univ. of Washington), Erhard Rahm (Univ. of Leipzig)

http://www.cs.washington.edu/homes/alon/files/sigmod05-mm.pdf

 

Session 6: Query Processing Techniques

Stacked Indexed Views in Microsoft SQL Server

David DeHaan (Univ. of Waterloo), Per-Ake Larson (Microsoft Research), Jingren Zhou (Microsoft Research)

 

A Nested Relational Approach to Processing SQL Subqueries

Bin Cao, Antonio Badia (Univ. of Louisville)

 

Stratified Computation of Skylines with Partially-Ordered Domains

Chee-Yong Chan, Pin-Kwang Eng, Kian-Lee Tan (National Univ. of Singapore)

 

Session 7: Adaptive, Automatic, Autonomic Systems

AGILE - Adaptive Indexing for Context-Aware Information Filters Jens-Peter Dittrich, Peter M. Fischer, Donald Kossmann

Jens-Peter Dittrich, Peter M. Fischer, Donald Kossmann (ETH Zurich)

 

Automatic Physical Database Tuning - A Relaxation-based Approach

Nicolas Bruno, Surajit Chaudhuri (Microsoft Research)

 

Goals and Benchmarks for Autonomic Configuration Recommenders

Mariano Consens (Univ. of Toronto), Denilson Barbosa (Univ. of Toronto), Adrian M. Teisanu (Univ. of Toronto), Laurent Mignet (IBM India Research Lab)

 

Session 8: OLAP

Privacy Preserving OLAP

Rakesh Agrawal (IBM Almaden), Ramakrishnan Srikant (IBM Almaden), Dilys Thomas (Stanford Univ.)

 

Efficient Computation of Multiple Group By Queries

Zhimin Chen, Vivek Narasayya (Microsoft Research)

 

SHIFT-SPLIT: I/O Efficient Maintenance of Wavelet-Transformed Multidimensional Data

Mehrdad Jahangiri, Dimitris Sacharidis, Cyrus Shahabi (USC)

 

Session 9: Stream Aggregation

Tributaries and Deltas: Efficient and Robust Aggregation in Sensor Network Streams

Amit Manjhi (Carnegie Mellon Univ.), Suman Nath (Carnegie Mellon Univ.), Phillip Gibbons (Intel Research Pittsburgh)

 

Multiple Aggregations Over Data Streams

Rui Zhang (National Univ. of Singapore), Nick Koudas (Univ. of Toronto), Beng Chin Ooi (National Univ. of Singapore), Divesh Srivastava (AT&T Labs),

 

Semantics and Evaluation Techniques for Window Aggregates in Data Streams

Jin Li (Portland State Univ.), David Maier (Portland State Univ.), Kristin Tufte (Portland State Univ.), Vassilis Papadimos (Portland State Univ.), Peter Tucker (Whitworth College)

 

Session 10: Storage, Indexing and System Architecture

Guaranteeing Correctness and Availability in P2P Range Indices

Prakash Linga, Adina Crainiceanu, Johannes Gehrke, Jayavel Shanmugasundaram (Cornell Univ.)

http://www.cs.cornell.edu/People/jai/papers/P2PCorrectness.pdf

 

Online B-tree Merging

Xiaowei Sun (Northeastern Univ.), Rui Wang (Northeastern Univ.), Betty Salzberg (Northeastern Univ.), Chendong Zou (IBM)

 

System RX: One Part Relational, One Part XML

Kevin Beyer, Roberta J. Cochrane, Vanja Josifovski, Jim Kleewein, George Lapis, Guy Lohman, Bob Lyle, Fatma Ozcan, Hamid Pirahesh, Normen Seemann, Tuong Truong, Bert Van der Linden, Brian Vickery, Chun Zhang (IBM Almaden and Silicon Valley Labs)

 

 

Session 11: Streams and Pipelined Processing

On Joining and Caching Stochastic Streams

Junyi Xie, Jun Yang, Yuguo Chen (Duke Univ.)

 

RPJ: Producing Fast Join Results on Streams through Rate-based Optimization

Yufei Tao (Univ. of Hong Kong), Man Lung Yiu (Univ. of Hong Kong), Dimitris Papadias (HKUST), Marios Hadjieleftheriou (UC Riverside), Nikos Mamoulis (Univ. of Hong Kong)

 

QPipe: A Simultaneously Pipelined Relational Query Engine

Stavros Harizopoulos, Vladislav Shkapenyuk, Anastassia Ailamaki (Carnegie Mellon Univ.)

http://www-2.cs.cmu.edu/~stavros/app/qpipe.pdf

 

Session 12: Correctness and Trust

Fossilized Index: The Linchpin of Trustworthy Non-Alterable Electronic Records

Qingbo Zhu (Univ. of Illinois), Windsor Hsu (IBM Almaden)

http://opera.cs.uiuc.edu/~qzhu1/papers/SIGMOD05.pdf

 

Verifying Completeness of Relational Query Results in Data Publishing

HweeHwa Pang (Inst. for Infocomm Research), Arpit Jain (IIT Bombay), Krithi Ramamritham (IIT Bombay), Kian-Lee Tan (National Univ. of Singapore)

 

Middleware based Data Replication providing Snapshot Isolation

Yi Lin (Mcgill Univ.), Bettina Kemme (McGill Univ.), Marta Patino-Martinez (Univ. Politecnica de Madrid), Ricardo Jimenez-Peris (Univ. Politecnica de Madrid)

 

Session 13: XML Processing

DogmatiX Tracks down Duplicates in XML

Melanie Weis, Felix Naumann (Humboldt-Universitaet zu Berlin)

 

Incremental Maintenance of Path Expression Views

Arsany Sawires, Junichi Tatemura, Oliver Po, Divyakant Agrawal, K. Selcuk Candan (NEC Labs)

 

On Boosting Holism in XML Twig Pattern Matching using Structural Indexing Techniques

Ting Chen, Jiaheng Lu, Tok Wang Ling (National Univ. of Singapore)

 

Session 14: Spatial and High-Dimensional Data

CURLER: Finding and Visualizing Nonlinear Correlated Clusters

Anthony K. H. Tung, Xin Xu, Beng Chin Ooi (National Univ. of Singapore)

 

A Generic Framework for Monitoring Continuous Spatial Queries over Moving Objects

Haibo Hu (HKUST), Jianliang Xu (HKBU), Dik Lee (HKUST)

 

Robust and Fast Similarity Search for Moving Object Trajectories

Lei Chen (Univ. of Waterloo), Tamer Ozsu (Univ. of Waterloo), Vincent Oria (NJ Inst. of Technology)

 

Session 15: XML Query, Update and Search

Extending XQuery for Analytics

Kevin Beyer (IBM Almaden), Don Chamberlin (IBM Almaden), Latha Colby (IBM Almaden), Fatma Ozcan (IBM Almaden), Hamid Pirahesh (IBM Almaden), Yu Xu (UC San Diego)

 

Lazy XML Updates: Laziness as a Virtue of Update and Structural Join Efficiency

Barbara Catania (Univ. of Genoa), Beng Chin Ooi (National Univ. of Singapore), Wenqiang Wang (National Univ. of Singapore), Xiaoling Wang (Fudan Univ.)

http://www.comp.nus.edu.sg/~ooibc/sigmod386.pdf

 

Efficient Keyword Search for Smallest LCAs in XML Databases

Yu Xu, Yannis Papakonstantinou (UC San Diego)

http://www.db.ucsd.edu:8080/root/pubsFileFolder/232.pdf

 

Session 16: Web

A Verifier for Interactive, Data-Driven Web Applications

Alin Deutsch, Monica Marcus, Liying Sui, Victor Vianu, Dayou Zhou (UC San Diego)

http://opera.cs.uiuc.edu/~qzhu1/papers/SIGMOD05.pdf

 

Page Quality: In Search of an Unbiased Web Ranking

Junghoo Cho, Sourashis Roy, Robert Adams (UCLA)

http://rose.cs.ucla.edu/~cho/papers/cho-quality-long.pdf

 

Session 17: Estimation and Approximation

A Disk-Based Join With Probabilistic Guarantees

Christopher Jermaine, Alin Dobra, Subramanian Arumugam, Shantanu Joshi, Abhijit Pol (Univ. of Florida)

 

When Can We Trust Progress Estimators for SQL Queries?

Surajit Chaudhuri, Raghav Kaushik, Ravishankar Ramamurthy (Microsoft Research)

 

Relational Confidence Bounds Are Easy With The Bootstrap

Abhijit Pol, Christopher Jermaine (Univ. of Florida)

 

Session 18: Stream and Sequence Mining

BRAID: Stream Mining through Group Lag Correlations

Yasushi Sakurai (NTT), Spiros Papadimitriou (Carnegie Mellon Univ.), Christos Faloutsos (Carnegie Mellon Univ.)

 

Fast and Approximate Stream Mining of Quantiles and Frequencies Using Graphics Processors

Naga Govindaraju, Nikunj Raghuvanshi, Dinesh Manocha (UNC Chapel Hill)

 

Mining Periodic Patterns with Gap Requirement from Sequences

Minghua Zhang, Ben Kao, David Cheung, Kevin Yip (Univ. of Hong Kong)

 

Session 19: Continuous Queries

Conceptual Partitioning: An Efficient Method for Continuous Nearest Neighbor Monitoring

Kyriakos Mouratidis (HKUST), Marios Hadjieleftheriou (UC Riverside), Dimitris Papadias (HKUST)

 

Predicate Result Range Caching for Continuous Queries

Matthew Denny, Michael Franklin (UC Berkeley)

 

Update-Pattern-Aware Modeling and Processing of Continuous Queries

Lukasz Golab, M. Tamer Ozsu (Univ. of Waterloo)

 

Session 20: Mining Biological and Medical Data

Mining Top-k Covering Rule Groups for Gene Expression Data

Gao Cong (Univ. of Edinburgh), Kian-Lee Tan (National Univ. of Singapore), Anthony K. H. Tung (National Univ. of Singapore), Xin Xu (National Univ. of Singapore)

 

Subsequence Matching on Structured Time Series Data

Huanmei Wu (Northeastern Univ.), Betty Salzberg (Northeastern Univ.), Gregory Sharp (Harvard Medical School), Steve Jiang (Harvard Medical School), Hiroki Shirato (Hokkaido Univ.), David Kaeli (Northeastern Univ.)

 

TriCluster: An Effective Algorithm for Mining Coherent Clusters in 3D Microarray Data

Lizhuang Zhao, Mohammed Zaki (RPI)

 

Session 21: Spatial and Multimedia Data

Query-Sensitive Embeddings

Vassilis Athitsos (Boston Univ.), Marios Hadjieleftheriou (UC Riverside), George Kollios (Boston Univ.), Stan Sclaroff (Boston Univ.)

 

STRG-Index: Spatio-Temporal Region Graph Indexing for Large Video Databases

JeongKyu Lee, JungHwan Oh, Sae Hwang (Univ. of Texas at Arlington)

 

Towards Effective Indexing for Very Large Video Sequence Database

Heng Tao Shen (Univ. of Queensland), Beng Chin Ooi (National Univ. of Singapore), Xiaofang Zhou (Univ. of Queensland)

 

Session 22: Graph and Tree-Structured Data

Cost-Sensitive Reordering of Navigational Primitives

Carl-Christian Kanne, Matthias Brantner, Guido Moerkotte (Univ. of Mannheim)

 

Similarity Evaluation on Tree-structured Data

Rui Yang, Panos Kalnis, Anthony K. H. Tung (National Univ. of Singapore)

 

Substructure Similarity Search in Graph Databases

Xifeng Yan (Univ. of Illinois), Philip Yu (IBM T.J. Watson), Jiawei Han (Univ. of Illinois)

Other:

Enterprise Information Integration: Successes, Challenges and Controversies

Alon Y. Halevy (Editor), Naveen Ashish, Dina Bitton, Michael Carey, Denise Draper, Jeff Pollock, Arnon Rosenthal, Vishal Sikkay

**********************************************************************

 

Pods

 

Session 1: Querying XML & Semistructured Data/Query Languages

XML Data Exchange: Consistency and Query Answering - BEST PAPER

Marcelo Arenas (U of Toronto), Leonid Libkin (U of Toronto)

http://www.cs.toronto.edu/~libkin/papers/pods05.ps.gz

 

XPath Satisfiability in the Presence of DTDs

Michael Benedikt (Bell Labs), Wenfei Fan (Bell Labs), Floris Geerts (University of Edinburgh)

 

Deciding Well-Definedness of XQuery Fragments

Stijn Vansummeren (Limburgs Univsersitair Centrum)

 

Views and Queries: Determinacy and Rewriting

Luc Segoufin (INRIA, France ), Victor Vianu (UC San Diego)

 

Session 2: Complexity & Performance Evaluation

On the Complexity of Division and Set Joins in the Relational Algebra - BEST PAPER

Dirk Leinders (Limburgs Universitair Centrum), Jan Van den Bussche (Limburgs Universitair  Centrum)

 

On the Complexity of Nonrecursive XQuery and Functional Query Languages on Complex Values

Christoph Koch (Technical University of Vienna, Austria )

 

An Incremental Algorithm for Computing Ranked Full Disjunctions

Sara Cohen (Technion - Israel Institute of Technology), Yehoshua Sagiv (The Hebrew University of Jerusalem)

 

Session 3: Security & Privacy

Security Analysis of Cryptographically Controlled Access to XML Documents - BEST NEWCOMER PAPER

Martin Abadi (University of California, Santa Cruz), Bogdan Warinschi (Computer Science Dept, University of California, Santa Cruz)

http://www.cs.ucsd.edu/~bogdan/pdf/xml.pdf

 

Simulatable Auditing

Krishnaram Kenthapadi (Stanford University), Nina Mishra (HP Labs/Stanford), Kobbi Nissim (Ben-Gurion University)

 

Practical Privacy: The SuLQ Framework

Avrim Blum (Carnegie Mellon), Cynthia Dwork (Microsoft Research), Frank McSherry (Microsoft Research), Kobbi Nissim (Ben-Gurion University)

 

Privacy-Enhancing k-Anonymization of Customer Data

Sheng Zhong (Stevens Institute of Technology), Zhiqiang Yang (Stevens Institute of Technology), Rebecca Wright (Stevens Institute of Technology)

 

Session 4: Data Integration & Interoperability

Computing Cores for Data Exchange: New Algorithms and Practical Solutions

Georg Gottlob (TU Wien, Inst f. Informationssysteme)

 

Peer Data Exchange

Ariel Fuxman (University of Toronto), Phokion Kolaitis (IBM Almaden Research Center), Renee J. Miller (University of Toronto), Wang-Chiew Tan (University of California at Santa Cruz)

 

Composition of Mappings Given by Embedded Dependencies

Alan Nash (University of California, San Diego), Phil Bernstein (Microsoft Research), Sergey Melnik (Microsoft Research)

 

 

Session 5: Data Mining / Transaction Management

Multi-Structural Databases

Ronald Fagin (IBM Almaden Research Center), R Guha (IBM), Ravi Kumar (IBM), Jasmine Novak (IBM), D Sivakumar (IBM), Andrew Tomkins (IBM)

 

A Divide-and-Merge Methodology for Clustering

David Cheng (MIT), Ravi Kannan (Yale University), Santosh Vempala (MIT), Grant Wang (MIT)

 

Allocating Isolation Levels to Transactions

Alan Fekete (University of Sydney)

 

 

Session 6: Complexity & Performance Evaluation / Data Stream Management

Buffering in Query Evaluation over XML Streams

Ziv Bar-Yossef (Technion), Marcus Fontoura (IBM Almaden), Vanja Josifovski (IBM Almaden)

 

Histograms Revisited: When are histograms the best approximation method for aggregates over joins?

Alin Dobra (University of Florida)

 

Lower Bounds for Sorting with Few Random Accesses to External Memory

Martin Grohe (Humboldt-Universitaet), Nicole Schweikardt (Humboldt-Universitaet Berlin)

 

Session 7: Data Stream Management

Operator Placement for In-Network Stream Query Processing

Utkarsh Srivastava (Stanford University), Kamesh Munagala (Duke University), Jennifer Widom (Stanford University)

 

Join-Distinct Aggregate Estimation over Update Streams

Minos Garofalakis (Bell Labs), Sumit Ganguly (IIT Kanpur), Amit Kumar (IIT Delhi), Rajeev Rastogi (Bell Labs)

 

Space Efficient Mining of Multigraph Streams

Graham Cormode (Bell Labs), S. Muthukrishan (Rutgers University)

 

Session 8: Information Processing on the Web

XML Type Checking with Macro Tree Transducers

Sebastian Maneth (Ecole Polytechnique Federal de Lausanne), Thomas Perst (Technische Universität München), Alexandru Berlea (TU München), Helmut Seidl (Technische Universität München)

 

Regular and Unambiguous Rewritings for Active XML

Serge Abiteboul (INRIA),  Tova Milo(Tel Aviv University), Omar Benjelloun (Stanford University)

 

Determining Source Contribution in Information Integration Systems

Alin deutsch (UC San Diego), Yannis Katsis (UC San Diego), Yannis Papakonstantinou (University of California at San Diego, USA)

 

Session 9: Databases & Information Retrieval / Data Mining

Estimating arbitrary subset sums with few probes

Noga Alon (Tel-Aviv University), Nick Duffield (AT&T Labs Research), Carsten Lund (AT&T Labs Research), Mikkel Thorup (AT&T Labs Research)

 

FTW: Fast Similarity Search under the Time Warping Distance

Yasushi Sakurai (NTT), Masatoshi Yoshikawa (Nagoya University), Christos Faloutsos (Carnegie Mellon University)

 

Space Complexity of Hierarchical Heavy Hitters in Multi-Dimensional Data Streams

Nisheeth Shrivastava (University Of California, Santa Barbara), John Hershberger (Mentor Graphics Corp), Subhash Suri (University of California, Santa Barbara), Csaba Toth (Massachusetts Institute of Technology)

 

Session 10: Logic in Databases

Differential Constraints

Bassem Sayrafi (Indiana University), Dirk Van Gucht (Indiana University)

 

Diagnosis of Asynchronous Discrete Event Systems - Datalog to the Rescue!

Serge Abiteboul (INRIA), Zoe Abrams (Stanford University), Stefan Haar (IRISA), Tova Milo  (Tel Aviv University)

 

Relative Risk and Odds Ratio: A Data Mining Perspective

Haiquan Li (institute for infocomm research), Jinyan Li (institute for infocomm research), Limsoon Wong (institute for infocomm research), Mengling Feng (Nanyang Technological University), Yap Peng Tan (Nanyang Technological University)