- Database Systems: relational, deductive, object-oriented, and hierarchical ( not what you think!).
- Query Languages and Query Optimization.
- Data Analysis: multi-dimensional databases, data warehousing, OLAP,
- Information Integration: multi-database systems, interoperability,
schema integration and evolution.
- Newer Data Models: social networks, semi-structured data, data on the web, XML.
- Networks: social networks, LDAP network directories, network data management.
- Logic Programming: extensions, semantics, and optimization; application for data cleaning.
- Higher-Order Logics and Advanced Database Models.
- Uncertainty in Data Modeling and Deduction.
- Personalization and Social Web: recommender systems, social networks and media.
- Sports Analytics: for now, cricket.
As the world we live in is getting more and more networked, the need
to understand, manage, and harness the data on the web is becoming critical.
While data in traditional databases tends to be highly structured, with
a clear notion of schema, data on the web is loosely structured (also
called semi-structured), or worse, unstructured, and is often not accompained
by any clear notion of schema. What does it mean to query this data? What
do you look for when you mine this data? If there are several data sources
containing related information, how do you combine the information in
them to answer queries involving them all? How can you index such data
for efficient storage and retrieval? What do you do when the data you
want to analyze is not stored some place but is streaming through? My
research has been concerned with addressing these
questions. I am also interested in newer applications which challenge
the foundations and technology of databases.
More recently, I am interested in integrating the paradigms of
database-style querying, IR-style search, and RecSys-style recommendations.
And I want to do this taking user's context into account. Context as in the social
neighborhood of the user as well as context as in the user's current information needs
or her current task. Opinions and "intelligence" of the crowd is something to be
naturally harnessed in this setting. Stay tuned for more information on what drives
my research these days.
- Next Generation Applications of Social Systems (supported by NSERC Discovery Grant)
- Business Intelligence (funded by an NSERC Strategic Grant on Business Intelligence Network, in
cooperation with top universities in Canada and leading industries including IBM/Cognos and
- Analyzing Influential Behavior in Very Large Social Networks
(with Francesco Bonchi and Amit Goyal)
- Data Cleaning Using Integrity Constraints (with Leo Bertossi and Solmaz Kolahi)
- Keyword Oriented Online Analytical Processing (KOOLAP)
(with Shanshan Chen, Naresh Kumar, Pei Lee, and Min Xie)
- Building Taxonomies from Documents and Social Media
(with Ali Moosavi and Rachel Pottinger)
- Data Cleaning in Data Warehouses (with Leo Bertossi)
- Privacy-aware Recommender Systems (with Rui Chen and Min Xie)
- Sports Analytics: Outcome Prediction of One Day International Cricket Matches
- Query Processing by, for, and of the Masses: Putting the User in the Loop
(funded by NSERC)
- Social Networks Structure Analysis (with Francesco Bonchi, David Gleich, Chen Grief and Pooya Esfandiar)
- Privacy-preserving data mining (with Shaofeng Bu and Raymond Ng)
Data Warehouse, funded by NSERC
Constraint-Driven Data Mining on Heterogeneous Data Stores, funded
by BC Advanced Systems Institute.
New Frontiers in Data Mining (Joint Project: UBC/UofT/SFU), funded
Building Querying, Analyzing, and Mining Data Warehouses on the Internet
(Joint Project: Concordia/SFU/UBC/UofT), funded by NCE/IRIS
- Data Warehousing and OLAP
- Database Mining
- Semi-structured Data and XML
- Network Directories (aka LDAP)
- Information Integration