About Me
Research Interests
I'm interested in the effective and efficient handling of text in databases.
As technology advances, users generate vast amounts of text data such as blogs, comments, tweets and profiles.
Such data are rarely error-free: they contain typos and follow different spelling conventions (e.g. Silvia and Sylvia).
They may also be collected from multiple data sources.
For these reasons, there has been growing interest in approximate, or error-tolerant, query processing in databases.
My research focuses on developing size estimation techniques
for approximate text queries, which are crucial for optimizing such queries.

Publications
Anish Das Sarma, Lujun Fang, Nitin Gupta, Alon Halevy, Hongrae Lee, Fei Wu, Reynold Xin and Cong Yu.
Finding Related Tables.
To appear in Proceedings of 38th International Conference on Management of Data (SIGMOD), Scottsdale, Arizona, USA, 2012.
|
|
| |
Changkyu Kim, Jongsoo Park, Nadathur Satish, Hongrae Lee, Jatin Chhugani and Pradeep Dubey.
CloudRAMSort: Fast and Efficient Large-Scale Distributed RAM Sort on Shared-Nothing Cluster.
To appear in Proceedings of 38th International Conference on Management of Data (SIGMOD), Scottsdale, Arizona, USA, 2012.
|
|
| |
Anish Das Sarma, Hongrae Lee, Hector Gonzalez, Alon Halevy and Jayant Madhavan.
Efficient Spatial Sampling of Large Geographical Tables.
To appear in Proceedings of 38th International Conference on Management of Data (SIGMOD), Scottsdale, Arizona, USA, 2012.
|
|
| |
Jongik Kim and Hongrae Lee.
Efficient Exact Similarity Searches using Multiple Token Orderings.
To appear in Proceedings of 28th IEEE International Conference on Data Engineering (ICDE), Washington DC, USA, 2012.
|
|
| |
Hongrae Lee, Raymond Ng and Kyuseok Shim.
Similarity Join Size Estimation using Locality Sensitive Hashing.
In Proceedings of the VLDB Endowment (PVLDB), Seattle, WA, USA, 2011.
|
|
| |
Ingyu Lee, Hongrae Lee and Byung-Won On.
Unsupervised Methods for Resolving Mixed Entities on the Web.
In Proceedings of International Congress on Computer Applications and Computational Science (CACS), Singapore, 2010.
|
|
| |
Surajit Chaudhuri, Hongrae Lee and Vivek Narasayya.
Variance Aware Optimization of Parameterized Queries.
In Proceedings of 36th International Conference on Management of Data (SIGMOD), Indianapolis, Indiana, USA, 2010.
|
|
| |
Hongrae Lee, Raymond Ng and Kyuseok Shim.
Power-Law Based Estimation of Set Similarity Join Size.
In Proceedings of the VLDB Endowment (PVLDB), Vol 2, Number 1, pages 658-669, Lyon, France, 2009.
|
|
| |
Hongrae Lee, Raymond Ng and Kyuseok Shim.
Approximate Substring Selectivity Estimation.
In Proceedings of 12th International Conference on Extending Database Technology (EDBT), pages 827-838,
Saint-Petersburg, Russia, 2009.
|
|
| |
Hongrae Lee, Raymond Ng and Kyuseok Shim.
Extending Q-Grams to Estimate Selectivity of String Matching with Low Edit Distance.
In Proceedings of 33rd International Conference on Very Large Data Bases (VLDB),
pages 195-206, Vienna, Austria, 2007.
|
| |
Hongrae Lee, Kyuseok Shim and Hyoung-Joo Kim. Compact Suffix Graph for
Substring Selectivity Estimation. Journal of KISS, 34(2), Apr. 2007 (In Korean)
|
|

Awards and Honors