Hongrae Lee
Department of Computer Science
University of British Columbia
X415 - 2366 Main Mall
Vancouver, B.C. Canada, V6T 1Z4
Phone: 604-827-3984(0)
http://www.cs.ubc.ca/~xguy
xguy at cs dot ubc dot ca


About Me

  News: I defended my dissertation on September 17, 2010. Yay! I will be joining Google Research as a research scientist soon.

  I am a Ph.D. candidate in the Department of Computer Science at the University of British Columbia, graduating this summer. My supervisor is Raymond Ng. I earned a Master's degree in Computer Science at Seoul National University in Korea under the supervision of Hyoung-Joo Kim. I have also been working closely with Kyuseok Shim on research. I did my undergraduate studies in Nuclear Engineering at the same school. Before graduate studies, I spent several years in industry, participating in many projects on data management systems and distributed data management. The complete project list can be found here. My motto for myself is 'Stay Hungry. Stay Foolish.'

 
Research Interests

  I'm interested in the effective and efficient handling of text in databases. As technology advances, users generate vast amounts of text data such as blogs, comments, tweets and profiles. Such data are rarely error-free: they contain typos and follow different spelling conventions (e.g. Silvia and Sylvia), and they may be collected from multiple data sources. For these reasons, there has been growing interest in approximate, or error-tolerant, query processing in databases. My research focuses on developing size estimation techniques for approximate text queries, which are crucial for optimizing such queries.

 

Publications

  Hongrae Lee, Raymond Ng, and Kyuseok Shim. Similarity Join Size Estimation using Locality Sensitive Hashing. To appear in Proceedings of the VLDB Endowment (PVLDB), Seattle, USA, 2011.

  Ingyu Lee, Hongrae Lee, and Byung-Won On. Unsupervised Methods for Resolving Mixed Entities on the Web. In Proceedings of International Congress on Computer Applications and Computational Science (CACS), Singapore, 2010.

  Surajit Chaudhuri, Hongrae Lee, and Vivek Narasayya. Variance Aware Optimization of Parameterized Queries. In Proceedings of 36th International Conference on Management of Data (SIGMOD), Indianapolis, USA, 2010.

  Hongrae Lee, Raymond Ng, and Kyuseok Shim. Power-Law Based Estimation of Set Similarity Join Size. In Proceedings of the VLDB Endowment (PVLDB), Vol 2, Number 1, pages 658-669, Lyon, France, 2009.

  Hongrae Lee, Raymond Ng, and Kyuseok Shim. Approximate Substring Selectivity Estimation. In Proceedings of 12th International Conference on Extending Database Technology (EDBT), pages 827-838, Saint-Petersburg, Russia, 2009.

  Hongrae Lee, Raymond Ng, and Kyuseok Shim. Extending Q-Grams to Estimate Selectivity of String Matching with Low Edit Distance. In Proceedings of 33rd International Conference on Very Large Data Bases (VLDB), pages 195-206, Vienna, Austria, 2007. [pdf]

  Hongrae Lee, Kyuseok Shim, and Hyoung-Joo Kim. Compact Suffix Graph for Substring Selectivity Estimation. Journal of KISS, 34(2), Apr. 2007 (In Korean)


Awards and Honors


Activities


Personal