Hongrae Lee
Department of Computer Science
University of British Columbia
X415 - 2366 Main Mall
Vancouver, B.C. Canada, V6T 1Z4
Phone: 604-827-3984(0)
http://www.cs.ubc.ca/~xguy
xguy at cs dot ubc dot ca


About Me

  I'm working at Google Research as a research scientist. I check this ubc email address once in a while, but it's easy to miss messages among many spams and other emails. My new contact is hrlee at goog led otcom.

  I graduated from the Database Group at the University of British Columbia with a Ph.D. degree in Computer Science in September 2010. My supervisor was Raymond Ng. I earned a Master degree in Computer Science at Seoul National University in Korea under the supervison of Hyoung-Joo Kim. I have also been working closely with Kyuseok Shim on research. I did my undergrad in Nuclear Engineering at the same school. Before grad studies, I spent several years in the industry participating in many projects in data management systems and distributed data management. The complete project list can be found here. I wish 'Stay Hungry. Stay Foolish.' for myself.

 
Research Interests

  I'm interested in effective and efficient handling of text in databases. With the advance of technologies, a vast amount of text data are generated by users such as blogs, comments, twits and profiles. Such data are hardly error-free with typos and different spelling conventions. They may also be collected from multiple data sources or use different spellings (e.g. Silvia and Sylvia). For these reasons, there have been growing interests in approximate or error tolerant query processing in databases. My research focuses on developing size estimation techniques for approximate text queries, which is crucial in optimizing such queries.

 

Publications

  Anish Das Sarma, Lujun Fang, Nitin Gupta, Alon Halevy, Hongrae Lee, Fei Wu, Reynold Xin and Cong Yu. Finding Related Tables. To appear in Proceedings of 38th International Conference on Management of Data (SIGMOD), Scottsdale, Arizona, USA, 2012.

  Changkyu Kim, Jongsoo Park, Nadathur Satish, Hongrae Lee, Jatin Chhugani and Pradeep Dubeyi. CloudRAMSort: Fast and Efficient Large-Scale Distributed RAM Sort on Shared-Nothing Cluster. To appear in Proceedings of 38th International Conference on Management of Data (SIGMOD), Scottsdale, Arizona, USA, 2012.

  Anish Das Sarma, Hongrae Lee, Hector Gonzalez, Alon Halevy and Jayant Madahavan. Efficient Spatial Sampling of Large Geographical Tables. To appear in Proceedings of 38th International Conference on Management of Data (SIGMOD), Scottsdale, Arizona, USA, 2012.

  Jongik Kim and Hongrae Lee. Efficient Exact Similarity Searches using Multiple Token Orderings. To appear in Proceedings of 28th IEEE International Conference on Data Engineering (ICDE), Washington DC, USA, 2012.

  Hongrae Lee, Raymond Ng and Kyuseok Shim. Similarity Join Size Estimation using Locality Sensitive Hashing. In Proceedings of the VLDB Endowment (PVLDB), Seattle, WA, USA, 2011.

  Ingyu Lee, Hongrae Lee and Byung-Won On. Unsupervised Methods for Resolving Mixed Entities on the Web. In Proceedings of International Congress on Computer Applications and Computational Science (CACS), Singapore, 2010.

  Surajit Chaudhuri, Hongrae Lee and Vivek Narasayya. Variance Aware Optimization of Parameterized Queries. In Proceedings of 36th International Conference on Management of Data (SIGMOD), Indianapolis, Indiana, USA, 2010.

  Hongrae Lee, Raymond Ng and Kyuseok Shim. Power-Law Based Estimation of Set Similarity Join Size. In Proceedings of Proceedings of the VLDB Endowment (PVLDB), Vol 2, Number 1, pages 658-669, Lyon, France, 2009.

  Hongrae Lee, Raymond Ng and Kyuseok Shim. Approximate Substring Selectivity Estimation. In Proceedings of 12th International Conference on Extending Database Technology (EDBT), pages 827-838, Saint-Petersburg, Russia, 2009.

  Hongrae Lee, Raymond Ng and Kyuseok Shim. Extending Q-Grams to Estimate Selectivity of String Matching with Low Edit Distance. In Proceedings of 33rd International Conference on Very Large Data Bases (VLDB), pages 195-206, Vienna, Austria, 2007. [pdf]

  Hongrae Lee, Kyuseok Shim and Hyoung-Joo Kim. Compact Suffix Graph for Substring Selectivity Estimation. Journal of KISS, 34(2), Apr. 2007 (In Korean)


Awards and Honors


Activities


Personal