Research Interest
I am interested in Computer Vision and Machine Learning.
At present, I specifically focus my research on probabilistic
object tracking and general object recognition.
|
|
An Adaptive Interface for Active Localization
|

Thanks to large-scale image repositories, vast amounts of data for object recognition are now easily available. However, acquiring training labels for arbitrary objects still requires tedious and expensive human effort. This is particularly true for localization, where humans must not only provide labels, but also training windows in an image. We present an approach for reducing the number of labelled training instances required to train an object classifier and for assisting the user in specifying optimal object location windows. As part of this process, the algorithm performs localization to find bounding windows for training examples that are best aligned with the current classification function, which optimizes learning and reduces human effort. To test this approach, we introduce an active learning extension to a latent SVM learning algorithm. Our user interface for training object detectors employs real-time interaction with a human user. Our active learning system provides a mean performance improvement of 4.5% in the average precision over a state of the art detector on the PASCAL Visual Object Classes Challenge 2007 with an average of just 40 minutes of human labelling effort per class.
Demo video of our interface: [mpeg in color (3.6MB)]
Presentation: [PDF (4.5MB)]
|
|
BPF: Fully Automatic Multi-target Tracking System
|
Boosted Particle Filter is a vision system that is capable of learning, detecting and
tracking the objects of interest.
The problem of tracking a varying number of non-rigid objects has two
major difficulties. First, the observation models and target distributions can be highly
non-linear and non-Gaussian. Second, the presence of a large, varying number of objects
creates complex interactions with overlap and ambiguities. To surmount these
difficulties, we introduce BPF.
The system is demonstrated in the context of tracking
hockey players using video sequences. Our approach combines the strengths of
two successful algorithms: mixture particle filters and Adaboost. The mixture particle
filter is ideally suited to multi-target tracking as it assigns a mixture component
to each player. The crucial design issues in mixture particle filters are the
choice of the proposal distribution and the treatment of objects leaving and entering
the scene. With BPF, we construct the proposal distribution using a mixture model
that incorporates information from the dynamic models of each player and the detection
hypotheses generated by Adaboost. The learned Adaboost proposal distribution
allows us to quickly detect players entering the scene, while the filtering process enables
us to keep track of the individual players. The result of interleaving Adaboost
with mixture particle filters is a simple, yet powerful and fully automatic multiple
object tracking system.
Movies: [mpeg in color (838KB)]
[mpeg in gray scale (343KB)]
Presentation: [mpeg (3.5MB)]
Source code (matlab) and data (jpegs): [ Ver. 1.3 (zip, 3.3MB)]
|
|
Hockey system: tracking hockey players!
|
|
Objective:
Development of the system that automatically
acquires trajectories of hockey players from video.
This project has two main stages:
1.
Compute the mapping from video to the rink map
&
||
We choose 13 point correspondences manually from the original
image and the rink map and compute homography between them.
The result of the transformation is shown above. Using this
transformation for the first frame of a sequence, we now
automatically compute transformations for the rest of the sequence. We
implement KLT, RANSAC, and model fitting to do this automation.
Two video clips (about 800 frames each) are presented below. One is a clip of the
original video. The other is a clip of a tranformed sequence.
Movie:
1. original sequence
2. transformed sequence
NOTE: Above transformed sequence looks like
a fast-forward version of the original sequence transformed.
This is because we process the sequence every 4th frames.
2.
Track hockey players
Now that we aquire the transformation from the original sequence
to the rink map, we track hockey players in order to gain their
trajectories.
Here is the example clip of how a hockey player is tracked:
Movie:
We finally combine all the results to visualize trajectories of
hockey players.
The trajectory of the movie above.(Only a single player for now):
|
|
Robot perception: Let Eric track people's face!
|
Eric
|
|
Eric is now equipped with my color-based sequential Monte
Carlo tracker to track people's faces. The link below is
a video clip to show how Eric can lock on my face
and track it nicely.
Movie:
Eric
Acknowledgement:
Pantelis Elinas for setting up Eric
Michael Zhang for filming
NOTE: In a vide clip, please focus on Eric's
head and how I move around.
|
|