CSE 725 Current Problems in Computer Vision

A. Course objectives and preliminaries

This is the website for my Fall 2009 seminar on computer vision. The purpose of the seminar is to explore (take a guess) current problems being addressed by the computer vision community: those issues, approaches and applications evoking notable current research interest and promise. At the end of the seminar, participants should be aware of major trends and directions in current computer vision research, have selected one to explore in some depth, and given a tutorial presentation on it.

The first two seminar meetings will be devoted to surveying the various topics to be discussed throughout the rest of the semester. Each registered student will select one of these topics (or suggest a different topic which I approve), and deliver a presentation on that computer vision research topic based on a literature search of work in the area. The goal of each presentation is to present a tutorial description of the problems and methods associated with the selected topic, and the presenter's thoughts concerning the most important unsolved problems and future directions of research in the area.

This website will serve mainly as a repository of files, links and other information relevant to the course. As student presentations are completed, they will be added to the website. The presentation, and attendance at most/all of the other student presentations, are the only course requirements for students registering for 1 credit. For those registering for 2-3 credits, at the end of the semester, each such student will submit a research paper, similar in scope, length and scholarship to a paper tendered to a good conference in the field. The paper should be built on the presentation, adding some original research ideas and results to the tutorial material of the presentation. As is the custom in our department, seminars will be graded S/U unless otherwise letter grades are requested on an individual basis.

Enrollment is limited to grad students with at least one previous course in image processing or computer vision. The group will meet weekly for one 2 hour session at a time of the week that fits into everyone's schedule. On the first day of class for the Fall semester, Monday Aug 31, I will email everyone who has registered for the course and ask their class and teaching schedules. After perhaps one or two iterations as students add and drop, I will then send a final email around indicating what our regular meeting time and place for the semester will be. If you are interested in attending but do not register for the course, please check my office door (Bell 136), I will post the time and place for our meetings as soon as this information is available.

I will have office hours in my office Bell 136 this semester 2:00-3:00pm Mon and Tues, 3:00-4:00pm Wed. I can be reached by email at peter@buffalo.edu or phone at 716-645-3180 x 137.

Discussion Q&A: Can every message be represented by a bitstream?

B. Syllabus

This is the current syllabus, with a helpful introductory reference for each topic linked. Registered students are asked to review these references before the class in which they will be surveyed. These references are just basic tutorial introductions, which give the flavor of the topic but should not be expected to supply detail and technical information adequate for a thorough understanding. What they will do is facilitate good discussion by grounding each seminar participant in the basic tutorial material before the session in which they will be discussed.

1. Image processing from still images

1.1 Segmentation: fast marching http://math.berkeley.edu/%7Esethian/Explanations/fast_marching_explain.html

Select Fast Marching Methods from frame.
1.2 Shape recovery: level sets http://math.berkeley.edu/%7Esethian/Explanations/level_set_explain.html

Select Level Set Methods from frame.
1.3 Segmentation: graph cuts http://www.cis.upenn.edu/~jshi/GraphTutorial/Tutorial-ImageSegmentationGraph-cut1-Shi.pdf

Select Part 1.
1.4 Registration: image transformations http://www.cs.wright.edu/~agoshtas/CVPR04_Registration_Tutorial.html

Select Transformation Functions
1.5 Registration of images with maps Haala_2003.pdf
1.6 JPEG2000 Impoco_2004.pdf

2. Computer vision from still images

2.1 OR: SVM methods http://www.support-vector.net/icml-tutorial.pdf
2.2 OR: Learning LeCun_2004.pdf
2.3 Free-viewpoint rendering http://iphome.hhi.de/smolic/docs/Smolic_ICIP04_FVV.pdf
2.4 Multi-view 3D reconstruction Vogiatzis_2005.pdf
2.5 Multi-sensor image fusion http://www.ece.lehigh.edu/SPCRL/IF/image_fusion.htm
Select Section 1: Introduction and Section 2: Review of Image Fusion Research

3. Video processing

3.1 Particle filters for tracking Li_2004.pdf
3.2 MPEG-4/H.264 video coding http://www.m4if.org/resources/IEEESpectrum/mpeg-4.htm
3.3 Model-based tracking http://csdl2.computer.org/comp/proceedings/ismar/2003/2006/00/20060313.pdf
3.4 Tracking using Bayes' nets Murphy_2003.pdf
3.5 Free-viewpoint video http://www.merl.com/reports/docs/TR2003-137.pdf
3.6 Motion detection http://lear.inrialpes.fr/people/triggs/events/iccv03/cdrom/iccv03/0734_viola.pdf
3.7 Motion segmentation Weiss_2003.pdf

4. Computer vision from video

4.1 Detecting surprises, abnormalities Koller_2002.pdf
4.2 Action Recognition Bregonzio_2009.pdf
4.3 Detecting, tracking people Cutler_2003.pdf
4.4 Wayfinding for the vis impaired Nagarajan_2003.pdf
4.5 SLAM: Simultaneous Localization and Mapping Hiebert-Treuer_2007.pdf

5. Mathematical foundations

5.1 Representation: wavelets http://www.amara.com/IEEEwave/IEEEwavelet.html

5.2 Superresolution http://www.ece.rice.edu/~rwagner/docs/wagnerSPIE04.pdf

5.3 Learning Weng_2000.pdf

5.4 Modelling uncertainty: prob, belief, fuzzy, rough sets, intervals, etc. Smets_1999.pdf

5.5 Evaluation methods Martens_2002.pdf

5.6 Mosaicking Hsieh_2003.pdf

5.7 A contrario detection methods VISTA_2004.pdf

5.8 Belief and image uncertainty Sun_2003.pdf

6. Architectures and hardware

6.1 FPGA architectures http://www.us.design-reuse.com/articles/article10943.html

6.2 Vision on a chip http://www.nasatech.com/Briefs/Feb00/NPO20449.html

6.3 Smart cameras Leeser_2004.pdf

7. Miscellaneous

7.1 Watermarking Perez _200X.pdf

7.2 Image steganography Johnson_1997.pdf

7.3 Inpainting methods http://www.math.ucla.edu/~imagers/htmls/inp.html

8. Topics which will not be discussed (duplicates other courses)

1. Face recognition
2. Image indexing or retrieval
3. Biometrics (eg. fingerprints)
4. CV for bioinformatics
5. Active or animate vision

Class Schedule

Student presenters and their topics will be filled in as they become scheduled.

Week	Presenters	Topic	Prep Readings
1 08/31 - 09/04		No meeting - set up schedule
2 09/07 - 09/11	Peter Scott	Overview of Topics I	1.1-1.5
3 09/14 - 09/18	Peter Scott	Overview of Topics II	2.1-7.3
4 09/21 - 09/25	Kedar Sarmalkar Praveen Santhanam	Steganography SLAM	Berghel, wikipedia Riisgard, wikipedia
5 09/28 - 10/02	No seminar meeting	Prof at a conference out of town
6 10/05 - 10/09	Balakrishna Thiagarajan Niranjan Kamat	Graph Cut Segmentation Tracking via GMMs	Sharon, Thiagarajan Grest, Stauffer
7 10/12 - 10/16	Santhosh Kandalu Qiang Gao	Motion detection 3-D Terrain Following	Shahinfard, Lee Bors-1, Bors-2
8 10/19 - 10/23	Ning Zhang Lynn Lobo	Face Detection Learning	Tu, Viola PCA, LDA, Regr. Trees
9 10/26 - 10/30	Mukul Apte Bhushanshirish Chitte	Super-resolution Watermarking	Park, wikipedia Provos
10 11/02 - 11/06	No seminar meeting	Presenter health problems
11 11/09 - 11/13	Ranjan Seetharama Porchelvi Vijayakumar	Wayfinding for the visually impaired Bayesian learning	Nagarajan, Heyes wikipedia, Gopnik
12 11/16 - 11/20	Vikas Choudary	CAPTCHA	Wikipedia, Chew
13 11/23 - 11/27	No class	Fall recess
14 11/30 - 12/04	Group discussion	Seminar summary and wrap-up*	Nagel, Piccardi

* In preparation for this meeting, please read the Nagel paper What It Is Like To Be A Bat and the short Piccardi review article, Recent Advances in Computer Vision, and review your own presentation. At this final meeting we will do three things:

1. Go around the room and get everyone's summary of what they posted concerning the completeness of bitstreams over the space of messsages, and whether their view has changed since then, in light of further reflection, other posted views, or the Nagel paper.

2. Ask each seminar member to identify one research idea of their own on the topic of their presentation. Hopefully, these are not ideas contained in one of the references each presenter used, rather each idea contains some degree of novelty. If you are actively working on one now, describe what you are doing.

3. I will summarize the current state of work in computer vision research as I see it, and as we studied it in this seminar, comparing my view to that of Piccardi. I will try identify where I believe the most consequential research is occuring and can be started.

Unlike our previous meetings, attendance at this last meeting is mandatory. This is the meeting where we "pick the fruit" grown during the course of the seminar.