MULTIMEDIA/MULTIMODAL
SIGNAL PROCESSING, ANALYSIS, AND UNDERSTANDING
Prof.
Tom. S. Huang
Beckman
Institute for Advanced Science and Technology, University
of Illinois at Urbana-Champaign
Dia 18 de Março 2005 (6ª
feira), 11 horas, Anfiteatro EA4
Aula Especial da cadeira de Comunicação de Áudio e Vídeo
Abstract
"Multimodal" refers to the
different senses (visual, audio, tactile, etc.) used in human-computer
interface. "Multimedia" refers to the different ways of
representing information (text, graphics, audio, images, video, etc.). A signal
processing, analysis, or understanding task is called Multimedia/Multimodal, if
it involves two or more modalities or media, interacting in non-trivial ways.
We shall give an array of examples of multimedia/multimodal signal processing,
analysis, and understanding; including: Audio/visual speech recognition,
and audio/visual emotion recognition. A stable and robust facial movement
tracking algorithm will be presented, which is used in both tasks.
Biography
Thomas S. Huang received his
B.S. Degree in Electrical Engineering from National
Taiwan University,
Taipei, Taiwan,
China, and his
M.S. and Sc.D. Degrees in Electrical Engineering from the Massachusetts
Institute of Technology, Cambridge, Massachusetts.
He was on the Faculty of the Department of Electrical Engineering at MIT from
1963 to 1973; and on the Faculty of the School
of Electrical Engineering and
Director of its Laboratory for Information and Signal Processing at Purdue
University from 1973 to 1980. In
1980, he joined the University of Illinois at Urbana-Champaign, where he is now
William L. Everitt Distinguished Professor of
Electrical and Computer Engineering, and Research Professor at the Coordinated
Science Laboratory, and Head of the Image Formation and Processing Group at the
Beckman Institute for Advanced Science and Technology and Co-Chair of the
Institute's major research theme Human Computer Intelligent Interaction.
During his
sabbatical leaves: Dr. Huang has worked at the MIT Lincoln Laboratory, the IBM
Thomas J. Watson Research Center, and the Rheinishes Landes Museum in Bonn, West Germany, and held visiting
Professor positions at the Swiss Institutes of Technology in Zurich and
Lausanne, University of Hannover in West Germany, INRS-Telecommunications of
the University of Quebec in Montreal, Canada and University of Tokyo,
Japan. He has served as a consultant to numerous industrial firms and
government agencies both in the U.S.
and abroad.
Dr. Huang's
professional interests lie in the broad area of information technology,
especially the transmission and processing of multidimensional signals. He has
published 14 books, and over 500 papers in Network Theory, Digital Filtering,
Image Processing, and Computer Vision. He is a Member of the National Academy
of Engineering; a Foreign Member of the Chinese Academies of Engineering and
Sciences; and a Fellow of the International Association of Pattern Recognition,
IEEE, and the Optical Society of American; and has received a Guggenheim
Fellowship, an A.V. Humboldt Foundation Senior U.S. Scientist Award, and a
Fellowship from the Japan Association for the Promotion of Science. He received
the IEEE Signal Processing Society's Technical Achievement Award in 1987, and the Society Award in 1991. He was awarded
the IEEE Third Millennium Medal in 2000. Also in 2000, he received the Honda
Lifetime Achievement Award for "contributions to motion analysis". In
2001, he received the IEEE Jack S. Kilby Medal. In
2002, he received the King-Sun Fu Prize, International Association of Pattern
Recognition; and the Pan Wen-Yuan Outstanding
Research Award. He is a Founding Editor of the International Journal Computer
Vision, Graphics, and Image Processing; and Editor of the Springer Series in
Information Sciences, published by Springer Verlag.