Dela

Research Areas Sound and Image Processing

VQ design

The research at the Sound and Image Processing (SIP) Laboratory includes a variety of topics that relate to the application of signal processing and information theory to speech, audio, and video signals and images with the aim to facilitate communication. Examples of research topics that we are active in are auditory modeling, generic source coding (the figure on the right shows a lattice quantizer), audio and speech coding, video coding, speech enhancement, objective methods for speech-quality estimation, robust encoding for packet networks, audio-visual context awareness, etc. An area of special emphasis is technology for hearing impaired people.

Our work is funded by a variety of sources. Industrial sponsors of research projects at the Sound and Image Processing Lab include Ericsson, GN Resound, and Global IP Solutions. Public sponsors of research projects include the European Commission and the Swedish Research Council. The projects of SIP include:

HearCom: technologies for the hearing-impaired (EC)

FlexCode: flexible coding for heterogeneous networks (EC)

ACORNS: machines learning human communication skills (EC)

AUDIS: Digital Signal Processing in Audiology (EC)

Auditory acuity of cochlear implants (Swedish Research Council)

Adaptive Coding (Swedish Research Council)

3D video (Ericsson / KTH ACCESS Center)

Some additional information relating specifically to SIP research in hearing technology can be found on the following link:

Hearing research at SIP

Hearing implant

The picture on the right shows a cochlear implant.

High-level descriptions of earlier research are found in our annual reports:

Annual Report 2006 (PDF) (pdf 78 kB)

Annual Report 2005 (PDF) (pdf 113 kB)

Annual Report 2004 (PDF) (pdf 347 kB)

Annual Report 2003 (PDF) (pdf 198 kB)

Recent SIP research publications can be found through the following link:

Publications