Fraunhofer IDMT
 
 
 
 
 
 
 
 
 
 
 Fraunhofer-Gesellschaft
   
 Department Cluster Audio/Multimedia

 

Address

 

Fraunhofer Institute for Digital Media Technology IDMT
Ehrenbergstraße 31
98693 Ilmenau, Germany

Phone: +49 (0) 36 77/4 67-3 55
Fax: +49 (0) 36 77/4 67-4 67
schuller@idmt.fraunhofer.de

University Address

Technische Universität Ilmenau
Institut fuer Medientechnik
Helmholtzplatz 2, Zi. 3527
98693 Ilmenau, Germany

Phone: +49 (0) 36 77/69-2756
gerald.schuller@tu-ilmenau.de


Professional Activities

 
  • Associate Editor for the IEEE Transactions on Speech and Audio Processing, March 2002 to Feb. 2006
  • Associate Editor for the IEEE Transactions on Signal Processing, since Feb. 2006
  • Member of the IEEE Technical Committee on Audio and Electroacoustics
  • Member of the IEEE Technical Committee on Speech Processing
  • Member of the Audio Engineering Society (AES) Technical Committee on Coding of Audio Signals
  • Guest Editor, EURASIP Journal on Applied Signal Processing, Special Issue on Multirate Systems and Applications, since Oct. 2005.
  • Technical Program Committee Member for the 14th European Signal Processing Conference 2006.
  • Workshop Chair and Organizer for the AES Workshop “Next Generation Audio Communications”, 119th AES Convention, New York, October 7-10, 2005
  • Member of the technical committee, review committee and session chair for several sessions at the International Conference on Acoustics, Speech, and Signal Processing (ICASSP) and the Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
  • Publication Chair for the 2002 IEEE International Workshop on Multimedia Signal Processing
  • ISO/MPEG SC29/WG11 participation in lossless audio coding standardization

Research Interests

 

Audio Coding, Digital Signal Processing, Filter Banks, Speech Coding, Image Coding, Communications

Example: efficient low delay filter banks


  • Structure of a modulated low delay analysis filter bank


  • Structure of a modulated low delay synthesis filter bank for perfect reconstruction


 

 

Here is a Matlab ASCII file of the impulse response of the baseband prototype h(n) of a filter bank with N=1024 bands, a filter length of 4096 taps, and a system delay of 2047 samples. The prototype is identical for analysis and synthesis. The modulating function is h_k(n) = h(n)*cos(pi/N*(k+0.5)*(n+0.5+N/2)) for the analysis and g_k(n) = h(n)*cos(pi/N*(k+0.5)*(n+0.5-N/2)) for the synthesis, where k is the band index.
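The two modulating functions can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `modulate` is hypothetical, the band count is reduced to N=4 to keep the output small, and the prototype used here is a placeholder sine window, not the low delay prototype from the Matlab file. No perfect reconstruction property is claimed for this placeholder.

```python
import math

def modulate(h, N, k, synthesis=False):
    """Cosine-modulate the prototype h to obtain the filter of band k.

    Analysis:  h_k(n) = h(n)*cos(pi/N*(k+0.5)*(n+0.5+N/2))
    Synthesis: g_k(n) = h(n)*cos(pi/N*(k+0.5)*(n+0.5-N/2))
    """
    shift = -N / 2 if synthesis else N / 2
    return [h[n] * math.cos(math.pi / N * (k + 0.5) * (n + 0.5 + shift))
            for n in range(len(h))]

# Assumption: small band count and a placeholder sine-window prototype,
# standing in for the N=1024, 4096-tap prototype described in the text.
N = 4
h = [math.sin(math.pi * (n + 0.5) / (2 * N)) for n in range(2 * N)]

# One impulse response per band, for analysis and synthesis.
analysis = [modulate(h, N, k) for k in range(N)]
synthesis = [modulate(h, N, k, synthesis=True) for k in range(N)]
```

To use the actual prototype, the list `h` would be replaced by the 4096 values read from the Matlab ASCII file, with N set to 1024.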

 

 

Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Publications

 

Books, Book Chapters

 

G. Schuller: "Audio Coding",
chapter in "Audio Signal Processing for Next-Generation Multimedia Communication Systems", Y. Huang, J. Benesty (Eds.), Kluwer Academic Publishers, 2004, ISBN 1-4020-7768-8

  G. Schuller: "Zeitvariante Filterbänke mit niedriger Systemverzögerung und perfekter Rekonstruktion",
VDI Verlag, Düsseldorf, 1999, ISBN 3-18-326721-7 (in German, Ph.D. Thesis)
 

I. Selesnick, G. Schuller: "The Discrete Fourier Transform",
chapter in the "Transforms and Data Compression Handbook", CRC Press LLC, Boca Raton, FL, 2001, ISBN 0-8493-3692-9


 

 

Journal Papers

 

G. Schuller, H. Krüger-Elencwajg: "Simulation von Operationsverstärkern",
Elektronik No. 14, 15, 16, 1989 (in German)

 

D. Warning, G. Schuller, H. Krüger-Elencwajg: "SPICE-Modellierung für Transimpedanzverstärker",
Elektronik No. 15, 1994 (in German)

G.D.T. Schuller and M. J. T. Smith: "New Framework for Modulated Perfect Reconstruction Filter Banks",
IEEE Transactions on Signal Processing, Vol. 44, No. 8, August 1996, pp. 1941–1954

 

G. Schuller: "Low Delay Filter Banks with Perfect Reconstruction",
Frequenz, 50(1996) 9–10

G. Schuller and T. Karp: "Modulated Filter Banks with Arbitrary System Delay: Efficient Implementations and the Time-Varying Case",
IEEE Transactions on Signal Processing, March 2000, pp. 737–748

G. Schuller, B. Yu, D. Huang, and B. Edler: "Perceptual Audio Coding using Adaptive Pre- and Post-Filters and Lossless Compression", IEEE Transactions on Speech and Audio Processing, September 2002, pp. 379–390 (IEEE Best Paper Award 2007)

G. Schuller, J. Kovačević, F. Masson, and Vivek K Goyal: "Robust Low-Delay Audio Coding Using Multiple Descriptions",
IEEE Transactions on Speech and Audio Processing, September 2005, pp. 1014–1024

Y. Yokotani, R. Geiger, G.D.T. Schuller, S. Oraintara, K.R. Rao: "Lossless Audio Coding Using the IntMDCT and Rounding Error Shaping",
IEEE Transactions on Audio, Speech, and Language Processing, Vol. 14, No. 6, Nov. 2006, pp. 2201–2211


 

 

Conference Papers

G. Schuller, M.J.T. Smith: "A General Formulation for Modulated Perfect Reconstruction Filter Banks with Variable System Delay",
Symposium on Applications of Subbands and Wavelets, Newark, N.J., March 18, 1994

G. Schuller, M.J.T. Smith: "Efficient Low Delay Filter Banks",
Sixth IEEE Digital Signal Processing Workshop, Yosemite, California, October 2–5, 1994

G. Schuller, M.J.T. Smith: "A New Algorithm for Efficient Low Delay Filter Bank Design",
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Detroit, Michigan, May 9–12, 1995

G. Schuller: "A Low Delay Filter Bank for Audio Coding with Reduced Pre-Echoes",
99th Audio Engineering Society (AES) Convention, New York, New York, October 6–9, 1995

G. Schuller: "An Overview Over Filter Banks With Low System Delay Capabilities",
European Workshop on Multirate Digital Signal Processing and Applications, Hamburg University of Technology, March 20–21, 1996

G. Schuller: "A New Factorization and Structure for Cosine Modulated Filter Banks with Variable System Delay",
Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, California, Nov. 3–6, 1996

G. Schuller: "Time-Varying Filter Banks with Variable System Delay",
ICASSP 97, Apr 21–24 1997, Munich, Germany

Poster for the ICASSP 97 paper

T. Karp, A. Mertins, and G. Schuller: "Recent Trends in the Design of Biorthogonal Modulated Filter Banks",
In Proc. TICSP Workshop on Transforms and Filter Banks, Tampere, Finland, February 1998

G. Schuller, T. Karp: "Causal FIR Filter Banks with Arbitrary System Delay",
DSP98 Workshop in Bryce Canyon, Aug. 9-12, 1998

 

G. Schuller: "Time-Varying Filter Banks with Low Delay for Audio Coding",
105th AES Convention, San Francisco, CA, Sep. 26–29, 1998

G. Schuller, W. Sweldens: "Modulated Filter Bank Design with Nilpotent Matrices",
SPIE 44th Annual Meeting, Denver, CO, July 19–23, 1999

G. Schuller, W. Sweldens: "Filter Bank Design using Nilpotent Matrices",
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, New York, Oct. 17–20, 1999

A. Doser, G. Schuller: "Time/Frequency Techniques for Signal Feature Detection",
33rd Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, Oct. 24–27 1999

B. Edler and G. Schuller: "Audio Coding Using a Psychoacoustic Pre- and Post-Filter",
IEEE International Conference on Acoustics, Speech, and Signal Processing, Istanbul, Turkey, June 5–9, 2000

 

B. Edler, C. Faller and G. Schuller: "Perceptual Audio Coding Using a Time-Varying Linear Pre- and Post-filter",
109th AES Convention, Los Angeles, CA, Sep. 22–25, 2000

G. Schuller, B. Edler, A. Doser: "A Method for Alias Reduction in Cascaded Filter Banks",
9th IEEE DSP Workshop, Hunt, TX, Oct. 15–18, 2000

S. Dorward, D. Huang, S. A. Savari, G. Schuller, B. Yu: "Low Delay Perceptually Lossless Coding of Audio Signals",
IEEE Data Compression Conference, Snowbird, Utah, March 27–29, 2001

G. Schuller, B. Yu, D. Huang: "Lossless Coding of Audio Signals using Cascaded Prediction",
IEEE International Conference on Acoustics, Speech, and Signal Processing, Salt Lake City, May 7–11, 2001

T. Karp, G. Schuller: "Joint Transmitter / Receiver Design for Multicarrier Data Transmission with Low Latency Time",
IEEE International Conference on Acoustics, Speech, and Signal Processing, Salt Lake City, May 7–11, 2001

V. Weerackody, G. Schuller, H.-L. Lou: "Streaming of Multimedia with Reduced Start-Up Delay",
IEEE International Conference on Communications, Helsinki, Finland, June 11-14, 2001

 

M. Kokes, J. Gibson, G. Schuller: "A Wideband Speech Codec Based on Nonlinear Approximation",
35th Asilomar Conference on Signals, Systems and Computers, Pacific Grove, California, Nov. 4–7, 2001

 

G. Schuller: "Low Delay Audio Coding for Communications Applications",
invited talk, DIMACS Working Group on Data Compression in Networks and Applications, Rutgers University, New Jersey, March 18–20, 2002

 

G. Schuller, J. Herre: "Speech Reverberation Artifacts in Audio Coding",
part of Workshop "Listening to Perceptual Audio Coders", 112th AES Convention, Munich, Germany, May 10–13, 2002

G. Schuller, A. Harma: "Low Delay Audio Compression using Predictive Coding",
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Orlando, FL, May 13–17, 2002

R. Geiger, G. Schuller: "Integer Low Delay and MDCT Filter Banks",
36th Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, Nov. 3–6, 2002

 

G. Schuller: "Coding of Stereophonic Signals",
part of Workshop "Coding of Spatial Audio: Yesterday, Today, and Tomorrow" 113th AES Convention, Los Angeles, CA, October 5-8, 2002, and 114th Convention, Amsterdam, The Netherlands, March 22-25, 2003

R. Geiger, G. Schuller: "Fine Grain Scalable Perceptual and Lossless Audio Coding Based on IntMDCT",
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hong Kong, April 6–10, 2003

R. Geiger, G. Schuller, J. Herre, R. Sperschneider, T. Sporer: "Scalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC",
115th AES Convention, New York, NY, October 10–13, 2003

R. Geiger, Y. Yokotani, G. Schuller: "Improved Integer Transforms for Lossless Audio Coding",
Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, November 9–12, 2003

R. Geiger, Y. Yokotani, G. Schuller, J. Herre: "Improved Integer Transforms Using Multi-Dimensional Lifting",
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Montreal, Canada, May 17–21, 2004

 

Tutorial, G. Schuller, J. Herre: "Audio Coding: Recent Advances and Standards",
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Montreal, Canada, May 17–21, 2004

Y. Yokotani, R. Geiger, G. Schuller, S. Oraintara, K. R. Rao: "Improved Lossless Audio Coding using the Noise-Shaped IntMDCT",
11th Digital Signal Processing Workshop, Taos Ski Valley, New Mexico, USA, August 1–4, 2004

 

M. Lutzky, G. Schuller, M. Gayer, U. Krämer, S. Wabnik: "A guideline to audio codec delay",
116th AES Convention, Berlin, Germany, May 8–11, 2004

U. Kraemer, G. Schuller, S. Wabnik, J. Klier, and J. Hirschfeld: "Ultra Low Delay audio coding with constant bit rate",
117th AES Convention, San Francisco, CA, Oct. 28–31, 2004

Y. Yokotani, S. Oraintara, R. Geiger, G. Schuller, K.R. Rao: "Approximation Noise Analysis for Transform-based Lossless Audio Coding",
IEEE Globecom 2004, Dallas, TX,
Nov. 29 – Dec 3, 2004

S. Wabnik, G. Schuller, U. Kraemer, J. Hirschfeld: "Frequency Warping in Low Delay Audio Coding",
IEEE International Conference on Acoustics, Speech, and Signal Processing, Philadelphia, PA, March 18–23, 2005

 

J. Klier, G. Schuller, M. Haardt, M. Hennhöfer: “A new approach for channel equalization without guard interval using polyphase matrices”, 16th Annual IEEE International Symposium on Personal Indoor and Mobile Radio Communications, Berlin, Sep. 11 - 14, 2005

S. Wabnik, Gerald Schuller, J. Hirschfeld, U. Kraemer: “Packet Loss Concealment in Predictive Audio Coding”,
2005 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Mohonk Mountain House, New Paltz, New York, Oct. 16-19, 2005

 

Organisation of the workshop "Next Generation Audio Communications", and talk,
at the 119th Audio Engineering Society (AES) Convention, New York, October 7-10, 2005

S. Wabnik, Gerald Schuller, J. Hirschfeld, U. Kraemer: "Different Quantization Noise Shaping Methods for Predictive Audio Coding",
IEEE International Conference on Acoustics, Speech, and Signal Processing, Toulouse, May 2006

R. Geiger, Y. Yokotani, and G. Schuller: "Audio Data Hiding with High Data Rates Based on IntMDCT",
IEEE International Conference on Acoustics, Speech, and Signal Processing, Toulouse, May 2006

S. Wabnik, G. Schuller, J. Hirschfeld, U. Kraemer: "Reduced Bit Rate Ultra Low Delay Audio Coding",
120th AES Convention, Paris, May 2006

A. Carôt, U. Krämer, G. Schuller: "Network Music Performance (NMP) in Narrow Band Networks",
120th AES Convention, Paris, May 2006

G. Schuller: "Filter Banks and Wavelets: Design and Use in Perceptual Coding",
Short Course at the SPIE Electronic Imaging Conference 2007, San Jose, California, USA, January 28 - February 1, 2007

T. Albert, G. Schuller, S. Wabnik, U. Kraemer, J. Hirschfeld: "Comparison of Stereo Redundancy Reduction Schemes for an Ultra Low Delay Audio Coder",
122nd AES Convention, Vienna, Austria, May 2007

M. Schnell, R. Geiger, M. Schmidt, M. Jander, M. Multrus, G. Schuller, J. Herre: "MPEG-4 Enhanced Low Delay AAC - Low Bitrate High Quality Communication",
122nd AES Convention, Vienna, Austria, May 2007

U. Kraemer, J. Hirschfeld, G. Schuller, S. Wabnik, A. Carot, and C. Werner: "Network Music Performance with Ultra-Low-Delay Audio Coding under Unreliable Network Conditions",
123rd AES Convention, New York, NY, October 5-8, 2007

T. Friedrich, G. Schuller: "A Spectral Band Replication Tool For Very Low Delay Audio Coding Applications",
2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, October 21-24, 2007

M. Schnell, R. Geiger, M. Schmidt, M. Multrus, M. Mellar, J. Herre, G. Schuller: "Low Delay Filterbanks For Enhanced Low Delay Audio Coding",
2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, October 21-24, 2007

S. Wabnik, G. Schuller: "A Reduced Rate Ultra Low Delay Audio Coder using VQ",
Invited paper, Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, November 4 - 7, 2007

Friedrich, Tobias; Gruhne, Matthias; Schuller, Gerald:
Subband Conversion for Feature Extraction from Compressed Audio, IEEE International Conference on Acoustics, Speech, and Signal Processing ICASSP 2008, March 30 – April 4, 2008, Las Vegas, NV, USA, Paper No. 7003

Friedrich, Tobias; Gruhne, Matthias; Schuller, Gerald:
A Fast Feature Extraction System on Compressed Audio Data, 124th AES Convention, May 17 – 20, 2008, Amsterdam, Netherlands

Gruhne, Matthias; Dittmar, Christian; Schuller, Gerald; Gaertner, Daniel: 
An Evaluation of Pre-Processing Algorithms for Rhythmic Pattern Analysis, 125th AES Convention, October 2-5, 2008, San Francisco, CA, USA

Schuller, Gerald; Kraemer, Ferenc:
Graceful Degradation for Digital Radio Mondiale (DRM), 125th AES Convention, October 2-5, 2008, San Francisco, CA, USA

Schuller, Gerald; Arnold, Mirko:
A Parametric Instrument Codec for Very Low Bitrates, 125th AES Convention, October 2-5, 2008, San Francisco, CA, USA

Neuendorf, Max; Gournay, Philippe; Multrus, Markus; Lecomte, Jérémie; Bessette, Bruno; Geiger, Ralf; Bayer, Stefan; Fuchs, Guillaume; Hilpert, Johannes; Rettelbach, Nikolaus; Salami, Redwan; Schuller, Gerald; Lefebvre, Roch; Grill, Bernhard:
Unified Speech and Audio Coding Scheme for High Quality at Low Bitrates, ICASSP 2009, April 19-24, 2009, Taipei, Taiwan

Wabnik, Stefan; Schuller, Gerald; Kraemer, Ferenc:
An Error Robust Ultra Low Delay Audio Coder Using an MA Prediction Model, ICASSP 2009, April 19-24, 2009, Taipei, Taiwan

Bayer, Stefan; Bessette, Bruno; Fuchs, Guillaume; Geiger, Ralf; Gournay, Philippe; Grill, Bernhard; Hilpert, Johannes; Lecomte, Jérémie; Lefebvre, Roch; Multrus, Markus; Nagel, Frederik; Neuendorf, Max; Rettelbach, Nikolaus; Robilliard, Julien; Salami, Redwan; Schuller, Gerald:
A Novel Scheme for Low Bitrate Unified Speech and Audio Coding,
126th AES Convention, May 7, 2009, Munich, Germany

G. Schuller, M. Werner:
"An Enhanced SBR Tool for Low-Delay Applications", 127th AES Convention, New York, NY, USA, October 9-12, 2009

A. Ferreira,  J. Herre,  Y. E. Kim,  B. Kleijn,  M. Sandler,  G. Schuller:
"What Will Perceptual Audio Coding Stand for 20 Years from Now?", Workshop, 127th AES Convention, New York, NY, USA, October 9-12, 2009

J. Abeßer, H. Lukashevich, C. Dittmar, G. Schuller:
"Genre Classification Using Bass-Related High-Level Features and Playing Styles", 10th International Society for Music Information Retrieval Conference, Kobe, Japan, October 26-30, 2009


Co-Authors

Mark J.T. Smith,
Purdue University, West Lafayette, Indiana

Tanja Karp,
Texas Tech University, Lubbock, TX

Alfred Mertins,
University of Luebeck, Germany

Wim Sweldens,
Bell Laboratories, Lucent Technologies, Murray Hill, NJ

 

Adele Doser,
Sandia National Laboratories

Bernd Edler,
University of Hannover, Germany

Christof Faller,
EPFL, Lausanne

Bin Yu,
University of California, Berkeley, CA

Dawei Huang,
Bell Labs China, Beijing, China

Serap Savari,
Texas A&M University

Sean Dorward,
Bell Laboratories, Lucent Technologies, Murray Hill, NJ

 

Mark Kokes,
Nokia Research Center, Nokia Inc., Irving, TX


Aki Harma,
DSP group, Philips Research


Ralf Geiger,
Fraunhofer Institute for Integrated Circuits, Erlangen, Germany


Ulrich Krämer,
Fraunhofer Institute for Digital Media Technology IDMT, Ilmenau, Germany


Stefan Wabnik,
Fraunhofer Institute for Digital Media Technology IDMT, Ilmenau, Germany


Jens Hirschfeld,
Fraunhofer Institute for Digital Media Technology IDMT, Ilmenau, Germany


Juliane Klier,
Fraunhofer Institute for Digital Media Technology IDMT, Ilmenau, Germany


Teaching

 

I taught the course Signals and Transforms, EL611, at the Brooklyn Polytechnic University, in 1999.
Since coming to Ilmenau I have been co-teaching (with Karlheinz Brandenburg) the course Audio Coding every winter semester at the Technical University of Ilmenau, Germany.
For the summer semester 2005 and winter semester 2005/06 I was a temporary full professor at the Institute of Media Technology there, and taught the courses

  • Übertragungssysteme (Communication Systems),
  • Mediendistribution (Media Distribution),
  • Praxiswerkstatt Algorithmen der Signalcodierung in Matlab (Algorithms of Signal Coding in Matlab)

During winter semester 2005/06 I taught the courses

  • Grundlagen der Videotechnik (Basics of Video Technology),
  • Angew. Videostudiotechnik 1 (Applied Video Studio Technology 1),
  • Multimediale Werkzeuge 1 (Multimedia Tools 1),
  • Audio Coding (in English, with Prof. Brandenburg)

Since summer semester 2008 I have been a full professor at the Technical University of Ilmenau, and a part-time member of Fraunhofer IDMT.

Links to those courses can be found here:
http://www.tu-ilmenau.de/site/mt/Studium_und_Lehre.2272.0.html


Short Bio

 

I studied mathematics in Clausthal-Zellerfeld and Bonn, Germany, from 1981 to 1984, and Electrical Engineering at the Technical University of Berlin from 1984 to 1989. After finishing my studies with the "Diplom" (M.S.) degree in Berlin I obtained a scholarship for the Massachusetts Institute of Technology, Cambridge, U.S.A., for the year 1989/90. Then I was a research assistant at the Technical University of Berlin from 1990 to 1992, a graduate student and teaching assistant at the Georgia Institute of Technology, Atlanta, U.S.A., in 1993, and a research assistant at the University of Bonn, Germany, in 1994. I was with the University of Hannover, Germany, from 1995, where I received my Ph.D. degree, with Bell Labs, Lucent Technologies, and Agere Systems from 1998 to 2001, and have been with the Fraunhofer Institute, Group for Electronic Media Technology (AEMT), Ilmenau, since 2001. In January 2004 the Group for Electronic Media Technology became the Institute for Digital Media Technology IDMT. For the summer semester 2005 and winter semester 2005/06 I was a temporary full professor at the Institute of Media Technology of the Technical University of Ilmenau, Germany.
Since summer semester 2008 I have been a full professor at the Technical University of Ilmenau, and a part-time member of Fraunhofer IDMT.


Download

 

Live recordings to demonstrate the ULD error concealment described in "Network Music Performance with Ultra-Low-Delay Audio Coding under Unreliable Network Conditions", 123rd AES Convention, New York

  • piano: Michael Stahl
  • bass: Alexander Carôt
  • trumpet: Bernhard Grill
  • guitar: Gerd Brohasga

Live recording of a session between a private apartment in Luebeck, Germany (DSL with 15 Mbps download and 800 kbps upload) and Fraunhofer IIS Erlangen, Germany (connection with approx. 54 Mbps) on August 17, 2007.

Links to the sound examples (wav-files):
Conceal Example 1
Conceal Example 2
Conceal Example 3

Matlab script for decoding DRM (Digital Radio Mondiale) recordings of Morphy Richards or Himalaya DRM receivers. The function uses the open source AAC decoder software FAAD v2.

Download Link

© 2005
Fraunhofer IDMT