Università degli Studi di Pavia

Centro Interdisciplinare di Bioacustica e Ricerche Ambientali

Via Taramelli 24 - 27100 Pavia - Italy
e-mail : cibra@unipv.it


The history of the CIBRA Digital Signal Processing Workstation

Before the establishement of the Centre in 1989, bioacoustic research has been carried out at the Institute of Entomology. The activities began in 1980 with the developement of a computer based digital spectrograph. The first one in Italy dedicated to the study of animal vocalizations; it was developed within my thesis for getting the degree in Natural Sciences.
The system was based on a desktop computer Tektronix 4052 (64K RAM, 300K tape storage, monochromatic graphical screen - the famous Tektronix storage tube, fast Basic interpreter in ROM) interfaced (IEEE-488) with a Bruel & Kjaer 2033 spectrum analyzer (10K samples memory, 12 bit AD converters, maximum sample rate 51200 samples/sec to get 20kHz bandwidth, 1024 points FFT, real-time display up to 2kHz) for signal acquisition and analysis. The system also included a disk unit Tektronix 4907 (three 5" floppy disks, 630K each), a Tektronix 4611 hardcopy unit (black & white, no shades) and a Tektronix 4663 plotter. The sound source was a portable open reel recorder SONY TC510-2.


From left to right: Tek 4611, B&K 2033, Tek 4052, Tek 4907, Tek 4663

This system allowed the first experiments of digital analysis on animal sounds in Italy. A series of programs was developed to perform a number of different tasks, including: acquisition of up to 60K samples (six 10K samples consecutive frames by means of pre- and post- triggering facilities available on the B&K analyzer), signal storage and retrieval, signal editing, waveform and envelope display, instantaneous spectrum and cepstrum, formant analysis, signal filtering by means of FFT forth and back transformation, spectrogram on up to 10K samples, frequency tracking, 3D waterfall display.
In the first version of the software, a monochromatic spectrogram on 10K samples (0.2 sec duration and 20kHz bandwidth at 51200 s/sec) required 40 minutes to be computed and displayed. Later, the use of a Tektronix FFT ROMPACK with optimized FFT routines allowed to reduce the computation time to about 15-20 minutes for each 10K block.
Initially, the programs were mainly used for experimentation, bird song analysis and microseismic studies. In 1984 the first analyses on fish sounds (recorded at the University of Parma by Prof. Patrizia Torricelli) were made and opened the pathway to underwater bioacoustics.


Printout of a bird song (1981). The image is made by assembling 6 screen hardcopies (3 for the spectrogram, 3 for the modulation curves).

In 1985, after getting the degree in Natural Sciences in 1983, the programs were rewritten in FORTRAN on a HP 1000 (128K RAM, 5MB removable HD, 10 bit AD converter, 512x512 pixels color graphic display) made available by the Dept. of Electronics of the University of Pavia. I am very grateful to Prof. R.Schmid, chief of the Department in that period, for having allowed me to use all the instruments without any restriction. That powerful instrumentation allowed the development of several new procedures, including continuous signal acquisition and HD recording, easy and fast signal retrieval and display, colour spectrograms, color manipulation on displayed images, wider control on analysis parameters, moving cursors to take measures on the spectrograms (time, frequency and dB readings), and, overall, speed! Few minutes to get a 0.625s spectrogram! While developing software on that system, I had the honour to see at work a Laben computer with 16Kbytes of "ferrite doughnut" RAM, a paper tape reader for programs and data I/O, an oscilloscope for graphical output.

Envelope display for signal retrieval and colour spectrogram of a signal frame (1985).

The emerging Personal Computers, even if limited in their performances and very expensive - more expensive than a car -, suggested me to start developing a personal sound analysis system.
The very first attempt was in early 1986 with a wonderful Sinclair QL (QL meant Quantum Leap...); small, thin, black, lightweight, with 128K of RAM and a Motorola 68008 inside. Its powerful procedural Basic in ROM allowed the first experiments and the development of a skeleton of the new software. Very soon, I switched to a Pascal compiler and a 68000 Assembler to get speed. I rewrote the programs; they worked fine on synthesized signals, but, unfortunately, I never bought an AD converter to work with real sounds.

In late 1986 a further definitive leap towards the emerging PC architecture: an Olivetti M24 (Intel8086 CPU, i8087 FPU, 8MHz, 256K RAM, 20MB HD, an EGC graphic board with 640x480 pixels and 16 colours) equipped with a Data Translation DT2801 signal acquisition board (27ksamples/sec maximum sample rate, 12 bit AD converter, DMA) for sound I/O. Just to get few % of speed increase I replaced the Intel CPU with a compatible NEC CPU.

Spectrogram of courtship sounds emitted by the goby fish Padogobius martensi made on a 8086 PC (1986)

Rewriting the whole software, as well as optimizing it for the new hardware, was a big challenge. I spent a lot of time to make it running. I was very lucky to find a wonderful book ("8087 Applications and programming for the IBM PC and other PCs" written by R.Startz, published by Brady in 1983) and after reading it I gave a boost to the software by writing most of the critical routines in Assembler 8086 and 8087. Then, the shell of the program and the user interface were written with the Microsoft Basic Compiler and subsequently with Microsoft QuickBasic, one of the first compiled procedural Basic available for PCs.
The writing of a well optimized FFT routine in assembler was not a trivial task and the results were slower than expected. Again, I was lucky enough to find a highly optimized assembler routine written by John Hartwell to be linked to FORTRAN programs and I adapted it to my needs. I really spent a lot of time writing, optimizing, compiling and linking programs in three different languages. Some years later I contacted again Mr. Hartwell to ask him for updated routines and he replied: "I no more write computer programs. In the life there are lots of things to do better than programming." Good reply. Now, several years later, I really understand the meaning of that answer.
At the end of 1986 the PC system began to work well. New important features were developed: DMA based continuous recording and playback, variable rate playback, editing and synthesis capabilities, 16 colour spectrograms on up to 16K samples on screen or continuous spectrograms (continuous sheet) on inexpensive dot matrix printers (with multipass printing to produce gray shades), frequency tracking and song parsing.
In 1986, the first system running on a 8MHz 8086+8087 required about 20 minutes to display a 16 colour spectrogram on 16K samples. A bit later, in 1987, computation times were slightly shortened by switching to a 12 MHz 80286+80287 and then to a 20 MHz 386+387.

With this new system I began to work extensively on many animal species, including marine mammals. Also, I began to experiment with song parsing, frequency tracking and automatic analysis of song parameters. The very first attempts were very successful because I tested the algorithms on very good recordings of relatively simple songs... though, I discovered very how much complex "real world recordings" can be...


Spectrogram printed with a simple dot matrix printer (1987).
The software allowed to print continuous spectrogram on inexpensive continuous perforated sheets.
Shades of gray were produced by multipass printing.


A successful example of automatic song parsing (1987).

A dedicated driver was developed to print on LaserJet printers
and on the thermal printer Mitsubishi P71U, the same used by the Kay Sonagraph DSP5500 (1989).

The first real-time portable system to be used in the field was developed in 1990 by integrating in a portable PC (EPSON PC AX3s Portable, 8086SX + 808387, 16MHz, 2MB RAM, 40MB HD) a high quality audio board along with a DSP based signal acquisition board Microstar DAP 2400 (12 bit AD, max 250K samples/sec, a Motorola DSP 56001 and an Intel CPU on board) for real-time processing and high speed data acquisition. The system was called Portable DSPW - Digital Signal Processing Workstation.
In the same period, the Intel 486 CPU with integrated FPU, allowed a desktop PC working in real-time up to 48000 samples/sec without the need for a costly and difficult to program DSP processor.
A great leap in the development of the DSPW was the availability of two high quality sound boards designed by Audiologic, a small Italian company dedicated to the development of high quality audio systems and speech synthesis systems. The two ISA boards - Audiologic Duetto (analog I/O up to 48k s/sec, mono or stereo) and Audiologic Audioboard Plus (SPDIF digital I/O) allowed for the first time to bring DAT quality to a PC. A custom driver developed by Audiologic to match our needs allowed to integrate the boards in the DSPW to allow continuous, gap free, recording to hard disk with real-time sound analysis and display. Another great leap in field studies was the availability of portable DAT recorders at reasonable prices. The page describing the system we developed and used in the period 1990-1994 is still available to provide a more detailed description.

The Portable DSPW with the base to host
I/O and DSP boards (1990)

 

Scheme of the desktop DSPW
as conceived in 1990 >>



 

Screen shot from the real-time Duetto Spectrograph based on the Audiologic board (1993)

Another experimentation made possible by real-time processing and display capabilities, was joining video to audio and spectrograms. A desktop PC was fitted with a video overlay board and a sound acquisition board to show a live video on the PC screen along with a real-time spectrogram. The system was later modified by adding a VGA to video converter and a video mixer to produce video tapes integrating a HiFi sound channel, a live video and a real-time spectrogram. This system was extensively used at the Genova Aquarium to study the vocal behaviour of a newborn bottlenose dolphin and to associate sounds with visible behaviours.

Real-time spectrogram with a live video of a vocalizing young dolphin (1994, research made at the Genova Aquarium)

The introduction of Pentium processors definitively moved the PC into the real-time arena.
In 1995, to make available to other labs the basic capabilities of the DSPW, the software - still in DOS - was improved to use the Creative Sound Blaster series (16/32/64). It was made available for free on the internet with the name SBRTA (Sound Blaster Real Time Analyzer).
In 1996, four different working versions of the DSPW were available, the difference being mainly in the supported board for sound I/O:
- Microstar DAP 2400/3000/3200 (analog I/O up to 769k s/sec)
- Audiologic Duetto (analog I/O up to 48k s/sec, mono or stereo) & Audiologic Audioboard Plus (digital I/O - SPDIF, mono or stereo)
- Creative Sound Blaster boards 16/32/64 (analog I/O up to 44.1k s/sec, mono or stereo)
- Zefiro ZA2 (digital I/O - SPDIF, AES/EBU, TosLink - mono or stereo)

The latest version of the SBRTA software (1996), still available for free download.

Since then, the continuous increase in CPU speed made the analysis of sounds faster and faster. In 1996, a Pentium 233 required only 200 msec to display a spectrogram of 1 second of sound sampled at 48000 samples/sec thus making it possible to work in real-time up to 250K samples/sec.

Other than speed increase, the latest development of multimedia PCs offer a wide range of powerful audio interfaces with disk storage capacity increasing month by month. The high quality audio interfaces now available for both desktop and notebooks PC now outperform even the DAT recorders and allow to setup a powerful sound analyisis and recording system on a notebook.

The SBRTA software developed in 1995 is still working in many labs, though, starting with 1997, a completely new system, based on the Windows platform, was designed and developed to take advantage of all the features of Windows based systems, mainly the independency from the hardware, both for acquiring sounds and displaying graphs. The new software, named wSpecGram, was completely operative in 1998. Then, with the fast Pentium CPUs, the high quality sound devices made available by a globally expanding market of multimedia products, the hard disks able to hold up to 100GB, it was possible to do almost everything on a cheap notebook: multichannel sound recording and real-time display, recording and analysis of ultrasounds up to 400 kHz.

A PII 233MHz notebook with the very first USB digital audio interface made by OPCODE and a DAT SONY D7 (1998).

Then we started thinking to new challenges such as realtime beamforming, sound localization and sound classification...

A whole bioacoustic laboratory in a notebook. A dream become true.
Gianni Pavan (C) 2001 - Comments are welcome!


This article covers the period 1980-1999. The features of the current versions of the Digital Signal Processing Workstation are described in the pages on the equipment and software developed by CIBRA.


CIBRA Home Page


Page created by G.Pavan, 2001. Revised March 2005.