This work was supported in part by DARPA's ASSIST Program (contract number NBCH-C-05-0137) and an ONR MURI grant (No. N000140510388).
The corpus was originally described in our ICASSP 2009 paper, and a more complete description of the data collection effort as well as the resulting data is given in a journal paper (submitted for publication to Computer Speech and Language). A draft of the paper is available here. The characteristics of the corpus are summarized below.
Each speaker wore a portable recording system with 7 microphones: a 4-channel array worn in front of the speaker's chest, as well as a throat microphone, a shoulder-mounted microphone, and a close-talking microphone worn in front of the mouth.
7-microphone portable recording system:

The conversations are unprompted - participants were instructed to talk about anything they like, so they spoke about topics that were natural and interesting for them.
A list of possible conversation topics was provided to the participants in case they ran out of things to talk about, though it was very rarely used.
As a result, the conversations are natural and spontaneous.
The conversations take place in a variety of noisy environments, both indoors and outdoors.
Several aspects of the data can be exploited for achieving improved recognition performance.
* The recorded audio contains natural, spontaneous, conversational speech.
* The recordings were made in environments with a wide range of noise types and noise levels, thus the speech is subject to the Lombard effect.
* Traditional microphone array beam-steering techniques can be used on the 4-channel microphone array recordings.
* The audio from all the channels is synchronized, so multi-stream speech recognition techniques can be used, and mappings from noisy to clean speech can be learned.
* The availability of synchronized speaker turn information can be used to learn conversation dynamics.
The corpus recordings are available as tar archives of FLAC format files.
They are accompanied by transcriptions as well as participant surveys (which contain demographic data and answers to questions about the speakers' language experience).
Audio quality is 44.1 kHz, 16 bit. The original 48 kHz, 24 bit recordings are available upon request.
If you have any questions, please get in touch with us.
email: cosine {at} ssli.ee.washington.edu