NIST/USF Evaluation Resources for the VACE Program - Meeting Data Training
Set Part 2, Linguistic Data Consortium (LDC) catalog number LDC2011V02 and ISBN
1-58563-577-4, was developed by researchers at the Department
of Computer Science and Engineering, University of South Florida (USF),
Tampa, Florida and the Multimodal Information
Group at the National Institute of Standards and Technology (NIST). It contains
approximately fourteen hours of meeting room video data collected in 2001 and
2002 at NIST's Meeting Data Collection Laboratory and annotated for the VACE
(Video Analysis and Content Extraction) 2005 face, person and hand detection
and tracking tasks. LDC has previously released NIST/USF Evaluation Resources
for the VACE Program - Meeting Data Training Set Part 1 (LDC2011V01).
The VACE program was established to develop novel algorithms for automatic video
content extraction, multi-modal fusion, and event understanding. During VACE
Phases I and II, the program made significant progress in the automated detection
and tracking of moving objects including faces, hands, people, vehicles and
text in four primary video domains: broadcast news, meetings, street surveillance,
and unmanned aerial vehicle motion imagery. Initial results were also obtained
on automatic analysis of human activities and understanding of video sequences.
Performance evaluations were conducted under the auspices of the VACE program
between 2004 and 2007. The 2005 evaluation was administered by USF in
collaboration with NIST and guided by an advisory forum including the evaluation
participants. A summary of results of the evaluation can be found in the 2005
VACE results and analysis paper included in this release.
NIST's Meeting Data Collection Laboratory is designed to collect corpora to
support research, development and evaluation in meeting recognition technologies.
It is equipped to look and sound like a conventional meeting space. The data
collection facility includes five Sony EVI-D30 video cameras, four of which
have stationary views of a center conference table (one view from each surrounding
wall) with a fixed focus and viewing angle, and an additional floating
camera which is used to focus on particular participants, whiteboard or conference
table depending on the meeting forum. The data is captured in a NIST-internal
file format. The video data was extracted from the NIST format and encoded using
the MPEG-2 standard in NTSC format. Further information concerning the video
data parameters can be found in the documentation included with this corpus.
The VACE evaluation tools have been integrated into NIST's downloadable Framework
for Detection Evaluation (F4DE) Toolkit. The toolkit contains small example
files for each of the task/object/domain scoring combinations.
Additional information, updates, and bug fixes may be available in the
LDC catalog entry for this corpus at
Portions © 2011 Trustees of the University of Pennsylvania