|
What's New! What's Free!
LDC to Close for Thanksgiving Day Holiday - November 26 and 27, 2009
Early Renewal Discounts for MY2010 - renew membership early and save!
LDC at NWAV 38 - LDC's recent conference participation
LDC Data Sheets - concise descriptions of LDC projects, operations, and technical capabilities
XTrans - new tool for manual transcription and annotation of audio recordings
LDC's Corpus Catalog Receives Top OLAC Rating - LDC's improved catalog
What's New
Archive
New Corpora
2007 NIST Language Recognition Evaluation Supplemental Training Set ~118 hours of conversational telephone speech segments
French Gigaword Second Edition ~comprehensive archive of French news text acquired by LDC
NXT Switchboard Annotations~multiple layers of annotation in XML format
2007 NIST Language Recognition Evaluation Test Set ~66 hours of conversational telephone speech segments
OntoNotes Release 3.0 ~Treebank, PropBank, word sense, and coreference annotated English, Chinese, and Arabic news text
Web 1T 5-gram, 10 European Languages Version 1~word n-grams and their observed frequency counts for ten European languages
New Corpora Archive
Employment at the LDC
ACL Anthology ~ A Digital Archive of Research
Papers in Computational Linguistics
OLAC ~ Open Language Archives Community
|
|
Linguistic Resources
The Linguistic Data Consortium supports language-related education, research
and technology development by creating and sharing linguistic resources:
data, tools and standards.


LDC is supported in part by grant IRI-9528587 from the Information and Intelligent
Systems division and grant 9982201 from the Human Computer Interaction Program of the
National Science Foundation.
LDC's corpus creation efforts are powered in part by Academic Equipment Grant 7826-990
237-US from Sun Microsystems.
|
|