

What's New! What's Free!

LDC to Close for Thanksgiving Day Holiday - November 26 and 27, 2009
Early Renewal Discounts for MY2010 - renew membership early and save!
LDC at NWAV 38 - LDC's recent conference participation
LDC Data Sheets - concise descriptions of LDC projects, operations, and technical capabilities
XTrans - new tool for manual transcription and annotation of audio recordings
LDC's Corpus Catalog Receives Top OLAC Rating - LDC's improved catalog
What's New Archive

New Corpora

2007 NIST Language Recognition Evaluation Supplemental Training Set ~ 118 hours of conversational telephone speech segments
French Gigaword Second Edition ~ comprehensive archive of French news text acquired by LDC
NXT Switchboard Annotations ~ multiple layers of annotation in XML format
2007 NIST Language Recognition Evaluation Test Set ~ 66 hours of conversational telephone speech segments
OntoNotes Release 3.0 ~ Treebank, PropBank, word sense, and coreference annotated English, Chinese, and Arabic news text
Web 1T 5-gram, 10 European Languages Version 1 ~ word n-grams and their observed frequency counts for ten European languages
New Corpora Archive

Employment at the LDC

ACL Anthology ~ A Digital Archive of Research Papers in Computational Linguistics

OLAC ~ Open Language Archives Community

Linguistic Resources
Linguistic Data Consortium

The Linguistic Data Consortium supports language-related education, research and technology development by creating and sharing linguistic resources: data, tools and standards.


LDC is supported in part by grant IRI-9528587 from the Information and Intelligent Systems division and grant 9982201 from the Human Computer Interaction Program of the National Science Foundation. LDC's corpus creation efforts are powered in part by Academic Equipment Grant 7826-990 237-US from Sun Microsystems.


Contact ldc@ldc.upenn.edu
Last modified: Friday, 20-Nov-2009 17:38:07 EST
© 1992-2009 Linguistic Data Consortium, University of Pennsylvania. All Rights Reserved.