Obtaining DataUsing DataProviding DataCreating Data
About LDCMembersCatalogProjectsPapersLDC OnlineSearchContact UsUPennHome


What's New! What's Free!


2007 Member Survey Responses - survey responses are in
2008 Publications Pipeline - information on our planned, yet tentative, 2008 publications
50,000th LDC Corpus Distributed! - the LDC has reached another landmark distribution
Membership Fee Increases and Discounts - information on our increases in membership fees.
Free Web 1T 5-gram Copies Available - Google is sponsoring 100 copies for university researchers.
Language Resource and Evaluation Conference 2008 - as in previous years, LDC is pleased to contribute to the organization and promotion of LREC.
15th Anniversary Fidelity Celebration! - the LDC honors its most loyal members.
LDC Turns Fifteen! - join us in celebrating our 15th anniversary.
What's New Archive

New Corpora

An English Dictionary of the Tamil Verb ~contains translations for 6597 English verbs and defines 9716 Tamil verbs
GALE Phase 1 Chinese Blog Parallel Text ~313K character of Chinese blog text and its translation from eight sources
CSLU: National Cellular Telephone Speech Release 2.3 ~approximately one minute of transcribed speech from 2336 speakers throughout the US
GALE Phase 1 Arabic Blog Parallel Text ~102K words of Arabic blog text and its English translation from thirty-three sources
STC-TIMIT 1.0 ~entire TIMIT database recorded through a single telephone channel
New Corpora Archive

Employment at the LDC

ACL Anthology ~ A Digital Archive of Research Papers in Computational Linguistics, hosted at the LDC

OLAC ~ Open Language Archives Community

Linguistic Resources
Linguistic Data Consortium

The Linguistic Data Consortium supports language-related education, research and technology development by creating and sharing linguistic resources: data, tools and standards.

map

LDC is supported in part by grant IRI-9528587 from the Information and Intelligent Systems division and grant 9982201 from the Human Computer Interaction Program of the National Science Foundation. LDC's corpus creation efforts are powered in part by Academic Equipment Grant 7826-990 237-US from Sun Microsystems.

About LDC | Members | Catalog | Projects | Papers | LDC Online | Search / Help | Contact Us | UPenn | Home | Obtaining Data | Creating Data | Using Data | Providing Data

Contact ldc@ldc.upenn.edu
Last modified: Tuesday, 22-Apr-2008 11:55:35 EDT
© 1992-2007 Linguistic Data Consortium, University of Pennsylvania. All Rights Reserved.