Obtaining DataUsing DataProviding DataCreating Data
About LDCMembersCatalogProjectsPapersLDC OnlineSearchContact UsUPennHome

Linguistic Resources  
What's New! What's Free! Archive

Free Corpora/Software| Press Releases|

Free Corpora/Software


Free Talkbank Corpora. TalkBank is an indisciplinary research project funded by a five year NSF grant to foster research and development in communicative behavior by providing tools and standards for analysis and distribution of language data. The LDC distributes grant-covered copies of the following Talkbank corpora:



Free copies for all of the above corpora are still available; a US$30 shipping and handling fee applies for data on disc.


Free Web 1T 5-gram Copies Available - the LDC would like to thank Google for its kind sponsorship of nearly 200 free copies of the Web 1T 5-gram data for university researchers. To date, all copies have been claimed. The data is available for licensing at the regular Non-member Fee.

ESPS Software - signal processing programs that can be used for the analysis, manipulation and labeling of speech.

Release of AGLIB 2.0! - new version of software infrastructure for linguistic annotation.

Transcriber - tool for segmenting, labeling and transcribing speech.

Champollion - parallel text sentence alignment tool for as many language pairs as possible.

Press Releases - UNDER CONSTRUCTION - all links not yet functional

15th Anniversary Monthly Spotlight Archive - as part of our 15th Anniversary celebration, we highlighted one aspect of the LDC in our monthly newsletters. These features provided our members and data users with a glimpse of the broad range of the LDC’s research activities.

Use of LDC Corpora in University Summer Schools - ways LDC corpora have been used for teaching purposes at university summer school programs.

Conference Attendence by LDC - recent publisher displays by the LDC.

Newly Updated LDC Papers Page - papers presented by LDC staff at LREC2006 and other conferences.

OLAC Search - search for language resources from dozens of language data centers and language archives.

Member Resources Page! The LDC has new and improved resources for members and membership info. Please check it out!



About LDC | Members | Catalog | Projects | Papers | LDC Online | Search / Help | Contact Us | UPenn | Home | Obtaining Data | Creating Data | Using Data | Providing Data

Contact ldc@ldc.upenn.edu
Last modified: Wednesday, 25-Jun-2008 17:46:02 EDT
© 1992-2007 Linguistic Data Consortium, University of Pennsylvania. All Rights Reserved.