Steven Bird's Publications

This page lists my online publications. Where one publication supersedes another, only the most recent one is given.

2008

Querying linguistic trees.
Catherine Lai and Steven Bird, manuscript
[ pdf ]

Multidisciplinary instruction with the Natural Language Toolkit.
Steven Bird, Ewan Klein, Edward Loper, and Jason Baldridge, Proceedings of the Third Workshop on Issues in Teaching Computational Linguistics, Columbus, Ohio, USA.
[ eprint ]

Defining a core body of knowledge for the introductory computational linguistics curriculum.
Steven Bird, Proceedings of the Third Workshop on Issues in Teaching Computational Linguistics, Columbus, Ohio, USA.
[ eprint ]

The ACL Anthology Reference Corpus: A reference dataset for bibliographic research in Computational Linguistics.
Steven Bird, Robert Dale, Bonnie Dorr, Bryan Gibson, Mark Joseph, Min-Yen Kan, Dongwon Lee, Brett Powley, Dragomir Radev, Yee Fan Tan, Proceedings of the Sixth International Conference on Language Resources and Evaluation, Marrakech, Morocco.
[ eprint ]

Natural Language Processing (in preparation)
Steven Bird, Ewan Klein, and Edward Loper
[ webpage ]

2007

Graphical query for linguistic treebanks
Steven Bird and Haejoong Lee, 10th Conference of the Pacific Association for Computational Linguistics
[ eprint ]

Managing Fieldwork Data with Toolbox and the Natural Language Toolkit
Stuart Robinson, Greg Aumann, and Steven Bird, Language Documentation and Conservation 1, pp 44-57.
[ PDF ]

2006

NLTK: The Natural Language Toolkit
Steven Bird. Proceedings of the ACL demonstration session, Sydney, July 2006
[ PDF ]

Building a Search Engine to Drive Problem-Based Learning
Steven Bird and James Curran. 11th Annual Conference on Innovation and Technology in Computer Science Education (ITiCSE). University of Bologna, Italy, June 2006
[ eprint ]

Collecting Low-Density Language Materials on the Web
Timothy Baldwin, Steven Bird and Baden Hughes, Proceedings of 12th Australasian Web Conference (AusWeb06), Southern Cross University. pp 318-321.
[ online]

Reconsidering Language Identification for Written Language Resources
Baden Hughes, Timothy Baldwin, Steven Bird, Jeremy Nicholson, and Andrew MacKinlay, 5th International Conference on Language Resources and Evaluation (LREC). pp 485-488, Genoa, Italy. May 2006.
[ eprint ]

Designing and Evaluating an XPath Dialect for Linguistic Queries
Steven Bird, Yi Chen, Susan Davidson, Haejoong Lee, and Yifeng Zheng. 22nd International Conference on Data Engineering (ICDE). pp 52-61, Atlanta, USA. April 2006.
[ eprint ]

2005

NLTK-Lite: Efficient Scripting for Natural Language Processing
Steven Bird. Proceedings of the 4th International Conference on Natural Language Processing (ICON). pp 11-18, Kanpur, India. New Delhi: Allied Publishers. December 2005.
[ eprint ]

Structuring Documents Efficiently
Robert Marshall, Steven Bird and Peter Stuckey. Proceedings of the Australasian Language Technology Workshop. pp 120-128, Sydney, Australia, December 2005.
[ eprint ]

Extending XPath to Support Linguistic Queries
Steven Bird, Yi Chen, Susan Davidson, Haejoong Lee, and Yifeng Zheng. Proceedings of Programming Language Technologies for XML (PLANX) pp 35-46, Long Beach, California. ACM. January 2005.
[ eprint ]

Transforming Access to the Spoken Word
Jerry Goldman, Steve Renals, Steven Bird, Franciska de Jong, Marcello Federico, Carl Fleischhauer, Mark Kornbluh, Lori Lamel, Douglas Oard, Claire Stewart and Richard Wright, International Journal on Digital Libraries 5, 287-298. 12pp.
[ eprint ]

2004

Querying and Updating Treebanks: A Critical Survey and Requirements Analysis
Catherine Lai and Steven Bird. Proceedings of the Australasian Language Technology Workshop, pp 139-146. Sydney, Australia. December 2004
[ eprint ]

Representing and Rendering Linguistic Paradigms
David Penton and Steven Bird. Proceedings of the Australasian Language Technology Workshop, pp 123-130. Sydney, Australia. December 2004.
[ eprint ]

NLTK: The Natural Language Toolkit
Steven Bird and Edward Loper. Proceedings of the ACL demonstration session, Barcelona, pp 214-217. July 2004
[ eprint ]

Towards a General Model for Linguistic Paradigms
David Penton, Catherine Bow, Steven Bird, and Baden Hughes, Proceedings of the E-MELD Workshop on Databases and Best Practice, Detroit, pp 1-15, July 2004
[ eprint ]

TalkBank: Building an Open Unified Multimodal Database of Communicative Interaction
Brian MacWhinney, Steven Bird, Christopher Cieri and Craig Martell. Proceedings of the 4th International Conference on Language Resources and Evaluation, pp 525-528. Lisbon, Portugal.
[ eprint ]

Functional Requirements for an Interlinear Text Editor
Baden Hughes, Catherine Bow and Steven Bird. Proceedings of the 4th International Conference on Language Resources and Evaluation, pp 771-775. Lisbon, Portugal.
[ eprint ]

Securing Interpretability: The Case of Ega Language Documentation
Dafydd Gibbon, Catherine Bow, Steven Bird and Baden Hughes. Proceedings of the 4th International Conference on Language Resources and Evaluation, pp 1369-1372. Lisbon, Portugal.
[ eprint ]

Management of Metadata in Linguistic Fieldwork: Experience from the ACLA Project
Baden Hughes, David Penton, Steven Bird, Catherine Bow, Gillian Wigglesworth, Patrick McConvell and Jane Simpson. Proceedings of the 4th International Conference on Language Resources and Evaluation, pp 193-196. Lisbon, Portugal.
[ eprint ]

A Four-Level Model for Interlinear Text (DRAFT)
Catherine Bow, Baden Hughes and Steven Bird
[ pdf ] (draft only)

Building an Open Language Archives Community on the DC Foundation
Steven Bird and Gary Simons, In Hillmann and Westbrooks (editors), Metadata in Practice: A Work in Progress, ALA Editions, pp 203-222.
[ pdf ] (prepublication version)

2003

Encoding and Presenting Interlinear Text Using XML Technologies
Baden Hughes, Steven Bird and Catherine Bow. In Knott, Alistair and Estival, Dominique, Eds. Proceedings Australasian Language Technology Workshop, pages 105-113, Melbourne, Australia.
[ eprint ]

Grassfields Bantu Fieldwork: Dschang Lexicon
Steven Bird, LDC speech corpus LDC2003L01, ISBN 1-58563-255-4.
[ local | LDC | link ]

Grassfields Bantu Fieldwork: Dschang Tone Paradigms
Steven Bird, LDC speech corpus LDC2003S02, ISBN 1-58563-254-6.
[ local | LDC | link ]

Grid-Enabling Natural Language Engineering By Stealth
Baden Hughes and Steven Bird, Proceedings of the Workshop on The Software Engineering and Architecture of Language Technology Systems, pp 31-38 Association for Computational Linguistics.
[ arXiv | local | link ]

Seven Dimensions of Portability for Language Documentation and Description
Steven Bird and Gary Simons, Language 79: 557-582. (Supersedes an earlier version which appeared in the Proceedings of the Workshop on Portability Issues in Human Language Technologies, Third International Conference on Language Resources and Evaluation, Paris: European Language Resources Association, pp 23-30.)
Revised/updated version (subject to further revision)
Earlier version: [ arXiv | local | link ]
[NSF DEL solicitation cites this paper
]

The Open Language Archives Community: An infrastructure for distributed archiving of language resources
Gary Simons and Steven Bird, Literary and Linguistic Computing 18: 117-128.
[ arXiv | local | link ]

Building an Open Language Archives Community on the OAI Foundation
Gary Simons and Steven Bird, Library Hi Tech 21, 210-218, Special Issue on Open Archives Initiative Metadata Harvesting
[ arXiv | local | link ]

Extending Dublin Core Metadata to support the description and discovery of language resources
Steven Bird and Gary Simons, Computing and the Humanities 37, 375-388.
[ arXiv | local | link ]

2002

Proceedings of the IRCS Workshop on Open Language Archives
Steven Bird and Gary Simons (eds). 59pp
[ HTML | PDF ]

NLTK: The Natural Language Toolkit
Edward Loper and Steven Bird, Proceedings of the ACL Workshop on Effective Tools and Methodologies for Teaching Natural Language Processing and Computational Linguistics, Philadelphia, July 2002, Association for Computational Linguistics, pp 62-69.
[ arXiv | local | link ]

An Integrated Framework for Treebanks and Multilayer Annotations
Scott Cotton and Steven Bird, Proceedings of the Third International Conference on Language Resources and Evaluation, Paris: European Language Resources Association, pp 1670-1677. Las Palmas, Spain.
[ arXiv | local | link ]

TableTrans, MultiTrans, InterTrans and TreeTrans: Diverse Tools Built on the Annotation Graph Toolkit
Steven Bird, Kazuaki Maeda, Xiaoyi Ma, Haejoong Lee, Beth Randall, and Salim Zayat, Proceedings of the Third International Conference on Language Resources and Evaluation, Paris: European Language Resources Association, pp 364-370. Las Palmas, Spain.
[ arXiv | local | link ]

Creating Annotation Tools with the Annotation Graph Toolkit
Kazuaki Maeda, Steven Bird, Xiaoyi Ma, and Haejoong Lee, Proceedings of the Third International Conference on Language Resources and Evaluation, Paris: European Language Resources Association, pp 1914-1921. Las Palmas, Spain.
[ arXiv | local | link ]

Models and Tools for Collaborative Annotation
Xiaoyi Ma, Haejoong Lee, Steven Bird and Kazuaki Maeda, Proceedings of the Third International Conference on Language Resources and Evaluation, Paris: European Language Resources Association, pp 2066-2073. Las Palmas, Spain.
[ arXiv | local | link ]

Computational Phonology
Steven Bird, Oxford International Encyclopedia of Linguistics, 4pp, 2nd Edition, 2002
[ arXiv | local | link ]

Phonology
Steven Bird, In Ruslan Mitkov (ed), Oxford Handbook of Computational Linguistics, pp 1-24. Oxford University Press, 2002.
[ arXiv | local | link ]

2001

Grassfields Bantu Fieldwork: Ngomba Tone Paradigms
Steven Bird and John Bell, LDC speech corpus LDC2001S16, ISBN 1-58563-216-3.
[ local | LDC | link ]

The Open Language Archives Community and Asian Language Resources
Steven Bird, Gary Simons and Chu-Ren Huang, Proceedings of the Workshop on Language Resources in Asia, 6th Natural Language Processing Pacific Rim Symposium (NLPRS), 8pp, Tokyo, November 2001.
[ arXiv | local | link ]

The OLAC Metadata Set and Controlled Vocabularies
Steven Bird and Gary Simons, Proceedings of the ACL Workshop on Sharing Tools and Resources for Research and Education, Toulouse, July 2001, pp 7-18.
[ arXiv | local | link ]

Annotation Graphs and Servers and Multi-Modal Resources: Infrastructure for Interdisciplinary Education, Research and Development
Christopher Cieri and Steven Bird, Proceedings of the ACL Workshop on Sharing Tools and Resources for Research and Education, Toulouse, July 2001, pp 23-30.
[ arXiv | local | link ]

Speech Annotation and Corpus Tools - Special Issue of Speech Communication
Steven Bird and Jonathan Harrington (eds). Speech Communication 33(1,2), 2001.
contents

A formal framework for linguistic annotation
Steven Bird and Mark Liberman, Speech Communication 33(1,2), pp 23-60, 2001.
[ arXiv | local | link ]

Orthography and identity in Cameroon
Steven Bird, Written Language and Literacy 4(2), pp 131-162, 2001. (Also in Notes on Literacy 26.1-2, pp 1-37, SIL. Revised from paper presented at the 96th Annual Meeting of the American Anthropological Association, Washington, November 1997.)
[ CogPrints | local | link ]

2000

A Preliminary Study of the Structure of Lexicon Entries
John Bell and Steven Bird, Proceedings of the Workshop on Web-Based Language Documentation and Description, Philadelphia, December 2000.
[ local | link ]

A Formal Framework for Interlinear Text
Kazuaki Maeda and Steven Bird, Proceedings of the Workshop on Web-Based Language Documentation and Description, Philadelphia, December 2000.
[ local | link ]

ATLAS: A flexible and extensible architecture for linguistic annotation
Steven Bird, David Day, John Garofolo, John Henderson, Christophe Laprun, Mark Liberman, Proceedings of the Second International Conference on Language Resources and Evaluation, pp 1699-1706, 2000.
[ arXiv | local | link ]

Towards a query language for annotation graphs
Steven Bird, Peter Buneman and Wang-Chiew Tan, Proceedings of the Second International Conference on Language Resources and Evaluation, pp 807-814, 2000.
[ arXiv | local | link ]

Many uses, many annotations for large speech corpora: Switchboard and TDT as case studies
David Graff and Steven Bird, Proceedings of the Second International Conference on Language Resources and Evaluation, pp 427-433, 2000.
[ arXiv | local | link ]

Transcribing with annotation graphs
Edouard Geoffrois, Claude Barras, Steven Bird and Zhibiao Wu, Proceedings of the Second International Conference on Language Resources and Evaluation, pp 1517-1521, 2000.
abstract, ps, ps.Z, link

Querying databases of annotated speech
Steve Cassidy and Steven Bird, Database Technologies: Proceedings of the Eleventh Australasian Database Conference, pp. 12-20. IEEE Computer Society, 2000.
[ arXiv | local | link ]

1998/99

Annotation graphs as a framework for multidimensional linguistic data analysis
Steven Bird and Mark Liberman, Towards Standards and Tools for Discourse Tagging, Proceedings of the Workshop, pp 1-10. Association for Computational Linguistics, 1999.
[ arXiv | local | link ]

A formal framework for linguistic annotation
Steven Bird and Mark Liberman, Technical Report MS-CIS-99-01, Computer and Information Science, University of Pennsylvania, 1999. (Revised version)
[ arXiv | local | link ]

Multidimensional exploration of online linguistic field data
Steven Bird, Proceedings of the 29th Meeting of the North-East Linguistic Society, pp 33-50, 1999.
abstract, pdf, ps, ps.Z, Web Version, link

When marking tone reduces fluency: an orthography experiment in Cameroon
Steven Bird, Language and Speech 42, 83-115, 1999. (Supersedes research Paper HCRC/RP-91, Human Communication Research Centre, University of Edinburgh, 1997.)
[ CogPrints | local | link ]

Strategies for representing tone in African writing systems
Steven Bird, Written Language and Literacy 2, 1-44, 1999. (Supersedes: `Principles of African Tone Orthography', Research Paper HCRC/RP-80, Human Communication Research Centre, University of Edinburgh, 1996.)
[ CogPrints | local | link ]

Dschang syllable structure
Steven Bird, In van der Hulst & Ritter (eds) The Syllable: Views and Facts. Studies in Generative Grammar, Mouton-De Gruyter, pp 447-476, 1999. (Supersedes: `Dschang Syllable Structure and Moraic Aspiration', Research Paper EUCCS/RP-69, Centre for Cognitive Science, University of Edinburgh, 1996.)
[ CogPrints | local | link ]

Towards a formal framework for linguistic annotations
Steven Bird and Mark Liberman, Proceedings of the International Conference on Spoken Language Processing, Sydney, December 1998. (Revised version.)
abstract, pdf, ps, ps.Z, link

1996/97

A lexical database tool for quantitative phonological research
Steven Bird, Proceedings of the Third Meeting of the ACL Special Interest Group in Computational Phonology, 33-39, Madrid, July 1997.
[ cmp-lg | local | link ]

Petit Dictionnaire Yémba-Français
Steven Bird and Maurice Tadadjeu, Yaoundé: NACALCO, 176pp, 1997.
abstract, pdf, ps, ps.Z, link

Key aspects of declarative phonology
James Scobbie, John Coleman and Steven Bird, In Durand & Laks (eds). Current Trends in Phonology: Models and Methods, pp 685-710. University of Salford Publications. 1996
abstract

1994/95

Computational Phonology: A Constraint-Based Approach
Steven Bird, Studies in Natural Language Processing, Cambridge University Press, 1995.
cover-scan + reviews, abstract, ordering info, contents, references: pdf, ps ps.Z, link

The Bamileke Dschang associative construction: instrumental findings
Steven Bird and Oliver Stegen, Research Paper EUCCS/RP-66, Centre for Cognitive Science, University of Edinburgh, 1995,
abstract, pdf, ps, ps.Z

Computational Phonology - Special Issue of Computational Linguistics
Steven Bird, Computational Linguistics, 20 (3), 1994.

Introduction to computational phonology
Steven Bird, Computational Linguistics, 20 (3), 1994.
abstract, pdf, ps, ps.Z, link

Automated tone transcription
Steven Bird, Proceedings of the First Meeting of the ACL Special Interest Group in Computational Phonology, 1-12, Las Cruces, 1994.
[ cmp-lg | local | link ]

One level phonology: autosegmental representations and rules as finite automata
Steven Bird and T. Mark Ellison, Computational Linguistics, 20, 55-90, 1994.
abstract, pdf, ps, ps.Z,

Phonological analysis in typed feature systems
Steven Bird and Ewan Klein, Computational Linguistics, 20, 455-91, 1994.
abstract, pdf, ps, ps.Z, link

1992/93

The morphotonology of the Dschang-Bamileke associative construction
Steven Bird, In Scobbie & Ellison (eds), Phonology and Computation, Working Papers in Cognitive Science, Volume 8, University of Edinburgh, 1993.
abstract, pdf, ps, ps.Z, link

Tone in the Bamileke Dschang associative construction: an electrolaryngographic study and comparison with Hyman (1985)
Steven Bird and Oliver Stegen, Technical Report CCS/RP-57, University of Edinburgh, 1993.
pdf, ps, ps.Z

Declarative phonology
Steven Bird, John Coleman, Janet Pierrehumbert and Jim Scobbie, Proceedings of the 15th International Conference of Linguists, Quebec, Canada, 1992.
pdf, ps, ps.Z

A phonologist's workbench
Steven Bird and T. Mark Ellison, Proceedings of the 15th International Conference of Linguists, Quebec, Canada, 1992.
pdf, ps, ps.Z

Finite-state phonology in HPSG
Steven Bird, Proceedings of the Fifteenth International Conference on Computational Linguistics (COLING-92), 74-80, 1992.
pdf, ps, ps.Z

1990/91

Feature structures and indices
Steven Bird, Phonology 8, 137-144, 1991.

Focus and phrasing in Unification Categorial Grammar
Steven Bird, In Bird (ed), Declarative Perspectives in Phonology, Working Papers in Cognitive Science, Volume 7, University of Edinburgh. pp. 139-166, 1991.
pdf, ps, ps.Z

Declarative perspectives in phonology
Steven Bird (ed), Working Papers in Cognitive Science, Volume 7, University of Edinburgh.

A logical approach to Arabic phonology
Steven Bird and Patrick Blackburn, Proceedings of the Fifth Meeting of the European Chapter of the Association for Computational Linguistics, 89-94, 1991.
pdf, ps, ps.Z

Defaults in underspecification phonology
Jonathan Calder and Steven Bird, In Bird (ed), Declarative Perspectives in Phonology, Working Papers in Cognitive Science, Volume 7, University of Edinburgh. pp. 107-125, 1991.
pdf, ps, ps.Z

Phonological structure and abstract specification
Ewan Klein and Steven Bird, Proceedings of the 12th International Congress of Phonetic Sciences, Volume 5, pp 110-13, 1991.

Presenting autosegmental phonology
Steven Bird and D. Robert Ladd, Journal of Linguistics, 27, 193-210, 1991.

Phonological events
Steven Bird and Ewan Klein, Journal of Linguistics, 26, 33-56, 1990.

Prosodic morphology and constraint-based phonology
Steven Bird, Technical Report CCS/RP-38, University of Edinburgh

Constraint-Based Phonology
Steven Bird, PhD Thesis, University of Edinburgh, 1990.


Steven Bird's Homepage | sb@ldc.upenn.edu