Treebanks : building and using parsed corpora /

Linguists and engineers in Natural Language Processing tend to use electronic corpora more and more. Most research has long been limited to raw (unannotated) texts or to tagged texts (annotated with parts of speech only), but these approaches suffer from a word by word perspective. A new line of res...

Full description

Bibliographic Details
Corporate Author: SpringerLink (Online service)
Other Authors: Abeillé, Anne
Format: eBook
Language:English
Published: Dordrecht ; Boston : Kluwer Academic Publishers, [2003]
Series:Text, speech, and language technology ; v. 20.
Subjects:
Online Access:Connect to the full text of this electronic book
Description
Summary:Linguists and engineers in Natural Language Processing tend to use electronic corpora more and more. Most research has long been limited to raw (unannotated) texts or to tagged texts (annotated with parts of speech only), but these approaches suffer from a word by word perspective. A new line of research involves corpora with richer annotations such as clauses and major constituents, grammatical functions and dependency links. The first parsed corpora were the English Lancaster treebank and Penn Treebank. New ones have recently been developed for other languages. This book: provides a state of the art on work being done with parsed corpora; gathers 21 papers on building and using parsed corpora raising many relevant questions; deals with a variety of languages and a variety of corpora; is for those working in linguistics, computational linguistics, natural language, syntax, and grammar.
Item Description:Electronic resource.
Physical Description:1 online resource (xxvi, 405 pages :) : illustrations
Bibliography:Includes bibliographical references and index.
ISBN:9789401002011 (electronic bk.)
9401002010 (electronic bk.)