ALTA Logo Proceedings of ALTSS/ALTW, Melbourne, December 2003

Statistical Parsing - probabilistic models and stochastic grammars

Mark Johnson, Brown University, USA


ABSTRACT:

This course unites two different approaches to computational linguistics and natural language processing. On the one hand, there is considerable linguistic evidence that natural language possesses a rich hierarchical structure that is only indirectly reflected in the sequence of words or sounds that make up sentences. On the other hand, there are also reliable statistical regularities in the selection and ordering of words and phrases in natural languages. Stochastic grammars are capable of describing both aspects of natural languages, and are a key component of state-of-the-art technology in many areas of computational linguistics.

This course will cover the following topics:

The course does not have any specific prerequisites, but mathematical and computer science background will be helpful. The ability to take derivatives and manipulate mathematical expressions at a first-year undergraduate level will enable students to follow the derivation of the formulas, and computer science experience in algorithms with enable students to understand, analyze and implement the various algorithms described in the course.

BIO :

Mark Johnson is Professor of Cognitive & Linguistic Sciences and Computer Science at Brown University, and current president of the Association for Computational Linguistics. He has made significant contributions to research into computational processes involved in human language understanding, and is at the forefront of research in statistical natural language processing. [http://www.cog.brown.edu/~mj/]

RESOURCES: