Multiword Expression Resources

The following resources are relevant to the LinGO Multiword Expression Project. Each entry gives a basic description of the resource, its release date and its author(s). Where a reference is listed for a resource, we ask that any published results that make use of the data set cite that reference.

Resource Name




English and Russian Prepositional Phrases

A list of Russian PP MWEs and a ranked list of PP MWE candidates [Download]; a ranked list of English PP MWE candidates from the BNC [Download]; a tool for extracting MWEs [Download]; [Read Me]

Reference: Serge Sharoff (2004) What is at Stake: a Case Study of Russian Expressions Starting with a Preposition, In Proceedings of the ACL 2004 Workshop on Multiword Expressions: Integrating Processing, Barcelona, Spain.

Serge Sharoff (Centre for Translation Studies, Leeds) 16 Aug 2004

Verb particle constructions with compositionality judgements
(version 1)

A list of 116 verb particle constructions (VPCs) with compositionality judgements from a total of 4 annotators [Read Me] [Download]

Reference: McCarthy, Diana, Bill Keller and John Carroll (2003) Detecting a Continuum of Compositionality in Phrasal Verbs, In Proceedings of the ACL-SIGLEX Workshop on Multiword Expressions: Analysis, Acquisition and Treatment, Sapporo, Japan.

Diana McCarthy (Informatics, Sussex) 17 Dec 2003

BNC verb particle construction frequency list
(version 1)

A list of valence-specified verb particle constructions (VPCs), automatically extracted from the British National Corpus. [Read Me] [Download]

Reference: Timothy Baldwin and Aline Villavicencio (2002) Extracting the Unextractable: A Case Study on Verb-particles, In Proceedings of the Sixth Conference on Computational Natural Language Learning (CoNLL 2002), Taipei, Taiwan, pp. 98-104.

Timothy Baldwin (CSSE, Melbourne) 16 Oct 2003

Verb particle constructions with Levin verb classes and Google frequencies
(version 1)

A list of verb particle constructions (VPCs), each specified for basic sense according to a Levin verb class and given an estimated web page frequency based on Google. [Read Me] [Temporarily unavailable]

Reference: Aline Villavicencio (2003) Verb Particle Constructions in the WWW, In Proceedings of the ACL-SIGSEM Workshop on the Linguistic Dimensions of Prepositions and their Use in Computational Linguistics Formalisms and Applications, Toulouse, France, pp. 101-111.

Aline Villavicencio (Computer Lab, Cambridge) 16 Oct 2003

Last modified: Mon Aug 16 18:29:42 PDT 2004