/httpd/html/Corpus Eye

The standard (grep-based) search system and the treebank interfaces were designed and programmed by Eckhard Bick for VISL, using ordinary linux-tools and Tgrep2. The menu-based cqp-interface was designed by Eckhard Bick and programmed in Java by Poul Henriksen and Nikolaj Hald Nielsen on a special ISK-grant. It uses the IMS corpus workbench and was inspired by user feed back and a similar interface made by Paul Meurer for Oslo University's Tekstlaboratoriet.

Raw text corpora were kindly provided or licensed by ECI, DSL, Oxford University, Linguateca, ATILF, NILC, the Danish parliament and others. For a full list of corpus credits and references see our copyright page, which is also linked from the individual corpus pages.

Grammatical corpus annotation, both morphosyntactic tags (CG) and tree-structures (PSG), was performed with Eckhard Bick's VISL parsers: PALAVRAS (Portuguese), PALAVRAS-HIS (Spanish), DanGram (Danish), GerGram (German), EspGram (Esperanto), ItaGram (Italian) and FrAG (French), which are all accessible online (including file upload service). Annotation of German and English also involved morphological, and in the latter case syntactic, modules licensed by Lingsoft. For French, the PoS-tagging stage used H. Schmid's and A. Stein's DecisionTreeTagger. For Romanian, the morphological annotation was performed with Dan Tufis' probabilistic MSD tagger.

Treebank revision was supervised work involving, among others, the following VISL-students: Susanna Afonsoand Raquel Marchi (Portuguese), Ina Størner Rasmussen, Camilla Pedersen, Dorte Lønsmann and Kim Ebensgaard Jensen (Danish), and Ane Dybro Johansen (French). The treebank projects had funding support by Linguateca (Portuguese), The Nordic Council of Ministers (Danish) and ATILF (French).

More information on the VISL project as well as live grammatical analysis and a number of grammar teaching tools are available at the VISL main site or its research oriented beta version.