Workflow of Corpus-based Lexicography
COST ENeL WG3 meeting
Bolzano, 19 July 2014
| TIMES | title of contribution | author(s) | 
|---|---|---|
| 14:30-14:35 | OPENING & WELCOME | |
| General Purpose Dictionaries: Monolingual and Bilingual | ||
| Workflow in Kielitoimiston sanakirja | Tarja	Heinonen (Institute for the Languages of Finland, Helsinki) | |
| The Dictionary of Contemporary Latvian Language and its Lexicographical Process | Ilga Migla, Ieva Zuicena (Latvian Language Institute of the University of Latvia) | |
| The lexicographical process of elexiko | Christine Möhrs	(Institut für Deutsche Sprache, Mannheim) | |
| Polish Academy of Sciences [PAS] Great Dictionary of Polish – lexicographical workflow and summary of the project | Piotr Żmigrodzki (Institute of Polish Language, Polish Academy of Sciences, Krakow) | |
| Slovene Lexical Database: lexicographical process | Simon Krek, Iztok Kosem, Polona Gantar (“Jožef Stefan” Institute, Ljubljana) | |
| The online Dutch-Frisian Dictionary – Workflow | Frits Kuip, van der Hindrik Sijens (Fryske Akademy, Leeuwarden) | |
| 15:05 | Short Discussion and Round up | |
| Specialised content: collocations, metaphors, valency & terminology | ||
| 15:15 | Valency database as a preparation for online Valency dictionary of Croatian verbs | Matea Birtić (Institute of Croatian Language and Linguistics) | 
| Croatian Metaphor Repository | Kristina Štrkalj Despot (Institute of Croatian Language and Linguistics) | |
| Workflow of a Portuguese dictionary of multiword expressions | Amália Mendes,	Sandra Antunes, Luísa Pereira, Fernanda Bacelar do Nascimento (Centre for Linguistics at the University of Lisbon) | |
| Estonian Collocations Dictionary: conception and implementation | Jelena Kallas, Maria Tuulik (Institute for Estonian Language, Tallinn) | |
| Handbook of Slovak Nouns | Radovan Garabik (Ľ. Štúr Institute of Linguistics, Bratislava) | |
| Workflow and problems of a multilingual electronic dictionary: the case of PORTLEX | María José	Domínguez Vázquez, Mónica Mirazo Balsa, Carlos Valcárcel Riveiro	(Universidad de Santiago de Compostela) | |
| The Croatian National Termbank Struna – the workflow of a terminological project | Ana	Ostroški Anić (Institute of Croatian Language and Linguistics) | |
| 15:50 | Short Discussion and Round up | |
| User involvement & Visualisation | ||
| 16:00 | The involvement of native speakers in the lexicographic phase of data acquisition | Styliani Markantonatou, Katarina Tzortzi (Institute for Language and Speech Processing, Athens) | 
| Expanding the Work Phases Model: User and Expert Involvement in the Construction of Online Specialised Dictionaries | Patrick Leroyer (Aarhus University, School of Business and Social Sciences) | |
| Visualising Distributional Models for Corpus-Driven Lexicography | Kris Heylen (University of Leuven) | |
| 16:15 | Final Discussion and Round up (including preparation for next meeting(s)) | |
| 16:45 | Closing | 
In this meeting we will focus on the workflow of corpus-based lexicography.
The changes in the construction of dictionaries raise a number of research questions, which we aim to address during the Bolzano meeting:
- Can different online dictionaries (under construction) fit their lexicographical process into the proposed phases?
- How to deal with version control and archiving of different versions of the dictionary?
- Print dictionaries mostly follow the alphabetic order of the entries, this is not necessarily so for online dictionaries. What guidelines are used in the different projects for determining which entries should be described and in which order they should be described?
- How can visualisation of lexicographical information influence the lexicographical process?
- How do the changes in the lexicographical process affect the user?
- How can users be involved in the lexicographical process and during which phase should this be done?
- How does automatic extraction of lexicographical data from corpora affect the lexicographical process?
For the Bolzano meeting we ask contributions (2-4 pages) to be presented at the meeting. You can use a sample template that you can download from this web page. You are, however, free to use a different format. The contributions should describe the lexicographical workflow of a project using the phases described in the documents available on this web page, plus an indication of the time span for each phase. Contributions addressing one of the other research questions above can also be submitted.
Documents for download
- Task description:
 LexicographicalWorkflow_final.pdf
- Template:
 Stylesheet_Template_COST_Bolzano2014.doc
- Reference 1:
 TiberiusSchoonheim_The ANW and its Lexicographical Process_Preprint.pdf
