Directorate-General for Translation
IATE
EU tool for translation-oriented terminology work
Christine Herwig
Terminology Coordination
DGT-D.3.2
European Commission
Vienna, April 2011
EUROPEAN COMMISSION
Unique?
8.6 million terms
23 EU languages
+ others
CAT tool for over
4 400 EU translators
- 2 -
History
IATE = “Inter-Active Terminology for Europe”
Objective: creation of an interactive terminology database for the
consultation, storage and joint management of terminological data
IATE partners: European Parliament, Council, Commission, Court of Justice,
Court of Auditors, European Economic and Social Committee, Committee of
the Regions, European Investment Bank, European Central Bank,
Translation Centre
Development work started in January 2000.
Internally IATE became operational at the beginning of 2005.
- 3 -
Interactive tool serving translators
Council
of the EU
650
European Commission
1750
European
Parliament
760
110
Translation Centre
100
Court of Auditors
350
European
Investment Bank
30
70
Court of Justice
620
European Central Bank
Interpretation services
Lawyer-linguists
- 4 -
Committee of the Regions and
European Economic and Social
Committee
Interinstitutional management
 IATE Management Group - IMG
(one official representative per IATE partner)
 Common rules and procedures
(“Best Practice”, “Input Manual”)
 Coordination of consolidation efforts
 Coordination of terminology work for the “new languages”
 Decisions on technical development
 Technical support, maintenance and development
- 5 -
How DGT organises terminology work
Other EU institutions
and bodies
IATE
“Terminology Coordination”
Terminology services
and
organisations
Terminology projects
Terminology Board
Terminologists
Terminologists
Terminologists
Terminologists
Translators
- 6 -
Terminologists
Terminologists
How multilingual work is planned
• Terminologists ask translators about their needs
Needs
• Terminology Coordination Team
+
“WP Advisory Committee”
Draft
• “Terminology Board”
Work programme
- 7 -
Translation-oriented work
systematic + ad hoc
subject-driven + text-driven
prescriptive + descriptive
multilingual + language-specific
- 8 -
European Commission - Right of initiative
The Commission’s ‘right of initiative’ means it is the main source of draft legislation.
It drafts and then implements acts adopted by the Council and the European Parliament.
New
subject fields
New
terminology
- 9 -
IATE - Present situation
Total
Commission
Concepts
1.5 million
0.9 million
Terms
8.6 million
5.7 million
Abbreviations
Phrases
0.5 million
> 0.2 million
0.15 million
< 0.1 million
approx.
60 %
7 March 2011
- 10 -
Language coverage
1958
German, French, Italian and Dutch  4 official languages
1973
English and Danish  6 official languages
1981
Greek  7 official languages
1986
Spanish and Portuguese  9 official languages
1995
Finnish and Swedish  11 official languages
2004
Czech, Estonian, Hungarian, Lithuanian, Latvian, Maltese, Polish,
Slovak, Slovenian  20 official languages
2007
Bulgarian, Irish, Romanian  23 EU official languages
+
Latin
+
Languages of candidate and potential candidate countries: e.g. Croatian, Turkish
+
Languages relevant to the European External Action Service:
e.g. Russian, Chinese, Arabic
- 11 -
Terms per language
Language
Number of terms
Language
Number of terms
BG
22 092 LA
64 623
CS
31 316 LT
43 652
DA
604 572 LV
25 491
DE
1 039 010 MT
23 239
EL
522 877 NL
695 772
EN
1 416 823 PL
48 850
ES
616 088 PT
530 230
ET
28 293 RO
22 986
327 598 SK
28 626
FR
1 354 170 SL
30 531
GA
37 030 SV
314 086
HU
37 341 Others
FI
IT
24 661
705 681
7 March 2011
- 12 -
Number of potential duplicates in IATE (Jan. 2011)
Language
mt
ro
bg
ga
et
sl
sk
pl
lv
hu
cs
lt
el
sv
pt
it
da
es
nl
fi
de
fr
en
Total
Terms
21590
22304
18649
34832
27140
30110
27744
37360
25106
36955
30841
42502
522636
314176
530255
705697
604706
615304
696000
327456
1036177
1350127
1446004
8503671
Potential
duplicates
755
862
751
1739
1473
1722
1599
2230
1540
2301
2112
3104
48171
33634
56895
81253
70353
72071
82019
38836
123203
172739
186961
986323
%
3,50%
3,86%
4,03%
4,99%
5,43%
5,72%
5,76%
5,97%
6,13%
6,23%
6,85%
7,30%
9,22%
10,71%
10,73%
11,51%
11,63%
11,71%
11,78%
11,86%
11,89%
12,79%
12,93%
11,60%
- 13 -
Main challenges for DGT terminology
Identify the most efficient way of incorporating data
for the “new languages”
Respond to upcoming terminology needs
(in relation to the Commission’s right of initiative)
Reduce the number of duplicates in the legacy data
(consolidating IATE content)
- 14 -
Structure of IATE records
Language-independent
information
Language 1
EN
Term 1
Term 2
E.g.: Domain, Domain
note, Origin, Problem
language, Cross
reference
Language 2
FR
E.g.: Definition,
Reference, Note
Term 1
E.g.: Term, Reference,
Term type, Context,
Context reference,
Grammatical
information
Term 2
- 15 -
“Language-independent level” and “Language level”
- 16 -
“Term level”
- 17 -
Subject field classification - EuroVoc
http://eurovoc.europa.eu
At generic level, the thesaurus EuroVoc has a two-tier hierarchical classification:
1.
fields, identified by two-digit numbers and titles in words, e.g.:
10 EUROPEAN COMMUNITIES
2.
microthesauri, identified by four-digit numbers — the first two digits being those for the field containing the microthesaurus — followed by titles in words, e.g.:
1011 European Union law
- 18 -
In-house IATE
• Is password-protected
• Allows interactive use
• Has a “virtual antechamber”, Pre-IATE, for raw data and multilingual batch
imports
Glossaries
Collections
...
Pre-IATE
Raw data
- 19 -
Interactive Use
 New entries and modifications
• “Write” rights for translators and terminologists
 Communication
• A “mark” function allows users to add comments
to specific entries
 Validation
• Is mother-tongue-based to ensure data quality
- 20 -
IATE Public: http://iate.europa.eu
Officially opened on 28 June 2007
Shares all validated EU terminology with the public
- 21 -
Thank you for your
attention.
- 22 -
Descargar

Slide 1