mdh.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Term Ranking Adaptation to the Domain: Genetic Algorithm-Based Optimisation of the C-Value
LIMSI CNRS, Orsay, France.;Univ Paris 13, Sorbonne Paris Cite, Paris, France..
Mälardalen University, School of Education, Culture and Communication, Educational Sciences and Mathematics. (MAM)ORCID iD: 0000-0002-1624-5147
Mälardalen University, School of Education, Culture and Communication, Educational Sciences and Mathematics. (MAM)ORCID iD: 0000-0003-4554-6528
2014 (English)In: ADVANCES IN NATURAL LANGUAGE PROCESSING / [ed] Przepiorkowski, A Ogrodniczuk, M, SPRINGER INT PUBLISHING AG , 2014, p. 71-+Conference paper, Published paper (Refereed)
Abstract [en]

Term extraction methods based on linguistic rules have been proposed to help the terminology building from corpora. As they face the difficulty of identifying the relevant terms among the noun phrases extracted, statistical measures have been proposed. However, the term selection results may depend on corpus and strong assumptions reflecting specific terminological practice. We tackle this problem by proposing a parametrised C-Value which optimally considers the length and the syntactic roles of the nested terms thanks to a genetic algorithm. We compare its impact on the ranking of terms extracted from three corpora. Results show average precision increased by 9% above the frequency-based ranking and by 12% above the C-Value-based ranking.

Place, publisher, year, edition, pages
SPRINGER INT PUBLISHING AG , 2014. p. 71-+
Series
Lecture Notes in Computer Science, ISSN 0302-9743 ; 8686
Keywords [en]
Terminology, term extraction, term ranking, genetic algorithm
National Category
Mathematics Computational Mathematics
Research subject
Mathematics/Applied Mathematics
Identifiers
URN: urn:nbn:se:mdh:diva-38387ISI: 000348926100008ISBN: 978-3-319-10888-9 OAI: oai:DiVA.org:mdh-38387DiVA, id: diva2:1182174
Conference
9th International Conference on Natural Language Processing (NLP), SEP 17-19, 2014, Warsaw, POLAND
Available from: 2018-02-12 Created: 2018-02-12 Last updated: 2018-11-05Bibliographically approved

Open Access in DiVA

No full text in DiVA

Authority records BETA

Engström, ChristopherSilvestrov, Sergei

Search in DiVA

By author/editor
Engström, ChristopherSilvestrov, Sergei
By organisation
Educational Sciences and Mathematics
MathematicsComputational Mathematics

Search outside of DiVA

GoogleGoogle Scholar

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 2 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf