de.tudarmstadt.ukp.jwktl.parser.ru.wikokit.base.wikt.multi.en
Class WEtymologyEn
java.lang.Object
de.tudarmstadt.ukp.jwktl.parser.ru.wikokit.base.wikt.multi.en.WEtymologyEn
public class WEtymologyEn
- extends Object
Etymology part of English Wiktionary article.
Etymology is a level 3 header in English Wiktionary:
1)
===Noun===
===Etymology=== (level 3 in English Wiktionary)
===Noun===
===Verb===
==Finnish==
===Etymology===
===Noun===
2)
Also level 3 in the case of multiple etymologies:
===Etymology 1=== (level 3)
====Pronunciation====
====Noun====
===Etymology 2=== (level 3)
====Pronunciation====
====Noun====
====Verb====
See http://en.wiktionary.org/wiki/Wiktionary:Entry_layout_explained
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
WEtymologyEn
public WEtymologyEn()
splitToEtymologySections
public static LangText[] splitToEtymologySections(String page_title,
LangText lt_source)
- Splits text to fragments related to different etymologies.
page_title - word which are described in this article 'text'
- Parameters:
lt
- .text will be parsed and splitted,
.lang is not using now, may be in future...
1) Checks whether exists more than one section ===Etymology===
2) If there is only one or zero sections then return lt_source
If there more than one sections then split it.
Copyright © 2011-2013 Ubiquitous Knowledge Processing (UKP) Lab. All Rights Reserved.