de.tudarmstadt.ukp.jwktl.parser.ru.wikokit.base.wikt.multi.en
Class WEtymologyEn

java.lang.Object
  extended by de.tudarmstadt.ukp.jwktl.parser.ru.wikokit.base.wikt.multi.en.WEtymologyEn

public class WEtymologyEn
extends Object

Etymology part of English Wiktionary article. Etymology is a level 3 header in English Wiktionary: 1)

 ===Noun===
 ===Etymology=== (level 3 in English Wiktionary)
 ===Noun===
 ===Verb===

 ==Finnish==
 ===Etymology===
 ===Noun===
2)
 Also level 3 in the case of multiple etymologies:
 ===Etymology 1===        (level 3)
 ====Pronunciation====
 ====Noun====
 ===Etymology 2===        (level 3)
 ====Pronunciation====
 ====Noun====
 ====Verb====
See http://en.wiktionary.org/wiki/Wiktionary:Entry_layout_explained


Constructor Summary
WEtymologyEn()
           
 
Method Summary
static LangText[] splitToEtymologySections(String page_title, LangText lt_source)
          Splits text to fragments related to different etymologies.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

WEtymologyEn

public WEtymologyEn()
Method Detail

splitToEtymologySections

public static LangText[] splitToEtymologySections(String page_title,
                                                  LangText lt_source)
Splits text to fragments related to different etymologies. page_title - word which are described in this article 'text'

Parameters:
lt - .text will be parsed and splitted, .lang is not using now, may be in future... 1) Checks whether exists more than one section ===Etymology=== 2) If there is only one or zero sections then return lt_source If there more than one sections then split it.


Copyright © 2011-2013 Ubiquitous Knowledge Processing (UKP) Lab. All Rights Reserved.