de.tudarmstadt.ukp.jwktl.parser.ru
Class RUWiktionaryEntryParser

java.lang.Object
  extended by de.tudarmstadt.ukp.jwktl.parser.WiktionaryEntryParser
      extended by de.tudarmstadt.ukp.jwktl.parser.ru.RUWiktionaryEntryParser
All Implemented Interfaces:
IWiktionaryEntryParser

public class RUWiktionaryEntryParser
extends WiktionaryEntryParser

An implementation of the IWiktionaryEntryParser interface for parsing the contents of article pages from the Russian Wiktionary. The entry parser is delegate its calls to the Wikokit API.

Author:
Yevgen Chebotar, Christian M. Meyer

Field Summary
 
Fields inherited from class de.tudarmstadt.ukp.jwktl.parser.WiktionaryEntryParser
COMMENT_PATTERN, entryId, handlers, IMAGE_PATTERN, language, redirectTemplate, REFERENCES_PATTERN
 
Constructor Summary
RUWiktionaryEntryParser()
          Initializes the Russian entry parser.
 
Method Summary
protected  ParsingContext createParsingContext(WiktionaryPage page)
           
 boolean isStartOfBlock(String line)
          Hotspot for deciding if the given line is a potential start of a new article constituent.
 void parse(WiktionaryPage page, String text)
          Creates Wiktionary word entry instances from the provided text, and adds them to the given article page.
 IBlockHandler selectHandler(String line)
          Find a handler that is willing to handle the given line.
 
Methods inherited from class de.tudarmstadt.ukp.jwktl.parser.WiktionaryEntryParser
checkForRedirect, getLanguage, register
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

RUWiktionaryEntryParser

public RUWiktionaryEntryParser()
Initializes the Russian entry parser. That is, the language and the redirection pattern is defined.

Method Detail

parse

public void parse(WiktionaryPage page,
                  String text)
Description copied from interface: IWiktionaryEntryParser
Creates Wiktionary word entry instances from the provided text, and adds them to the given article page.

Specified by:
parse in interface IWiktionaryEntryParser
Overrides:
parse in class WiktionaryEntryParser

createParsingContext

protected ParsingContext createParsingContext(WiktionaryPage page)
Specified by:
createParsingContext in class WiktionaryEntryParser

selectHandler

public IBlockHandler selectHandler(String line)
Description copied from class: WiktionaryEntryParser
Find a handler that is willing to handle the given line.

Overrides:
selectHandler in class WiktionaryEntryParser

isStartOfBlock

public boolean isStartOfBlock(String line)
Description copied from class: WiktionaryEntryParser
Hotspot for deciding if the given line is a potential start of a new article constituent. This may include headlines, templates, or other typographic variants.

Specified by:
isStartOfBlock in class WiktionaryEntryParser


Copyright © 2011-2013 Ubiquitous Knowledge Processing (UKP) Lab. All Rights Reserved.