de.tudarmstadt.ukp.jwktl.parser
Interface IWiktionaryDumpParser

All Known Implementing Classes:
WiktionaryDumpParser, XMLDumpParser

public interface IWiktionaryDumpParser

Parser for Wiktionary dump files obtained from http://download.wikimedia.org/backup-index.html.

Author:
Christian M. Meyer

Method Summary
 Iterable<IWiktionaryPageParser> getPageParsers()
          Returns the list of all registered IWiktionaryPageParsers.
 void parse(File dumpFile)
          Starts the parsing of the given dump file.
 void register(IWiktionaryPageParser pageParser)
          Register the given IWiktionaryPageParser.
 

Method Detail

parse

void parse(File dumpFile)
           throws WiktionaryException
Starts the parsing of the given dump file. The file can be either bzip2-compressed or the extracted XML version.

Throws:
WiktionaryException - in case of any parser errors.

register

void register(IWiktionaryPageParser pageParser)
Register the given IWiktionaryPageParser. The registered parser will then be notified once a Wiktionary-related XML tag has been processed.


getPageParsers

Iterable<IWiktionaryPageParser> getPageParsers()
Returns the list of all registered IWiktionaryPageParsers.



Copyright © 2011-2013 Ubiquitous Knowledge Processing (UKP) Lab. All Rights Reserved.