Package de.tudarmstadt.ukp.jwktl.parser

Interface Summary
IWiktionaryDumpParser Parser for Wiktionary dump files obtained from http://download.wikimedia.org/backup-index.html.
IWiktionaryEntryParser A parser for separating an article page's text into individual Wiktionary word entries.
IWiktionaryPageParser Generic interface for parsing XML dumps in a MediaWiki format.
IWritableWiktionaryEdition Generic interface for writable Wiktionary language editions used by the parsers to store the extracted entries and information types.

Class Summary
WiktionaryArticleParser Parses a Wiktionary XML dump and stores the parsed information as a Berkeley DB within a specified directory.
WiktionaryDumpParser Extension of the XMLDumpParser that reads the different XML tags of the Wiktionary XML dump file format and provides hotspots (overridable hook methods) for each type of information.
WiktionaryEntryParser Base implementation for parsing the textual contents of an article page in order to construct IWiktionaryEntry and IWiktionarySense instances.
WiktionaryPageParser<PageType extends WiktionaryPage> Abstract base class for implementations of the IWiktionaryPageParser interface.
WritableBerkeleyDBWiktionaryEdition Extends the Berkeley DB implementation so that its contents can be modified.
XMLDumpParser Implementation of IWiktionaryDumpParser for processing XML files downloaded from http://download.wikimedia.org/backup-index.html.
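
The "hotspot" design mentioned for WiktionaryDumpParser can be illustrated with a minimal sketch built only on the JDK's SAX API. It does not use or reproduce the JWKTL classes: the names DumpParserSketch, HotspotDumpParser, collectPages, onPageStart, onTitle, onText and onPageEnd are hypothetical, and the input is assumed to be a simplified MediaWiki-style dump with page, title and text elements. The base class reads the XML tags and calls one hook per type of information; a subclass overrides only the hooks it needs.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

public class DumpParserSketch {

    /** Hypothetical tag-dispatching base class; not the JWKTL implementation. */
    abstract static class HotspotDumpParser extends DefaultHandler {
        // Collects the character data of the element currently being read.
        private final StringBuilder buffer = new StringBuilder();

        // Hotspots: empty by default, overridden by subclasses as needed.
        protected void onPageStart() {}
        protected void onTitle(String title) {}
        protected void onText(String text) {}
        protected void onPageEnd() {}

        /** Streams the XML input and dispatches each tag to its hotspot. */
        public void parse(InputStream in) {
            try {
                SAXParserFactory.newInstance().newSAXParser().parse(in, this);
            } catch (ParserConfigurationException | SAXException | IOException e) {
                throw new IllegalArgumentException("Could not parse dump", e);
            }
        }

        @Override
        public void startElement(String uri, String localName, String qName,
                Attributes attributes) {
            buffer.setLength(0); // discard text belonging to the enclosing element
            if ("page".equals(qName)) {
                onPageStart();
            }
        }

        @Override
        public void characters(char[] ch, int start, int length) {
            buffer.append(ch, start, length);
        }

        @Override
        public void endElement(String uri, String localName, String qName) {
            switch (qName) {
                case "title": onTitle(buffer.toString()); break;
                case "text": onText(buffer.toString()); break;
                case "page": onPageEnd(); break;
                default: break; // tags without a hotspot are ignored
            }
        }
    }

    /** Returns one "title: text" string per page element in the given XML. */
    public static List<String> collectPages(String xml) {
        final List<String> pages = new ArrayList<>();
        HotspotDumpParser parser = new HotspotDumpParser() {
            private String title;
            private String text;

            @Override protected void onPageStart() { title = null; text = null; }
            @Override protected void onTitle(String t) { title = t; }
            @Override protected void onText(String t) { text = t; }
            @Override protected void onPageEnd() { pages.add(title + ": " + text); }
        };
        parser.parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
        return pages;
    }

    public static void main(String[] args) {
        String xml = "<mediawiki>"
                + "<page><title>dog</title><revision><text>==English==</text></revision></page>"
                + "<page><title>cat</title><revision><text>==English==</text></revision></page>"
                + "</mediawiki>";
        collectPages(xml).forEach(System.out::println);
    }
}
```

Keeping the tag handling in the base class and the storage logic in the subclass mirrors the split this package describes between dump parsing and page or entry parsing. The actual method names and signatures should be taken from the individual class pages.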

Copyright © 2011-2013 Ubiquitous Knowledge Processing (UKP) Lab. All Rights Reserved.