de.tudarmstadt.ukp.jwktl.parser.util
Interface IDumpInfo

All Known Implementing Classes:
DumpInfo

public interface IDumpInfo

Data object for information on the IWiktionaryDumpParser. This dump parser creates and maintains an instance of this type to share information on the dump file with all its registed IWiktionaryPageParser.

Author:
Christian M. Meyer

Method Summary
 File getDumpFile()
          Returns the current Wiktionary XML dump file, which is being parsed.
 ILanguage getDumpLanguage()
          Returns the language of the Wiktionary edition this dump file belongs to.
 Set<String> getNamespaces()
          Returns a set containing all namespaces registered in the siteinfo header of the XML dump file.
 IWiktionaryDumpParser getParser()
          Returns the Wiktionary dump parser instance that maintains this dump info object.
 int getProcessedPages()
          Returns the number of pages in the dump file that have been processed.
 boolean hasNamespace(String namespace)
          Returns true if the specified namespace has been defined within the siteinfo header of the XML dump file.
 void reset()
          Reset the dump information, such the number of processed pages.
 

Method Detail

getDumpFile

File getDumpFile()
Returns the current Wiktionary XML dump file, which is being parsed.


getDumpLanguage

ILanguage getDumpLanguage()
Returns the language of the Wiktionary edition this dump file belongs to. The language is automatically determined from the base URL of the Wiktionary edition.


getNamespaces

Set<String> getNamespaces()
Returns a set containing all namespaces registered in the siteinfo header of the XML dump file.


hasNamespace

boolean hasNamespace(String namespace)
Returns true if the specified namespace has been defined within the siteinfo header of the XML dump file.


getProcessedPages

int getProcessedPages()
Returns the number of pages in the dump file that have been processed.


getParser

IWiktionaryDumpParser getParser()
Returns the Wiktionary dump parser instance that maintains this dump info object.


reset

void reset()
Reset the dump information, such the number of processed pages. This is to be called before a new dump file is being processed. Users of this interface should normally not invoke this method.



Copyright © 2011-2013 Ubiquitous Knowledge Processing (UKP) Lab. All Rights Reserved.