de.tudarmstadt.ukp.jwktl.parser
Interface IWiktionaryPageParser

All Known Implementing Classes:
WikisaurusArticleParser, WiktionaryArticleParser, WiktionaryPageParser

public interface IWiktionaryPageParser

Generic interface for parsing XML dumps in a MediaWiki format.

Author:
Christian M. Meyer

Method Summary
 void onClose(IDumpInfo dumpInfo)
          Hotspot that is invoked after the parser has finished its work.
 void onPageEnd()
          Hotspot that is invoked upon finishing the current article page.
 void onPageStart()
          Hotspot that is invoked upon starting a new article page.
 void onParserEnd(IDumpInfo dumpInfo)
          Hotspot that is invoked when the parser is about to end.
 void onParserStart(IDumpInfo dumpInfo)
          Hotspot that is invoked upon starting the parser.
 void onSiteInfoComplete(IDumpInfo dumpInfo)
          Hotspot that is invoked after the siteinfo header has been read.
 void setAuthor(String author)
          Hotspot that is invoked after the current page's author is read.
 void setPageId(long pageId)
          Hotspot that is invoked after the current page's id is read.
 void setRevision(long revisionId)
          Hotspot that is invoked after the current page's revision id is read.
 void setText(String text)
          Hotspot that is invoked after the current page's text is read.
 void setTimestamp(Date timestamp)
          Hotspot that is invoked after the current page's timestamp is read.
 void setTitle(String title, String namespace)
          Hotspot that is invoked after the current page's title is read.
 

Method Detail

onParserStart

void onParserStart(IDumpInfo dumpInfo)
Hotspot that is invoked upon starting the parser.


onSiteInfoComplete

void onSiteInfoComplete(IDumpInfo dumpInfo)
Hotspot that is invoked after the siteinfo header has been read. At this point, the dump info is complete, including the dump language and namespaces.
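
The following is a minimal sketch of why namespace-dependent setup might be deferred to onSiteInfoComplete rather than done in onParserStart. It does not use the real JWKTL types: DumpInfoSketch is a hypothetical stand-in for IDumpInfo, and its getNamespaces() accessor is an assumption made for illustration, as is the NamespaceFilter class.

```java
import java.util.LinkedHashSet;
import java.util.Set;

// Hypothetical stand-in for IDumpInfo; the accessor name is an assumption.
interface DumpInfoSketch {
    Set<String> getNamespaces();
}

// Hypothetical helper that only trusts the namespaces once the siteinfo
// header has been read.
class NamespaceFilter {
    private Set<String> known = null;

    // The siteinfo header has not been read yet, so only reset state here.
    void onParserStart(DumpInfoSketch dumpInfo) {
        known = null;
    }

    // The dump info is complete at this point, so the namespaces can be copied.
    void onSiteInfoComplete(DumpInfoSketch dumpInfo) {
        known = new LinkedHashSet<>(dumpInfo.getNamespaces());
    }

    boolean isKnownNamespace(String namespace) {
        if (known == null) {
            throw new IllegalStateException("siteinfo header not read yet");
        }
        return known.contains(namespace);
    }
}
```

Failing fast before the header is read is one possible design choice; an implementation could equally treat every namespace as unknown until then.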


onParserEnd

void onParserEnd(IDumpInfo dumpInfo)
Hotspot that is invoked when the parser is about to end. Use this method to write any pending information to a file or database. To close and clean up resources, use the onClose(IDumpInfo) hotspot instead.


onClose

void onClose(IDumpInfo dumpInfo)
Hotspot that is invoked after the parser has finished its work. This method should close and clean up any resources (e.g., a database connection). It is called after all onParserEnd(IDumpInfo) calls have been handled.
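
The division of labor between onParserEnd and onClose can be sketched as follows. This is an illustration under stated assumptions, not JWKTL code: DumpInfoStub and ParserLifecycle are hypothetical stand-ins for IDumpInfo and the two parser-level hotspots, and a StringWriter plays the role of the file or database.

```java
import java.io.StringWriter;
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for IDumpInfo so the sketch compiles on its own.
interface DumpInfoStub {}

// Hypothetical reduced view of the two parser-level hotspots.
interface ParserLifecycle {
    void onParserEnd(DumpInfoStub dumpInfo);
    void onClose(DumpInfoStub dumpInfo);
}

class BufferedTitleSink implements ParserLifecycle {
    private final List<String> pending = new ArrayList<>();
    private final StringWriter out = new StringWriter(); // stands in for a file or database
    private boolean closed = false;

    void add(String title) {
        pending.add(title);
    }

    // Write whatever is still buffered, but keep the resource open.
    @Override
    public void onParserEnd(DumpInfoStub dumpInfo) {
        for (String title : pending) {
            out.write(title + "\n");
        }
        pending.clear();
    }

    // Release the resource; a real sink would close its connection here.
    @Override
    public void onClose(DumpInfoStub dumpInfo) {
        closed = true;
    }

    String written() { return out.toString(); }
    boolean isClosed() { return closed; }
}
```

Keeping the write in onParserEnd means the resource is still open while pending data is flushed, and onClose only has to release it.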


onPageStart

void onPageStart()
Hotspot that is invoked upon starting a new article page.


onPageEnd

void onPageEnd()
Hotspot that is invoked upon finishing the current article page.


setAuthor

void setAuthor(String author)
Hotspot that is invoked after the current page's author is read.


setRevision

void setRevision(long revisionId)
Hotspot that is invoked after the current page's revision id is read.


setTimestamp

void setTimestamp(Date timestamp)
Hotspot that is invoked after the current page's timestamp is read.


setPageId

void setPageId(long pageId)
Hotspot that is invoked after the current page's id is read.


setTitle

void setTitle(String title,
              String namespace)
Hotspot that is invoked after the current page's title is read.

Parameters:
title - the title of the current page
namespace - the namespace of the current page

setText

void setText(String text)
Hotspot that is invoked after the current page's text is read.
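
The page-level hotspots above can be combined into a simple collector. The sketch below assumes that all setters for a page are invoked between onPageStart and onPageEnd, which this documentation does not state explicitly. PageHotspots is a hypothetical reduced copy of the page-level methods so the example compiles without JWKTL, and PageCollector and Page are illustrative names.

```java
import java.util.ArrayList;
import java.util.Date;
import java.util.List;

// Hypothetical reduced copy of the page-level hotspots of IWiktionaryPageParser.
interface PageHotspots {
    void onPageStart();
    void onPageEnd();
    void setAuthor(String author);
    void setRevision(long revisionId);
    void setTimestamp(Date timestamp);
    void setPageId(long pageId);
    void setTitle(String title, String namespace);
    void setText(String text);
}

class PageCollector implements PageHotspots {
    // Plain holder for the fields reported for one page.
    static final class Page {
        long id;
        long revision;
        String title;
        String namespace;
        String author;
        Date timestamp;
        String text;
    }

    private final List<Page> pages = new ArrayList<>();
    private Page current;

    // Guards against setters being called outside a page (an assumed contract).
    private Page page() {
        if (current == null) {
            throw new IllegalStateException("no page in progress");
        }
        return current;
    }

    @Override public void onPageStart() { current = new Page(); }
    @Override public void onPageEnd() { pages.add(page()); current = null; }
    @Override public void setAuthor(String author) { page().author = author; }
    @Override public void setRevision(long revisionId) { page().revision = revisionId; }
    @Override public void setTimestamp(Date timestamp) { page().timestamp = timestamp; }
    @Override public void setPageId(long pageId) { page().id = pageId; }
    @Override public void setTitle(String title, String namespace) {
        page().title = title;
        page().namespace = namespace;
    }
    @Override public void setText(String text) { page().text = text; }

    List<Page> pages() { return pages; }
}
```

A new Page is created per onPageStart so that fields left unset for one page never leak into the next.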



Copyright © 2011-2013 Ubiquitous Knowledge Processing (UKP) Lab. All Rights Reserved.