Automatic Generation of RSS Feed based on HTML Document Structure Analysis

Accession number;05A0661034
Title;Automatic Generation of RSS Feed based on HTML Document Structure Analysis
Author; NANNO TOMOYUKI (Grad. Sch. at Nagatsuta Tokyo Inst. of Technol.) OKUMURA MANABU (Tokyokodai Seimitsukogakuken)
Journal Title;Proceedings of the Annual Conference on JSAI (CD-ROM)
Journal Code:X0580B
ISSN:
VOL.19th;NO.;PAGE.1A4-01(2005)
Figure&Table&Reference;FIG.6, REF.10
Pub. Country;Japan
Language;Japanese
Abstract;In this paper, we present a system to automatically generate RSS Feeds from HTML documents which include time-series information with date expressions (e.g., archives of weblogs, BBSs, chats, and mailing lists, update descriptions on a site page, announcements of events, and so on). Our system is based on extraction of date expressions, structure analysis of HTML documents, and title detection/generation from the contents. (author abst.)