4396

Using iDocument for Document Categorization in Nepomuk Social Semantic Desktop

Benjamin Adrian, Martin Klinkigt, Heiko Maus, Andreas Dengel

Proceedings of I-KNOW 09 and I-SEMANTICS 09 International Conference on Semantic Systems (I-Semantics-09), September 2-4, Graz, Austria , Pages: 638-643 , Verlag der Technischen Universität Graz, Graz , 2009
On the Semantic Desktop users maintain their model of the world in a formal personal information model ontology. Concepts from this ontology are used to annotate documents from desktop, allowing efficient navigation and browsing of these. However, the mental overhead required for correctly classifying new incoming document is substantial. We present the integration of the ontology-based information extraction system iDocument into the Nepomuk Semantic Desktop for classifying documents within the personal information model. A comparison is done between iDocument and the original classification system Structure Recommender. It is based on real models and documents from five Nepomuk users. Results reveal evidences that iDocument's categorization proposals are rated with higher recall and precision values and show that iDocument's result ranking corresponds to user ratings.

Show BibTex:

@inproceedings {
       abstract = {On the Semantic Desktop users maintain their model of the world in a formal personal information model ontology. Concepts from this ontology are used to annotate documents from desktop, allowing efficient navigation and browsing of these. However, the mental overhead required for correctly classifying new incoming document is substantial. We present the integration of the ontology-based information extraction system iDocument into the Nepomuk Semantic Desktop for classifying documents within the personal information model. A comparison is done between iDocument and the original classification system Structure Recommender. It is based on real models and documents from five Nepomuk users. Results reveal evidences that iDocument's categorization proposals are rated with higher recall and precision values and show that iDocument's result ranking corresponds to user ratings.},
       number = {}, 
       month = {9}, 
       year = {2009}, 
       title = {Using iDocument for Document Categorization in Nepomuk Social Semantic Desktop}, 
       journal = {}, 
       volume = {}, 
       pages = {638-643}, 
       publisher = {Verlag der Technischen Universität Graz, Graz}, 
       author = {Benjamin Adrian, Martin Klinkigt, Heiko Maus, Andreas Dengel}, 
       keywords = {semantic desktop, ontology-based information extraction, text classification, personal information model},
       url = {http://www.i-semantics.at/2009/papers/using_idocument_for_document_categorization.pdf, http://www.dfki.de/web/forschung/publikationen/renameFileForDownload?filename=using_idocument_for_document_categorization.pdf&file_id=uploads_392}
}