Paper Browsing System with Structure Analysis and Displaying Annotation on Side-note Windows Takeshi Abekawa Akiko Aizawa National Institute of Informatics Abstract: In this paper, we introduce our on-going efforts to construct a scientific paper browsing system to assist users to read and understand advanced technical content. The paper features on two major functions that are prerequisite for such systems: document structure analysis for image, PDF, and XML formatted articles, and automatic link detection that help users access richer information from diverse external sources. We also present technical details of our current implementation to generate and display the linked external data in side-note windows with a target paper image. 1 XML 1 XML PDF XHTML EPUB 1 J-STAGE 3 XML 1 XML XML XHTML Web PMC(PubMed Central) PubReader 2 PDF PDF PDF 101-8430 2-1-2 E-mail: abekawa@nii.ac.jp 1 https://www.jstage.jst.go.jp/pub/html/ay04s230 ja.html 2 http://www.ncbi.nlm.nih.gov/pmc/about/pubreader/ PDF PDF Web SideNoter XML PDF PDF OCR 100% 20 3 PC 3 http://kmcs.nii.ac.jp/nlp annual/ - 13 -
1: SideNoter 2 PDF PDF EndNote 4 Mendeley 5 1 Active Reading [1] XLibris[4] 1998 LiquidText[5] TextTearing[2] 4 http://endnote.com/ 5 http://www.mendeley.com/ Web [10] [8] 3 1 PC Web 3.1.2-14 -
JPEG TIFF PDF PNG DOC TeX PDF XML 2: 3.1 2 3.1.1 PDF OCR PDF L A TEX Microsoft Word PDF PDF PDF PDF OCR PDF Poppler 6 pdftotext 3.1.2 6 http://poppler.freedesktop.org/ PNG PDF PNG PDF PNG Poppler pdftocairo PDF imagemagick 7 ( ) PNG Web 6 64 4 ( PDF XML XHTML EPUB)) PDF XML Web 5 Web PDF XML 7 http://imagemagick.org/ - 15 -
PDF XML 1 Web 2 4 PDF 2 1 XML 8 PDF 8 k2pdfopt: http://www.willus.com/k2pdfopt/ 1: PDF XML Web Copy Copy Click i-linkage [9] CiNii 9 Webcat Plus 10 PDF PDF () 1 5 9 http://ci.nii.ac.jp/ 10 http://webcatplus.nii.ac.jp/ - 16 -
(Side-note) 2 Wikify[3] Amazon Kindle X-Ray 11 Wikipedia Wikipedia 1 [6] GETA[7] Wikipedia Web 6 11 http://www.amazon.com/gp/help/customer/ display.html/?nodeid=200729910 XML XML PDF XML PDF PDF XML PDF XML XML XML PDF 3 XML 7 PDF - 17 -
3:? [1] Mortimer Jerome Adler. How to Read a Book. Simon and Schuster, 1940. :.,.. 1987. [2] Franois Guimbretire Dongwook Yoon, Nicholas Chen. Texttearing: opening white space for digital ink annotation. In the 26th annual ACM symposium on User interface software and technology, pages 107 112, 2013. [3] Rada Mihalcea and Andras Csomai. Wikify!: linking documents to encyclopedic knowledge. In The 18th ACM Conference on Information and Knowledge Management, pages 233 242, 2007. [4] Morgan N. Price, Bill N. Schilit, and Gene Golovchinsky. Xlibris: the active reading machine. In CHI 98 Cconference Summary on Human Factors in Computing Systems, pages 22 23, 1998. [5] Craig S. Tashman and W. Keith Edwards. Liquidtext: A flexible, multitouch environment to support active reading. In CHI 11 Conference on Human Factors in Computing Systems, pages 3285 3204, 2011. [6] and.. In 2013-EC-27(19), 2013. [7]. GETA., 26(4):87 106, 2009. [8],,, and.., 66(11):J461 J470, 2012. [9],, and.. DBSJ Letters, 6(4):17 20, 2008. [10],, and. Web. In 2009-DBS-149(14), pages 1 6, 2009. - 18 -