The 15th ACM SIGWEB International Symposium on Document Engineering

September 8-11, Lausanne, Switzerland

Accepted Full Papers

The browser as a document composition engine. Tamir Hassan (Hewlett-Packard Laboratories), Niranjan Damera Venkata (Hewlett-Packard Laboratories)

Similarity-Based Support for Text Reuse in Technical Writing. Axel Soto (Dalhousie University), Abidalrahman Mohammad (Dalhousie University), Andrew Albert (Innovatia Inc.), Aminul Islam (Dalhousie University), Evangelos Milios (Dalhousie University), Michael Doyle (Innovatia Inc.), Rosane Minghim (Universidade de São Paulo), Maria Cristina Ferreira de Oliveira (Universidade de São Paulo)

Towards Mobile OCR: How To Take a Good Picture of a Document Without Sight. Michael Cutter (University of California at Santa Cruz), Roberto Manduchi (University of California at Santa Cruz) (Best student paper award)

Exploring scholarly papers through citations. Angelo Di Iorio (University of Bologna), Raffaele Giannella (University of Bologna), Silvio Peroni (Università di Bologna), Francesco Poggi (University of Bologna and ISTC-CNR), Fabio Vitali (University of Bologna) (Best paper award)

The Delaunay document layout descriptor. Sébastien Eskenazi (L3I - Université de La Rochelle), Petra Gomez-Krämer (L3i - University of La Rochelle), Jean-Marc Ogier (University of La Rochelle, Laboratoire L3i)

Generating Abstractive Summaries from Meeting Transcripts. Siddhartha Banerjee (Pennsylvania State University), Prasenjit Mitra (Qatar Computing Research Institute), Kazunari Sugiyama (National University of Singapore)

TEXUS: A Task-based Approach for Table Extraction and Understanding. Roya Rastan (UNSW), Hye-Young Paik (University of New South Wales), John Shepherd (University of New South Wales)

An Approach to Convert NCL Applications into Stereoscopic 3D. Roberto Gerson Azevedo (PUC-Rio), Gulherme Lima (PUC-Rio), Luiz Fernando Gomes Soares (Pontifícia Universidade Católica do Rio de Janeiro)

Spatio-temporal validation of multimedia documents. Joel Dos Santos (Universidade Federal Fluminense), Christiano Braga (Universidade Federal Fluminense), Débora C. Muchaluat Saade (Universidade Federal Fluminense), Cécile Roisin (Univ. Grenoble Alpes, Inria, LIG), Nabil Layaïda (Inria, LIG, Univ. Grenoble Alpes)

Concept Hierarchy Extraction from Textbooks. Shuting Wang (Penn State University), Chen Liang (Penn State University), Zhaohui Wu (Penn State University), Kyle Williams (Penn State University), Bart Pursel (Penn State University), C. Lee Giles (Penn State University)

Combining Advanced Information Retrieval and Text-Mining for Digital Humanities. Antoine Widlöcher (Laboratoire GREYC, Université de Caen Basse-Normandie, CNRS, UMR 6072), Nicolas Béchet (IRISA), Jean-Marc Lecarpentier (Laboratoire GREYC, Université de Caen Basse-Normandie, CNRS, UMR 6072), Yann Mathet (Laboratoire GREYC, Université de Caen Basse-Normandie, CNRS, UMR 6072), Julia Roger (Identité et Subjectivité, Université de Caen Basse-Normandie)

Accepted Short Papers (Oral Presentation)

Creating eBooks with Accessible Graphics Content. Cagatay Goncu (Monash University), Kim Marriott (Monash University)

Enhancing Exploration with a Faceted Browser through Summarization. Grzegorz Drzadzewski (University of Waterloo), Frank Tompa (University of Waterloo)

MSoS: A Multi-Screen-Oriented Web Page Segmentation Approach. Mira Sarkis (Telecom ParisTech), Cyril Concolato (Telecom ParisTech), Jean-Claude Dufourd (Telecom Paristech)

Multi-oriented Text Extraction from Information Graphics. Falk Böschen (Kiel University), Ansgar Scherp (ZBW Leibniz Information Centre for Economics and Kiel University)

VEDD: A Visual Editor for Creation and Semi-Automatic Update of Derived Documents. Kim Marriott (Monash University), Mingzheng Shi (Monash University), Michael Wybrow (Monash University)

Hiding information in multiple level-line moirés. Thomas Walger (EPFL), Roger David Hersch (EPFL)

BBookX: An Automatic Book Creation Framework. Chen Liang (The Pennsylvania State University), Shuting Wang (Penn State University), Zhaohui Wu (The Pennsylvania State University), Kyle Williams (The Pennsylvania State University), Bart Pursel (The Pennsylvania State University), C. Lee Giles (The Pennsylvania State University)

Detecting XSLT Rules Affected by Schema Evolution. Yang Wu (Graduate School of Library, Information and Media Studies University of Tsukuba), Nobutaka Suzuki (Faculty of Library, Information and Media Science University of Tsukuba)

Document Layout Optimization with Automated Paraphrasing. Yusuke Kido (The University of Tokyo), Hikaru Yokono (The National Institute of Informatics), Goran Topić (The National Institute of Informatics), Akiko Aizawa (The National Institute of Informatics)

A Quantitative and Qualitative Assessment of Automatic Text Summarization Systems. Jamilson Batista (Federal University of Pernambuco), Rodolfo Ferreira (Federal Rural University of Pernambuco), Hilário Tomaz (Federal University of Pernambuco), Rafael Ferreira (Federal Rural University of Pernambuco), Rafael Dueire Lins (Federal University of Pernambuco), Steven Simske (Hewlett-Packard Labs), Gabriel Silva (Federal Rural University of Pernambuco), Marcelo Riss (Hewlett-Packard Brazil)

Automatic Extraction of Figures from Scholarly Documents. Sagnik Ray Choudhury (The Pennsylvania State University), Prasenjit Mitra (The Pennsylvania State University), C. Lee Giles (Pennsylvania State University)

Knuth-Plass revisited: Flexible line-breaking for automatic document layout. Tamir Hassan (Hewlett-Packard Laboratories), Andrew Hunter (Hewlett-Packard Laboratories)

Investigation of Ancient Manuscripts based on Multispectral Imaging. Fabian Hollaus (Vienna University of Technology - Computer Vision Lab), Markus Diem (Vienna University of Technology - Computer Vision Lab), Stefan Fiel (Vienna University of Technology - Computer Vision Lab), Florian Kleber (Vienna University of Technology - Computer Vision Lab), Robert Sablatnig (Vienna University of Technology - Computer Vision Lab)

Automatic Document Classification using Summarization Strategies. Rafael Ferreira (Federal University of Pernambuco), Rafael Lins (Federal University of Pernambuco), Fred Freitas (Federal University of Pernambuco), Luciano Cabral (Federal University of Pernambuco), Steven Simske (Hewlett-Packard Labs), Marcelo Riss (Hewlett-Packard Brazil)

Efficient Computation of Co-occurrence Based Word Relatedness. Jie Mei (Dalhousie University), Xinxin Kou (Dalhousie University), Zhimin Yao (Dalhousie University), Andrew Rau-Chaplin (Faculty of Computer Science, Dalhousie University), Aminul Islam (Faculty of Computer Science, Dalhousie University), Abidalrahman Moh'D (Faculty of Computer Science, Dalhousie University), Evangelos Milios (Faculty of Computer Science, Dalhousie University)

Interlinking English and Chinese RDF Data Using BabelNet. Tatiana Lesnikova (INRIA), Jérôme Euzenat (INRIA & Univ. Grenoble), Jérôme David (INRIA Rhône-Alpes)

Filling the gaps: Improving Wikipedia stubs. Siddhartha Banerjee (Pennsylvania State University), Prasenjit Mitra (Qatar Computing Research Institute)

Madoko: Scholarly Documents for the Web. Daan Leijen (Microsoft Research)

Accepted Short Papers (Poster Presentation)

Fine Grained Access Interactive Personal Health Records. Helen Balinsky (HP Laboratories), Nassir Mohammad (HP)

Does a Split-View Aid Navigation Within Academic Documents?. Juliane Franze (Monash University and Fraunhofer), Kim Marriott (Monash University), Michael Wybrow (Monash University)

An Approach for Designing Proofreading Views in Publishing Chains. Léonard Dumas (Université de Technologie de Compiègne), Stéphane Crozat (Université de Technologie de Compiègne), Bruno Bachimont (Université de Technologie de Compiègne), Sylvain Spinelli (Kelis)

High-Quality Capture of Documents on a Cluttered Tabletop with a 4K Video Camera. Chelhwon Kim (University of California, Santa Cruz), Patrick Chiu (FXPAL), Henry Tang (FXPAL)

Segmentation of overlapping digits through the emulation of a hypothetical ball and physical forces. Alberto Nicodemus Lopes Filho (CIn - UFPE), Carlos Mello (Universidade Federal de Pernambuco)

AERO: An extensible framework for adaptive web layout synthesis. Rares Vernica (HP Labs), Niranjan Damera Venkata (HP Labs)

Automatic Text Document Summarization Based on Machine Learning. Gabriel Silva (Federal University of Pernambuco), Rafael Lins (Federal University of Pernambuco), Luciano Cabral (CIn-UFPE), Rafael Ferreira (Federal University of Pernambuco), Hilário Tomaz (Federal University of Pernambuco), Steven Simske (Hewlett-Packard Labs), Marcelo Riss (Hewlett-Packard)

Searching Live Meeting Documents "Show me the Action". Laurent Denoue (FXPAL), Scott Carter (FXPAL), Matthew Cooper (FXPAL)

Multimedia Document Structure for Distributed Theatre. Jack Jansen (CWI: Centrum Wiskunde & Informatica), Michael Frantzis (Goldsmiths), Pablo Cesar (CWI: Centrum Wiskunde & Informatica)

Change Classification in Graphics-Intensive Digital Documents. Jeremy Svendsen (University of Victoria), Alexandra Branzan Albu (University of Victoria)