Semantic PDF Segmentation for Legacy Documents in Technical - PowerPoint PPT Presentation

Apr 03, 2023 •429 likes •595 views

Semantic PDF Segmentation for Legacy Documents in Technical Documentation Jan Oevermann jan.oevermann@dfki.de SEMANTiCS 2018, Vienna, 13.09.18 Technical Documentation 2 Most common: PDF documents Digital Paper, archival &

Semantic PDF Segmentation for Legacy Documents in Technical Documentation Jan Oevermann jan.oevermann@dfki.de SEMANTiCS 2018, Vienna, 13.09.18
Technical Documentation 2 Most common: PDF documents • “Digital Paper”, archival & distribution • ISO Standard, guaranteed reproduction, ubiquitous support Best practice: XML content components • Self-contained building blocks, e.g. chapter-sized, ~150-500 words • Reuse, translation, aggregation, delivery 13.09.18 Jan Oevermann (DFKI), SEMANTiCS 2018, Vienna
Motivation 3 Online Portal Search Description Task De Desc sc De Desc sc Desc Desc Desc Task Task XML XM Task Task Task XML XM XML XML PDF PDF 13.09.18 Jan Oevermann (DFKI), SEMANTiCS 2018, Vienna
Motivation 4 Only safety information of the document I need maintenance information about the fuel injection Everything about the hydraulic pump in technical overview or technical data Faceted search Information request with semantic concepts which can be used as facets 13.09.18 Jan Oevermann (DFKI), SEMANTiCS 2018, Vienna
Motivation 5 Limitations of PDF • Semantic structure gets lost • No metadata for (overlapping) segments • Large documents (>200p) only accessible via full text search Idea • Use knowledge from structured XML content components • Manually annotated semantic concepts / metadata • Apply trained model on text extracted from PDF • Find segments which are semantically relevant 13.09.18 Jan Oevermann (DFKI), SEMANTiCS 2018, Vienna
Procedure model 6 13.09.18 Jan Oevermann (DFKI), SEMANTiCS 2018, Vienna
Training / Classification 7 Learning phase Classification Weighting Feature extraction (TF-ICF-CF) (Bag o n-grams) Training data Model (VSM) Classifier New data cosine similarity/ Prediction (unclassified) k -nearest neighbour 13.09.18 Jan Oevermann (DFKI), SEMANTiCS 2018, Vienna
Chunking 8 13.09.18 Jan Oevermann (DFKI), SEMANTiCS 2018, Vienna
Chunking / Classification 9 13.09.18 Jan Oevermann (DFKI), SEMANTiCS 2018, Vienna
10 Range finding 13.09.18 Jan Oevermann (DFKI), SEMANTiCS 2018, Vienna
Metadata generation 11 https://iirds.org/ 13.09.18 Jan Oevermann (DFKI), SEMANTiCS 2018, Vienna
12 Metadata generation 13.09.18 Jan Oevermann (DFKI), SEMANTiCS 2018, Vienna
13 Application Live demo Jan Oevermann (DFKI), SEMANTiCS 2018, Vienna
14 Results 13.09.18 Jan Oevermann (DFKI), SEMANTiCS 2018, Vienna
Outlook & Conclusion 15 Outlook • Other text sorts (e.g. patents) or document types (e.g. Word) • Combination with other techniques (formatting / heuristics) Conclusion • Method relies on text and is formatting-independent • No splitting of PDF, just additional metadata • Good results in detecting semantic segments • Identified ranges can be provided in a standardized format 13.09.18 Jan Oevermann (DFKI), SEMANTiCS 2018, Vienna
16 Contact Jan Oevermann Code & Demo jan.oevermann@dfki.de github.com/j-oe/segments www.janoevermann.de segments.fastclass.de 13.09.18 Jan Oevermann (DFKI), SEMANTiCS 2018, Vienna

Recommend

Semantic segmentation Image classification Object detection Semantic segmentation Evolution

Accel : A Corrective Fusion Network for Efficient Semantic Segmentation on Video Samvit Jain , Xin Wang , Joseph Gonzalez RISE Lab, UC Berkeley Semantic segmentation Image classification Object detection Semantic segmentation Evolution

857 views • 13 slides

Pixel-Level Im Image Understanding wit ith Semantic Segmentation and Panoptic Segmentation

Pixel-Level Im Image Understanding wit ith Semantic Segmentation and Panoptic Segmentation Hengshuang Zhao The Chinese University of Hong Kong May 29, 2019 Part I: I: Semantic Segmentation Semantic Segmentation background car person

510 views • 39 slides

Segmentation Bottom-up Segmentation Semantic / instance segmentation Many Slides from L.

Segmentation Bottom-up Segmentation Semantic / instance segmentation Many Slides from L. Lazebnik. Outline Bottom-up segmentation Superpixel segmentation Semantic segmentation Metrics Architectures

1.28k views • 49 slides

Semantic Segmentation / Instance Segmentation Based on Deep learning Yiding Liu 2018.12.08

Semantic Segmentation / Instance Segmentation Based on Deep learning Yiding Liu 2018.12.08 Outline Overview of segmentation problem Semantic segmentation Instance Segmentation Our work Definition of segmentation problem Image

900 views • 54 slides

Context For Semantic Segmentation Gang Yu Collaborators Changqian Yu

Context For Semantic Segmentation Gang Yu Collaborators Changqian Yu Jingbo Wang Chao Peng Xiangyu Zhang Changxin Gao Nong Sang Gang Yu Jian Sun Outline Revisit Semantic Segmentation Context for Semantic

774 views • 43 slides

Learning Deep Structured Models for Semantic Segmentation Guosheng Lin Semantic Segmentation

Learning Deep Structured Models for Semantic Segmentation Guosheng Lin Semantic Segmentation Outline Exploring Context with Deep Structured Models Guosheng Lin, Chunhua Shen, Ian Reid, Anton van dan Hengel; Efficient Piecewise Training

992 views • 45 slides

An Overview of Semantic Image Segmentation with Deep Learning Simone Bonechi Outline

An Overview of Semantic Image Segmentation with Deep Learning Simone Bonechi Outline Semantic Image Segmentation Deep Network for Semantic Segmentation FCN (Fully Convolutional Neural Network) DeconvNet PSPNet (Pyramid Scene

746 views • 22 slides

Budget-aware Semi-Supervised Semantic and Instance Segmentation Miriam Bellver, Amaia Salvador,

Budget-aware Semi-Supervised Semantic and Instance Segmentation Miriam Bellver, Amaia Salvador, Jordi Torres, Xavier Giro-i-Nieto Women In Computer Vision - CVPR 2019 Motivation Semantic segmentation Instance segmentation Pixel-level

282 views • 15 slides

LID Challenge: Weakly Supervised Semantic Segmentation 3d place solution NoPeopleAllowed: The 3

LID Challenge: Weakly Supervised Semantic Segmentation 3d place solution NoPeopleAllowed: The 3 step approach to weakly supervised semantic segmentation Mariia Dobko, Ostap Viniavskyi, Oles Dobosevych UCU & SoftServe team The Machine

771 views • 20 slides

A new metric for evaluating semantic segmentation: leveraging global and contour accuracy Eduardo

Introduction Semantic segmentation Accuracy evaluation Conclusions A new metric for evaluating semantic segmentation: leveraging global and contour accuracy Eduardo Fernandez-Moral 1 , Renato Martins 1 , Denis Wolf 2 , and Patrick Rives 1 1

786 views • 40 slides

Temporally Distributed Networks for Fast Video Semantic Segmentation Ping Hu 1 Fabian Caba

Temporally Distributed Networks for Fast Video Semantic Segmentation Ping Hu 1 Fabian Caba Heilbron 2 Oliver Wang 2 Zhe Lin 2 Stan Sclaroff 1 Federico Perazzi 2 1 Boston University 2 Adobe Research Challenge Video Semantic Segmentation frame

984 views • 11 slides

Semantic Image Segmentation and Web-Supervised Visual Learning Florian Schroff Andrew Zisserman

Semantic Image Segmentation and Web-Supervised Visual Learning Florian Schroff Andrew Zisserman University of Oxford, UK Antonio Criminisi Microsoft Research Ltd, Cambridge, UK Outline Part I: Semantic Image Segmentation Goal:

794 views • 63 slides

Learning Deconvolution Network for Semantic Segmentation Hyeonwoo Noh, Seunghoon Hong, Bohyung

Learning Deconvolution Network for Semantic Segmentation Hyeonwoo Noh, Seunghoon Hong, Bohyung Han Mehmet Gnel What is this paper about? A novel semantic segmentation algorithm Convolution & Deconvolution layers Fully

1.03k views • 37 slides

Lecture: Segmentation Juan Carlos Niebles and Ranjay Krishna Stanford Vision and Learning Lab

Semantic Segmentation Lecture: Segmentation Juan Carlos Niebles and Ranjay Krishna Stanford Vision and Learning Lab 17-Oct-2019 1 St Stanfor ord University CS 131 Roadmap Semantic Segmentation Pixels Segments Images Videos Web Neural

744 views • 59 slides

Deep learning 8.4. Networks for semantic segmentation Fran cois Fleuret

Deep learning 8.4. Networks for semantic segmentation Fran cois Fleuret https://fleuret.org/ee559/ Nov 2, 2020 The historical approach to image segmentation was to define a measure of similarity between pixels, and to cluster groups of

222 views • 18 slides

SEMANTIC IMAGE SEGMENTATION WITH DEEP CONVOLUTIONAL NETS AND FULLY CONNECTED CRFS Paper by Chen,

SEMANTIC IMAGE SEGMENTATION WITH DEEP CONVOLUTIONAL NETS AND FULLY CONNECTED CRFS Paper by Chen, Papandreou, Kokkinos, Murphy, Yuille Slides by Josh Kelle (with graphics from the paper) Semantic Segmentation Goal: Partition the image into

520 views • 22 slides

Deep Watershed Transform for Instance Segmentation Min Bai & Raquel Urtasun To appear at

Deep Watershed Transform for Instance Segmentation Min Bai & Raquel Urtasun To appear at IEEE CVPR 2017 in Hawaii Presented at NVIDIA GTC 2017 Semantic Segmentation Input: RGB Image Output at each pixel: Semantic label

465 views • 28 slides

Exploring Spatial Context for 3D Semantic Segmentation of Point Clouds Francis Engelmann*

Exploring Spatial Context for 3D Semantic Segmentation of Point Clouds Francis Engelmann* Theodora Kontogianni* Alexander Hermans Bastian Leibe Exploring Spatial Context for 3D Semantic Segmentation of Point Clouds Problem Statement Input

140 views • 10 slides

GCN INTRODUCTION AND ITS APPLICATION IN 3D POINT CLOUD SEMANTIC SEGMENTATION Yisong Li (NVIDIA),

GCN INTRODUCTION AND ITS APPLICATION IN 3D POINT CLOUD SEMANTIC SEGMENTATION Yisong Li (NVIDIA), Guohao Li (KAUST) Grid Data vs General Graphs CNN vs GCN ResGCN OUTLINE Experiments on 3D Cloud Point Segmentation Sequential

1.31k views • 102 slides

Rich feature hierarchies for accurate object detection and semantic segmentation Ross Girshick, Je

Rich feature hierarchies for accurate object detection and semantic segmentation Ross Girshick, Je ff Donahue, Trevor Darrell, Jitendra Malik UC Berkeley Tech Report @ http://arxiv.org/abs/1311.2524 Detection & Segmentation input

704 views • 37 slides

Image Segmentation Machine Learning Study Group Presented by Yaochen Xie Jan 25, 2018 Outline

Image Segmentation Machine Learning Study Group Presented by Yaochen Xie Jan 25, 2018 Outline Overview Three Levels of Segmentation Basic Segmentation Semantic Segmentation (FCN, DeepLab) Instance Segmentation (Mask-RCNN)

449 views • 28 slides

and Background for Semantic Segmentation Yu Liu and Michael S. Lew Leiden Institute of Advanced

IEEE International Conference on Image Processing (ICIP 2017), Beijing, China Improving the Discrimination Between Foreground and Background for Semantic Segmentation Yu Liu and Michael S. Lew Leiden Institute of Advanced Computer Science,

470 views • 34 slides

Efficient Semantic Segmentation using Gradual Grouping Nikitha Vallurupalli 1 , Sriharsha

Efficient Semantic Segmentation using Gradual Grouping Nikitha Vallurupalli 1 , Sriharsha Annamaneni 1 , Girish Varma 1 , C V Jawahar 1 , Manu Mathew 2 , Soyeb Nagori 2 IIIT Hyderabad 1 , TI Bangalore 2 Image / annotation from cityscapes dataset

309 views • 13 slides

Optimizing the Relevance-Redundancy Tradeoff for Efficient Semantic Segmentation Caner Hazrba

Optimizing the Relevance-Redundancy Tradeoff for Efficient Semantic Segmentation Caner Hazrba Joint work with Julia Diebold and Daniel Cremers Optimizing the Relevance-Redundancy Tradeoff for Efficient Semantic Segmentation Caner Hazrba

794 views • 21 slides