SLDC: an open-source workflow for object detection in - PowerPoint PPT Presentation

SLDC: an open-source workflow for object detection in multi-gigapixel images Romain Mormont, Jean-Michel Begon, Renaud Hoyoux, Rapha¨ el Mar´ ee Systems and modeling, Department of EE & CS, University of Li` ege, Belgium 12th September 2016 1 / 20

Outline 1. Context 2. SLDC Framework How it works Features 3. SLDC at work: thyroid nodule malignancy Thyroid case Cytomine Data Workflow Results 4. Conclusion and future works 2 / 20

Context Microscope slide smeared with thyroid cell samples (15 gigapixels). 3 / 20

Context Microscope slide smeared with core samples (11 gigapixels). 4 / 20

Context Microscope slide smeared with lung cell samples (3 gigapixels). 5 / 20

Context • Huge slides usually analysed manually ! • Machine learning (ML) and image processing (IP) could be used to assist humans • Problems of object detection and classification 6 / 20

SLDC: framework SLDC is an open-source Python framework created for accelerating development of large image analysis workflows. How ? • It encapsulates problem-independent logic (parallelism, memory limitation due to large images handling,. . . ) • It provides a concise way of declaring problem dependant components (segmentation, object classification,. . . ) 7 / 20

SLDC: how it works 8 / 20

SLDC: features • Tile-based processing to avoid loading a full image into memory • Several level of parallelism : tiles, objects, images,... • A customizable logging system providing a rich feedback about the execution • Effortless integration with other Python libraries: scikit-learn (ML), open-cv (IP), PyCuda (GPU),... 9 / 20

SLDC at work: thyroid case Aim: detect cells with inclusion and proliferative architectural patterns 10 / 20

SLDC at work: Cytomine is a web-based environment enabling collaborative multi-gigapixel image analysis. (Website: www.cytomine.be . Mar´ ee & al., Bioinformatics; 2016). 11 / 20

SLDC at work: data • 84 images with size ranging from 4 to 18 gigapixels • 68 annotated images • 5921 labelled annotations made by cytopathologists 1 (a) Annot. per group (b) Annot. per term 1 Team of Pr. Isabelle Salmon, Department of Pathology, Faculty of Medecine, ULB 12 / 20

SLDC at work: data (cont’d) (c) Pattern annot. per group (d) Pattern annot. per term (e) Proliferative (malignant) (f) Normal patterns (benign) 13 / 20

SLDC at work: data (cont’d) (g) Cell annot. per group (h) Cell annot. per term (i) Cells with incl. (malignant) (j) Normal cells (benign) 14 / 20

SLDC at work: workflow 15 / 20

SLDC at work: workflow (cont’d) 16 / 20

SLDC at work: workflow (cont’d) Classification is performed based on the detected object’s crop image using random subwindows and extremely randomized trees 2 . Proliferative vs. normal patterns : Cell with inclusion vs. normal cells : Accuracy: 0.8523 Accuracy: 0.8625 Precision: 0.6310 Precision: 0.8363 Recall: 0.4930 Recall: 0.9493 Normal Inclusion Normal Prolif. Normal 881 62 Normal 158 55 Inclusion 109 106 Prolif. 15 281 2 Mar´ ee et al., Pattern Recognition Letters ; 2016 17 / 20

SLDC at work: results Time (1st pass): 4 min 38 sec Time (2nd pass): 2 min 04 sec Objects found: 18882 Cells found: 17802 Patterns found: 1080 Jobs: 64 Size: 131072 × 57856 Max memory usage: 159.414 Go Time (1st pass): 12 min 24 sec Time (2nd pass): 7 min 10 sec Objects found: 76133 Cells found: 69820 Patterns found: 6313 Jobs: 64 Max memory usage: 179.855 Go Size: 163840 × 95744 18 / 20

Conclusion and future works 1. Framework Production-ready ! Open-source and generic. Still some minor improvements to make (parallelization, dispatching,...) Feel free to use it : https://github.com/waliens/sldc 2. Thyroid workflow: At this point, too many false positives. Need to improve the classifiers and the segmentation procedures 19 / 20

Thank you for your attention ! Any question ? 20 / 20

SLDC: toy example The aim is to detect circles in the following image. As a bonus, we want to know their center color. 21 / 20

SLDC: toy example (cont’d) # Defining a segmenter class CustomSegementer(Segmenter): """All non-black pixels are in an object of interest""" def segment(self, image): return (image > 0).astype(np.uint8) # Defining a dispatching rule class CircleRule(DispatchingRule): """A rule which matches circle polygons""" def evaluate_batch(self, image, polygons): return [circularity(p) > 0.85 for p in polygons] # Defining a polygon classifier class ColorClassifier(PolygonClassifier): """ A classifier which returns the color (greyscale) of the center pixel of the object """ def predict_batch(self, image, polygons): classes = [center_pxl_color(image, p) for p in polygons] probas = [1.0] * len(polygons) return classes, probas 22 / 20

SLDC: toy example (cont’d) # Build the workflow builder = WorkflowBuilder() builder.set_n_jobs(100) builder.set_segmenter(CustomSegementer()) builder.add_classifier(CircleRule(), ColorClassifier(), disp_label="circle") workflow = builder.get() # Process an image results = workflow.process(image) # Go through the detected objects for polygon, dispatch, label, proba in results: print "Detected polygon {}".format(polygon) print "Dispatched by '{}'".format(dispatch) print "Predicted class {}".format(label) print "Probability {}".format(proba) print "" 23 / 20

SLDC: toy example (cont’d) Detected polygon POLYGON ((...)) Dispatched by 'circle' Predicted class 128 Probability 1.0 Detected polygon POLYGON ((...)) Dispatched by 'circle' Predicted class 255 Probability 1.0 24 / 20

SLDC: scalability (a) Evolution of the execution (b) Execution times per giga- times when varying the number pixels. of available processors 25 / 20

SLDC: an open-source workflow for object detection in - PowerPoint PPT Presentation

SLDC: an open-source workflow for object detection in multi-gigapixel images Romain Mormont, Jean-Michel Begon, Renaud Hoyoux, Rapha el Mar ee Systems and modeling, Department of EE & CS, University of Li` ege, Belgium 12th September

Peoplesoft Workflow Peoplesoft Workflow Technology Technology Putting Customer First SOA IT

Object Oriented Object 3 Programming Object 1 Object 2 Object 4 For : COP 3330. Object

Detection, Segmentation Overview Object Detection deer cat Object Detection as Classification

From image classification to object detection Image classification Object detection Image source

Object Detection Sanja Fidler CSC420: Intro to Image Understanding 1 / 48 Object Detection The

Detection of neutral particles detection of neutrons detection of neutrinons detection of low

STAR-CCM+ in your Workflow Bill Jester, CD-adapco STAR-CCM+ in your workflow Contents

Day 8 Workflow Cloud Resource Provisioning Todays Agenda Introduction What is workflow?

workflow: workflow: QSPR = Quantitative Structure Property

A Workflow Workflow for for Retrieving Retrieving Orthologous Orthologous A Promoters and I

Make Money With Open Source What is Open Source? Community Free software vs. open source

AutoML for Object Detection Xiangyu Zhang MEGVII Research 1 AutoML for Advances in AutoML

Lecture 11: Object detection Contains slides from S. Lazebnik, R. Girshick, B. Hariharan 1

and Retrieval Source: H. Jegou Source: H. Jegou Source: H. Jegou Source: H. Jegou Source: H.

Decoupling your Spring boot Microservices With an open source workflow engine Orchestrating

Object-Oriented Databases Object Oriented Databases ODMG Standard Object Model, Object

BigStation: Enable Scalable Real-time Signal Processing in Large MU-MIMO Systems Qing Yang

GAUSS - GEANT4 based simulat ion f or LHCb GEANT4 Workshop 2 Oct ober 2002 W. Pokor ski /

Placement resource view visualization $ openstack resource provider tree balazs.gibizer@est.tech

CS4402-9535: Many-core Computing with CUDA Marc Moreno Maza University of Western Ontario,

Elementary Functions Part 1, Functions Lecture 1.0a, Excellence in Algebra: Exponents Dr. Ken W.

Current progress in higher-order curvature flow Glen Wheeler 6 th October 2020 Asia-Pacific

HCAL uTCA Readout Crate Ethernet GBT links from front-ends 12 AMC Slots Power 1 H C M

iSCSI Requirements draft-haagens-ips-iscsireqs-00.txt Randy Haagens Director, Networked Storage