improving manufacturing plants
play

Improving Manufacturing Plants Through Big Data Analytics SDS2019 - PowerPoint PPT Presentation

Improving Manufacturing Plants Through Big Data Analytics SDS2019 14 June 2019 Prof. Dr. Kurt Stockinger Martin Weber Marc Schni Agenda Use Case Goals Architecture Blueprint Experiment Conclusions Evaluation


  1. Improving Manufacturing Plants Through Big Data Analytics SDS2019 14 June 2019 Prof. Dr. Kurt Stockinger Martin Weber Marc Schöni

  2. Agenda • Use Case • Goals • Architecture Blueprint • Experiment • Conclusions • Evaluation • Q&A / Discussion

  3. Agenda • Use Case • Goals • Solution Architecture • Experiment • Conclusions • Evaluation • Q&A / Discussion

  4. About Midor • Founded in 1928 • Located in Meilen ZH • 600 Employees • Part of M-Industry • Produce 250’000 items daily for Migros und others • 32 production lines, 940 different products (different biscuits, ice cream, snacks, dessert powder, etc.)

  5. Introduction • Production line 16 produces the Blévita, one of Midors sales hits • Short production stops are reducing the output • What causes these disturbances? • Can they be predicted?

  6. Line 16 Batter mixing Oven Slots 1-9 Packaging & Labelling

  7. Agenda • Use Case • Goals • Solution Architecture • Experiment • Conclusions • Evaluation • Q&A / Discussion

  8. Goals Qualitative Goals Quantitative Goals 1. Improve efficiency 1. Find the most relevant features causing the 2. Flexible and scalable disturbances architecture 2. Latency for Inference of 3. Allows processing of various < 5 seconds data formats 3. Cost should be at worst proportional to amount of processed data

  9. Agenda • Use Case • Goals • Solution Architecture • Experiment • Conclusions • Evaluation • Q&A / Discussion

  10. System Context

  11. Conceptual Architecture

  12. Solution Architecture

  13. Solution Architecture

  14. Solution Architecture

  15. Solution Architecture M P M A P X P

  16. Agenda • Use Case • Goals • Solution Architecture • Experiment • Conclusions • Evaluation • Q&A / Discussion

  17. Objective • Over a period of ~1 year data about short production stoppages was collected ( → Label) • Over the same period additional data about orders, climate conditions etc. were captured ( → Features) • Is it possible to find a pattern in these datasets regarding the occurrence of short stops?

  18. Building the dataset y x 1 , x 2 , … x n FAcility Management FAMS.XLSX System (FAMS) 3rd party system Amperes oven: 32A Rel. Humidity: 54% Product: Blévita Gruyère Short stop: Nein BaroHygro.XML Climate System 3rd party system Big Data Analytics Solution „System under Design“ Manufacturing Orders.XML Execution System (MES) 3rd party system Gateway Signal-Stream Production line 3rd party system

  19. Splitting the dataset unbalanced dataset (100%) stops no stops unbalanced verification dataset (10%) Unbalanced dataset for modelling (90%) Balanced dataset Test Train 10x Cross-Validation Model Training

  20. Modelquality Accuracy Precision F1 Score Recall AUC Random Forest .852 .818 .897 .856 .911 Gradient Boosted Tree .847 .814 .892 .852 .909 Logistic Regression .629 .616 .644 .630 .677 Support Vector Machine .612 .692 .588 .636 .659 Naïve Bayes .572 .680 .552 .610 .602

  21. Feature Reduction Comparison of model quality metrics depending on number of features Model quality metrics (see legend) number of features

  22. Important Features

  23. Agenda • Use Case • Goals • Solution Architecture • Experiment • Conclusions • Evaluation • Q&A / Discussion

  24. Findings: Machine Learning • We can predict shorts stops with a F1-Score of about 85% • The integration of different data sources took the most time • It is not possible to a priori estimate the importance of predictors/features per data source → Integrate all data sources • The prediction itself does not provide a business value without additional steps (operationalization)

  25. Conclusion: Data import Time Data source 1 ? ? ? Data source 2 Data source 3 Events Target Dataset (combination)

  26. Conclusions: Architecture • Findings from modelling provide new boundary conditions for the big data architecture: • Number and kind of data sources • Amount of data • Requirements for inference service (Compute, Memory) • Principles of Lambda-Architecture have proven their effectiveness • Benefits of Kapa architecture (single code base) using libraries • Tools for ML-Pipeline export & operationalization are in early stages • Monitoring of data quality is a crucial success factor

  27. Agenda • Use Case • Goals • Solution Architecture • Experiment • Conclusions • Evaluation • Q&A / Discussion

  28. Evaluation Quality Goals Quantity Goals 1. Improve efficiency 1. Find the most relevant Features causing the 2. Flexible and scalable disturbances architecture * 2. Latency for Inference of 3. Allows processing of various < 5 seconds data formats 3. Cost should be at worst proportional to amount of processed data

  29. Agenda • Use Case • Goals • Solution Architecture • Experiment • Conclusions • Evaluation • Q&A / Discussion

Recommend


More recommend