data integration of legacy erp system based on ontology
play

Data Integration of Legacy ERP System based on Ontology Learning - PowerPoint PPT Presentation

Data Integration of Legacy ERP System based on Ontology Learning from SQL Scripts Chuangtao Ma machuangtao@caesar.elte.hu Department of Information Systems, Etvs Lornd University This PhD project was guided by Dr. Molnr Blint Outline


  1. Data Integration of Legacy ERP System based on Ontology Learning from SQL Scripts Chuangtao Ma machuangtao@caesar.elte.hu Department of Information Systems, Eötvös Loránd University This PhD project was guided by Dr. Molnár Bálint

  2. Outline • Introduction & Motivation • Research Question Statement • Related Work • Proposed Solutions & Plan • Conclusion & Future Work

  3. Introduction & Motivation Legacy ERP system is a kind of enterprise management systems, were developed in several decades ago, that is no longer being enhanced. Unfriendly user Diversity of Code Interface Language & platform • Character-based user • C++, Java, Php, etc . • interface. Delphi, IBM AS/400, • Visual Studio, etc. Unavailability of the data- access interface. Outdated Various DBMS Technology • Visual FoxPro, Access, • Obsolete hardware. SQL Server, Oracle, • Poor modularity. MySQL, etc.

  4. Introduction & Motivation An increasing number of the enterprise decide to build the BI system for responding the dynamic business environment. Data mining & analysis 01 • Build the centralized data warehousing. • Integration of the heterogenous data from existing system. Business Process Reengineering(BPR) 02 • Redesign and improve the business process and architecture of the ERP. • Improve data dissemination and decision making based on advanced technology.

  5. Introduction & Motivation Replace all of the existing Modernize and Integrate the legacy ERP system existing legacy ERP system Pros Pros - Save the costs, effort, and time. - Advanced technologies and system. - Reduce the risk of the project. - Unified and centralized data center . - Cons Cons - Invest more budget and time. - Limited performance of the BI system. - - Potential risk for upgrading. - Put more effort to integrate legacy system. -

  6. Research Question Statement • It is a challengeable task to integrate the legacy ERP system efficiently and effectively, since the diversity of code language, various DBMS and unavailability of APIs. How to achieve the unified description of Q1: Unified description business processes (BP) and efficient of Business Process integration among different sub-organizations? How to access and integrate the data from the various DBMS of legacy ERP systems efficiently? Q2: Data Integration How to check the consistency of the ontologies and evaluate the correctness of the integration Q3: Result Evaluation results?

  7. Related Work  Data access technology of legacy system Common Business- Knowledge Discovery Ontology-based Data Oriented meta-Model (KDM) Access(OBDA) Language(COBOL) • Pérez-Castillo, et al, • Calvanese, D, • Millham, R, et et al al (2011) propose KDM (2017) introduced (2009) employed to represent the the OBDA technology the COBOL to access artifacts of legacy to extract the log the data from the systems as entities, data from legacy relational database relationships and information. of legacy systems. attributes. • However, ontology-based data access from distributed data- source still requires the data access interface to be available.

  8. Related Work  Ontology learning & Knowledge Extraction Extraction algorithm based on Common RDF Model 2009 Extraction algorithm based on RDF (Resource Description Framework) were designed to extract the knowledge from legacy systems. Process Mining & Sequential Pattern Mining 2011 A dynamic knowledge extraction approach based on process mining and sequential pattern mining are proposed respectively . Ontology Learning 2014 Semi-automated generation ontology approach from existing textual documents based on ontology learning is proposed . • However, the knowledge extraction based on ontology learning is still in the early phase to be explored.

  9. Related Work  Semantic data integration Ontology-based semantic integration (OBA-SI) Data integration The heterogeneous data was integrated by The traditional data the semantic mapping of the ontologies. integration approaches, including , rule-based, middleware framework, Linked data based semantic data integration and so forth. A semantic integration approach exploiting linked data are presented to achieve RDF data integration based on query rewriting. • For the ontology-based semantic integration, the efficiency of the semantic integration is limited by the efficiency and quality of the constructed ontology.

  10. Proposed Solutions & Plan • This PhD project focus on the integration of legacy ERP system based on ontology learning framework. • Unified description of the business Business process. Process • Integration of the business process. Integration Efficient and • Ontology learning model from SQL effectiveness scripts. Data Integration of the legacy ERP systems Integration • Semantic data integration based on ontology.

  11. Proposed Solutions & Plan Integrated Business Business Business Business Process Process 1 Process 2 Process i Business Process level Entity 1 Entity 2 Entity i PK PK_ID1 PK PK_ID2 PK PK_ID i Data Integration FK_BID2 FK_BID2 FK_BID i level Integrated Legacy ERP System Database Database 1 Database 2 Database i System Integration level Legacy ERP System Legacy ERP System Legacy ERP system Integrated Sub-ogranization 1 Sub-ogranizaiton 2 Sub-ogranizaiton i ERP System

  12. Proposed Solutions & Plan  Unified description & integration of business process • The description logic language will be used to achieve the unified description and representation of the business process. • The ontology alignment technology will be adopted to achieve the integration of the business process. 1. Jung, J. J. (2009) ‘Semantic business process integration based on ontology alignment’, Expert Systems with Applications. Els evier Ltd, 36(8), pp. 11013 – 11020. doi: 10.1016/j.eswa.2009.02.086.

  13. Proposed Solutions & Plan  Data integration based on ontology learning • The input of the ontology learning model is SQL scripts document and the output is the corresponding ontologies and knowledge graph. Data Access Knowledge Extraction & Ontology Generation Import Pre- Scripts Processing Export Documents LS1 Database SQL Scripts 1 from DBMS Store in Extract SQL Scripts 2 LS2 Database Graph DB Terms Evaluate Export Extract from Ontology Relationship LS i Database SQL Scripts i Quality DBMS Store as Generate RDF Triples OWL LS n Database SQL Scripts n

  14. Proposed Solutions & Plan  Data integration based on ontology learning • Ontology-based semantic data integration. • The heterogenous data will be integrated based on the interoperability of the ontologies and knowledge graph, the specific demo of data integration is depicted. Num Order Order Is_Abbr_of Number Transaction Is_Property_of PK Order_id PK Order_id Is_Property_of Is_Synonym_with Is_Abbr_of Order FK User_id FK Customer_id Service Place_of Is_Property_of Tran Is_Sub_of Product_id Good_id Customer Is_Purchased_by Good Number Administrator Tran_Num Is_Sub_of Is_Sub_of Is_Sub_of Is_Purchased_by Price Price Product User Vendors Is_Sub_of Order_state Tran_state

  15. Proposed Solutions & Plan  Legacy ERP system integration • Evaluate the accuracy of integration result and integrate legacy ERP system for achieving the centralized management and decision. Entity 1 Entity 2 PK PK_ID1 FK_BID2 • Check the consistency of FK_BID1 Legacy ERP System 1 Legacy ERP System2 PK PK_ID2 the ontologies generated Ontology Learning Ontology Ontology term term by ontology learning from extraction extraction SQL scripts. Ontology matching • Evaluate the semantic O 11 ,O 12 , O 13 O 1m O 2 1 ,O 22 , O 23 O 2n Integrated Entity accuracy of the data PK PK_ID FK_BID integration. Integrated Integrated database ERP system

  16. Conclusion & Future Work In this project, the architecture of the ontology learning framework was proposed to integrate heterogenous data from various legacy ERP systems efficiently. 1 The approach for generating ontologies by ontology learning from the relational database SQL scripts is proposed, and the specified steps of the ontology learning are illustrated. 2

  17. Conclusion & Future Work • This project is in its initial exploration phase, there are some works that should be investigated and conducted in the future. 1 Knowledge extraction algorithm from SQL scripts • The knowledge extraction algorithm based on NLP will be designed to extract the knowledge from SQL scripts. Ontology generation approaches from RDF schema 2 • Ontology generation approaches from RDF schema will be studied to generate the ontology for the integration of heterogeneous data. 3 Design the tools to support data integration • The data integration tool based on ontology learning from SQL scripts will be designed and developed to support the data integration.

  18. I would welcome any question and suggestion.

Recommend


More recommend