a stock prediction system using open source software
play

A Stock Prediction System using open-source software Fred Melo - PowerPoint PPT Presentation

A Stock Prediction System using open-source software Fred Melo William Markito fmelo@pivotal.io wmarkito@pivotal.io @fredmelo_br @william_markito It's all about DATA Prediction Data Sources Look for patterns Machine Learning is the answer


  1. A Stock Prediction System using open-source software Fred Melo William Markito fmelo@pivotal.io wmarkito@pivotal.io @fredmelo_br @william_markito

  2. It's all about DATA Prediction Data Sources Look for patterns

  3. Machine Learning is the answer Clustering Genetic Algorithms Neural Networks

  4. Applying Machine Learning Train with historical dataset Apply model to the new input

  5. Why so hard? Hard to scale Hard to make it real-time Hard to add new data sources Why?

  6. Traditional models are reactive and static Store Analytics Data Lake HDFS No real-time information Hard to change ETL based Labor intensive Data-source specific Inefficient

  7. Stream-based, real-time closed-loop analytics are needed In-Memory Data Stream Pipeline Real-Time Data Expert System / Data Lake HDFS Machine Learning Continuous Learning Multiple Data Sources Continuous Improvement Real-Time Processing Continuous Adapting Store Everything

  8. How can it be addressed? Info Look at past trends Neural Network (for similar input) Evaluate current input Analysis Score / Predict

  9. How can it be addressed? Info Filter Neural Network Analysis [ json ]

  10. How can it be addressed? Info Filter Enrich Neural Network Analysis

  11. How can it be addressed? Info Filter Enrich Transform Neural Network Analysis

  12. How can it be addressed? Info Neural Network Filter Enrich Transform Analysis

  13. How can it be addressed? Info Neural Network Filter Enrich Transform Analysis Transform

  14. How can it be addressed? Neural Network Real-time In-Memory Data Grid scoring Train

  15. How can it be addressed? Neural Network In-Memory Data Grid Update Push Front-end

  16. Streaming real-time analytics architecture Other Sources and Destinations Distributed Computing JMS Fast Data Ingest Transform Sink SpringXD Store / Analyze Predict / Machine Learning

  17. Demo Architecture Fast Data HTTP Sink Transform Split Filter Predict Sink HTTP SpringXD Push Machine Learning Extensible Open-Source Fault-Tolerant Horizontally Scalable Dashboard

  18. splitter http-server Simulator geode-json Transformer client splitter http-client tap geode-json obj-to-json client shell - R SpringXD

  19. Data Stream Pipelining SpringXD ANALYZE INGEST / SINK PROCESS • Little or no coding required • Import and invoke PMML jobs • Call Spark, Reactor or RxJava easily • Dozens of built-in connectors • Built-in configurable filtering, • Call Python, R, Madlib and other splitting and transformation • Seamless integration with Kafka, tools Sqoop • Out-of-box configurable jobs for • Built-in configurable counters and batch processing • Create new connectors easily gauges using Spring

  20. Scale-Out and HA Architecture SpringXD XD admin Split Filter Transform Sink Ingest SpringXD Stream Deployment XD Nodes XD Nodes XD Nodes XD Nodes XD Nodes Ingest Split Transform Sink Filter Messaging

  21. Demo Architecture Fast Data HTTP Sink Transform Split Filter Predict Sink HTTP SpringXD Push Machine Learning Extensible Open-Source Fault-Tolerant Horizontally Scalable Dashboard

  22. Geode client-server architecture

  23. Partitioned Regions

  24. Event handling

  25. Demo Architecture Fast Data HTTP Sink Transform Split Filter Predict Sink HTTP SpringXD Push Machine Learning Extensible Open-Source Fault-Tolerant Horizontally Scalable Dashboard

  26. Neural Networks

  27. Neural Networks

  28. Neural Network price(x) medium medium avg (x) avg (x+1) relative strength (x)

  29. Neural Network

  30. Demo Architecture Fast Data HTTP Sink Transform Split Filter Predict Sink HTTP SpringXD Push Machine Learning Extensible Open-Source Fault-Tolerant Horizontally Scalable Dashboard

  31. Demo Time

  32. splitter http-server Simulator geode-json Transformer client splitter http-client tap geode-json obj-to-json client shell - R SpringXD

  33. SpringXD http://projectgeode.org http://projects.spring.io/spring-xd http://www.r-project.org

  34. The demo code is on GitHub! @fredmelo_br @william_markito Follow-up: In-Memory Unconference 
 "A place for all things in-memory: projects, people, ideas, roadmaps, discussions." 
 Location: Hill Country A/B” 
 Weds 4:15pm - 6pm. (after this talk)

Recommend


More recommend