Data Analysis and Map-Reduce with MongoDB and pymongo

Feb 18, 2024


Data Analysis and Map-Reduce with MongoDB and pymongo Alexander C. S. Hendorf, EuroPython 2015, Bilbao @opotoc
Alexander C. S. Hendorf
• Mannheim, Germany
• IT is my 'second career'
• developer @ my own company, opotoc IT GmbH
• mongoDB MUG organiser
• speaker, sometimes trainer
• EP2015 program WG co-chair
Today
1. mongoDB / document-oriented database
2. What's the mongoDB aggregation framework?
3. Pipeline model
4. Pipeline stages
5. Map Reduce in mongoDB
some live demos
Document oriented databases in 15 seconds
• a database holds collections
• a collection holds documents
• a document is a JSON-like object, e.g. { "_id": 1, "say": "Hello" }
• no schema enforced
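As a rough illustration of the document model, a document maps naturally onto a Python dict when working with pymongo. The sketch below makes several assumptions: the second document and its `lang` field are invented for illustration, `insert_examples` is a hypothetical helper, and `collection` is assumed to be a pymongo `Collection` obtained elsewhere.

```python
# Two documents destined for the same collection. No schema is enforced,
# so the second document can carry a field the first one lacks.
docs = [
    {"_id": 1, "say": "Hello"},               # the document shown on the slide
    {"_id": 2, "say": "Hola", "lang": "es"},  # hypothetical extra field
]


def insert_examples(collection, documents=docs):
    """Insert the example documents and return their _id values.

    `collection` is assumed to be a pymongo Collection, for example
    MongoClient()["demo"]["greetings"] (hypothetical names).
    """
    result = collection.insert_many(documents)
    return result.inserted_ids
```

Because the helper only relies on `insert_many`, it can be tried against any running MongoDB instance without preparing the collection first.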
mongoDB aggregation framework
• introduced with mongoDB 2.2 in 2012
• framework for data aggregation
• documents enter a multi-stage pipeline that transforms them into aggregated results
• it's designed to be 'straightforward'
• aggregation pipeline operations have an optimization phase which attempts to reshape the pipeline for improved performance
Pipeline is like a relay race
• stages: $match, $project, $group
• get the baton, do something smart, present nicely
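A minimal sketch of such a pipeline in pymongo could look as follows. It is not taken from the talk's demos: the field names `year` and `country`, the helper `count_playlists_per_country`, and the idea of counting playlists per country are assumptions made for illustration. Only the three stage operators come from the slide.

```python
# Each stage receives the documents produced by the previous stage,
# like runners handing over a baton.
pipeline = [
    # $match: filter early so later stages see fewer documents.
    {"$match": {"year": 2015}},                       # hypothetical field
    # $project: pass on only the field the next stage needs.
    {"$project": {"_id": 0, "country": 1}},           # hypothetical field
    # $group: aggregate, here by counting documents per country.
    {"$group": {"_id": "$country", "playlists": {"$sum": 1}}},
]


def count_playlists_per_country(collection):
    """Run the pipeline and return the result documents as a list.

    `collection` is assumed to be a pymongo Collection; aggregate()
    returns a cursor, which list() consumes.
    """
    return list(collection.aggregate(pipeline))
```

Placing `$match` first keeps the amount of data flowing through the later stages small, which matters for a collection of this size.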
• mongoDB 3.0
• WiredTiger storage engine
• driver: pymongo
• dataset: 37 GB, ~9 GB when compressed with WiredTiger
• collection of playlists from the iTunes Music Store
• playlists that appeared in some chart at some time in the past 3 years, somewhere around the world
