MongoDB By Bharath Subramanyam Relational Databases started to - PowerPoint PPT Presentation

MongoDB By Bharath Subramanyam

Relational Databases
• Relational databases became popular in the 1980s and have been widely used since then.
• Properties of relational databases:
  • Fixed schema
  • High-level query language (SQL)
  • ACID properties
  • Primitive data partitioning technology

Problems with Relational Databases
• Not designed to run on clusters; difficult to scale horizontally.
• Impedance mismatch between relational tables and the in-memory structures an application works with.

Impedance Mismatch

NoSQL
• Term popularized by Johan Oskarsson as a Twitter hashtag (#nosql)
• Characteristics:
  • Non-relational
  • Cluster friendly
  • Schemaless
  • Mostly open source
  • Simple APIs and no joins

Data Model
• Key-value store (like a persistent hashmap)
• Document models (MongoDB): can group things into natural aggregates
• Column family (get the data with the row key and column family name)
• Graph models

Document Data Model
• Database
• Collections (tables)
• Documents (rows)
• Fields (columns)
• _id field (primary key)
• Document formats: JSON or XML
• MongoDB uses the JSON format (stored internally as BSON, a binary encoding of JSON)

JSON Format
Basic constructs:
• Base value = Boolean, int, String, ...
• Object = {}
• Array = []
Example:
    {
      "id": 1200,
      "customerName": "Brad",
      "lineItems": [
        {"productId": 501, "qty": 5},
        {"productId": 553, "qty": 2}
      ]
    }

JSON
• JSON object: a set of unordered elements
• Elements: key/value pairs
• Keys must be unique within an object
• Values can contain objects
• Empty value: null, [] (or simply omit the element)

JSON
• Each MongoDB document in a collection must have a unique identifier (the _id field).
• Documents can be referenced using this unique identifier.
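The unique-identifier rule can be sketched in plain JavaScript, without a MongoDB server. The `TinyCollection` class and its `insert` and `findById` methods are hypothetical names used only for illustration; they are not the MongoDB driver API. MongoDB itself generates an _id when one is omitted, whereas this sketch simply rejects such documents.

```javascript
// Toy in-memory collection: every document needs a unique _id.
class TinyCollection {
  constructor() {
    this.docs = new Map(); // _id -> document
  }

  insert(doc) {
    if (doc._id === undefined) {
      throw new Error("document needs an _id");
    }
    if (this.docs.has(doc._id)) {
      throw new Error(`duplicate _id: ${doc._id}`);
    }
    this.docs.set(doc._id, doc);
  }

  findById(id) {
    return this.docs.has(id) ? this.docs.get(id) : null;
  }
}

const customers = new TinyCollection();
const orders = new TinyCollection();
customers.insert({ _id: 7, name: "Brad" });
// The order refers to the customer by its identifier.
orders.insert({ _id: 1200, customerId: 7 });

const order = orders.findById(1200);
const customer = customers.findById(order.customerId);
console.log(customer.name); // Brad
```

Inserting a second customer with `_id: 7` would throw, which is the behaviour the first bullet describes.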

Mapping Relational Data to JSON

Mapping JSON to Relational DB

Aggregates
• In OOP, Orders and Line Items are created as different classes.
• However, an Order and its Line Items can be considered one unit.
• In relational databases, the values are spread across different tables.
• Document databases save this data as a single unit.
• It is easier to move this single unit back and forth (you get to store your aggregate in one place instead of it being spread across the cluster).
• (Diagram: an Orders aggregate containing Line Items.)
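A minimal sketch of the contrast, reusing the order from the JSON Format slide. The table arrays and the `loadOrderAggregate` helper are hypothetical names for illustration; no real database is involved.

```javascript
// Relational layout: the order and its line items live in separate "tables".
const ordersTable = [{ id: 1200, customerName: "Brad" }];
const lineItemsTable = [
  { orderId: 1200, productId: 501, qty: 5 },
  { orderId: 1200, productId: 553, qty: 2 },
];

// Rebuilding the aggregate requires a join on orderId.
function loadOrderAggregate(orderId) {
  const order = ordersTable.find((o) => o.id === orderId);
  if (!order) return null;
  const lineItems = lineItemsTable
    .filter((li) => li.orderId === orderId)
    .map(({ productId, qty }) => ({ productId, qty }));
  return { ...order, lineItems };
}

// Document layout: the same aggregate is already a single unit.
const orderDocument = {
  id: 1200,
  customerName: "Brad",
  lineItems: [
    { productId: 501, qty: 5 },
    { productId: 553, qty: 2 },
  ],
};

const rebuilt = loadOrderAggregate(1200);
console.log(JSON.stringify(rebuilt) === JSON.stringify(orderDocument)); // true
```

Both layouts hold the same information; the document form just skips the reassembly step.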

A Problem with the Document Database
• You want to query based on product as the aggregate.
• You would have to run a MapReduce job.
• Problematic when you have to slice and dice your data.
• (Diagram: Orders, Line Item, Product.)
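The map and reduce steps can be sketched in plain JavaScript to show why a per-product view takes extra work when data is stored per order. This is not MongoDB's own mapReduce command; `orderDocs`, `mapOrder`, and `reduceByProduct` are hypothetical names, and the second order is made-up sample data.

```javascript
// Orders stored as aggregates; products are buried inside line items.
const orderDocs = [
  { id: 1200, lineItems: [{ productId: 501, qty: 5 }, { productId: 553, qty: 2 }] },
  { id: 1201, lineItems: [{ productId: 501, qty: 1 }] },
];

// Map step: emit one (productId, qty) pair per line item.
function mapOrder(order) {
  return order.lineItems.map((li) => [li.productId, li.qty]);
}

// Reduce step: sum the quantities emitted for each product.
function reduceByProduct(pairs) {
  const totals = new Map();
  for (const [productId, qty] of pairs) {
    totals.set(productId, (totals.get(productId) || 0) + qty);
  }
  return totals;
}

const qtyByProduct = reduceByProduct(orderDocs.flatMap(mapOrder));
console.log(Object.fromEntries(qtyByProduct)); // { '501': 6, '553': 2 }
```

Every order has to be visited to answer a question about one product, which is the cost the slide points at.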

Replicas
• Why replication?
  • High availability of data and no downtime
  • Disaster recovery
• A replica set is a group of two or more nodes.
• In a replica set, one node is the primary and the remaining nodes are secondaries.
• All data replicates from the primary to the secondary nodes.
• During automatic failover or maintenance, an election is held and a new primary node is elected.
• After a failed node recovers, it rejoins the replica set and works as a secondary node.
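The failover sequence above can be walked through with a toy model. This sketch is deliberately simplified: it promotes the first healthy node, whereas a real replica set elects a primary by voting among its members. `ToyReplicaSet` and all of its methods are hypothetical names.

```javascript
class ToyReplicaSet {
  constructor(names) {
    // The first node starts as primary; the rest are secondaries.
    this.nodes = names.map((name, i) => ({
      name,
      role: i === 0 ? "primary" : "secondary",
      up: true,
      data: [],
    }));
  }

  primary() {
    return this.nodes.find((n) => n.up && n.role === "primary") || null;
  }

  // Writes go to the primary and are copied to every healthy secondary.
  write(value) {
    const p = this.primary();
    if (!p) throw new Error("no primary available");
    p.data.push(value);
    for (const n of this.nodes) {
      if (n !== p && n.up) n.data.push(value);
    }
  }

  // Mark a node as failed; if it was the primary, promote a healthy node.
  fail(name) {
    const node = this.nodes.find((n) => n.name === name);
    node.up = false;
    if (node.role === "primary") {
      node.role = "secondary";
      const next = this.nodes.find((n) => n.up);
      if (next) next.role = "primary";
    }
  }

  // A recovered node rejoins as a secondary and copies the primary's data.
  recover(name) {
    const node = this.nodes.find((n) => n.name === name);
    node.up = true;
    node.role = "secondary";
    const p = this.primary();
    if (p) node.data = [...p.data];
  }
}

const rs = new ToyReplicaSet(["a", "b", "c"]);
rs.write("v1");
rs.fail("a"); // "b" takes over as primary
rs.write("v2");
rs.recover("a"); // "a" comes back as a secondary and catches up
console.log(rs.primary().name); // b
```

After the run, all three nodes hold both writes and the original primary is serving as a secondary.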

Replicas

Sharding
• Sharding is the process of breaking the data into pieces and storing them across multiple machines.

Sharding
• Shard: where the collection data is actually stored. A shard is a replica set.
• Config servers: track which servers hold which parts of a sharded collection. Older MongoDB releases used exactly three config servers; since MongoDB 3.4, config servers are deployed as a replica set.
• Query routers: process each operation, target it to the relevant shards, and return the results to the client.
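How a router targets a shard can be sketched with a simple hash on a shard key. This is an assumption-laden toy: it uses a modulo hash over a fixed shard list, while MongoDB tracks chunk ownership in its config servers and supports both ranged and hashed shard keys. `shardNames`, `hashKey`, `targetShard`, `insertDoc`, and `findByCustomer` are hypothetical names, and `customerName` is chosen as the shard key only for illustration.

```javascript
// Toy "config server" state: the list of shards in the cluster.
const shardNames = ["shard0", "shard1", "shard2"];
const shardData = new Map(shardNames.map((name) => [name, []]));

// Simple deterministic string hash (illustrative only).
function hashKey(key) {
  let h = 0;
  for (const ch of String(key)) {
    h = (h * 31 + ch.codePointAt(0)) >>> 0;
  }
  return h;
}

// Toy "query router": pick the shard that owns a given shard-key value.
function targetShard(shardKeyValue) {
  return shardNames[hashKey(shardKeyValue) % shardNames.length];
}

function insertDoc(doc) {
  shardData.get(targetShard(doc.customerName)).push(doc);
}

function findByCustomer(customerName) {
  // Only the owning shard is consulted, not the whole cluster.
  return shardData
    .get(targetShard(customerName))
    .filter((d) => d.customerName === customerName);
}

insertDoc({ id: 1200, customerName: "Brad" });
insertDoc({ id: 1201, customerName: "Ann" });
insertDoc({ id: 1202, customerName: "Brad" });
console.log(findByCustomer("Brad").map((d) => d.id)); // [ 1200, 1202 ]
```

Because the same key always hashes to the same shard, a query on the shard key touches one shard instead of all of them.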

Consistency
• Relational databases: ACID (Atomicity, Consistency, Isolation, Durability)
• Aggregate databases: a transaction within an aggregate is ACID.
• Two types of consistency issues:
  • Logical consistency
  • Replication consistency

Logical Consistency
• (Diagram: two users each get version v101 of the same data from the DB server through the application server, and each then posts an update as v102.)
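The scenario on this slide, two users reading v101 and both posting v102, can be reproduced in a few lines. The sketch also shows one common way to detect the conflict, a version check on update; that remedy is an assumption added for illustration and is not stated on the slide. `record`, `get`, and `post` are hypothetical names.

```javascript
// Toy DB record with a version number.
const record = { version: 101, body: "original" };

function get() {
  return { ...record };
}

// Conditional update: succeeds only if the caller saw the current version.
function post(expectedVersion, newBody) {
  if (record.version !== expectedVersion) {
    return { ok: false, currentVersion: record.version };
  }
  record.version += 1;
  record.body = newBody;
  return { ok: true, currentVersion: record.version };
}

// Both users read v101 before either one writes.
const seenByUser1 = get();
const seenByUser2 = get();

const first = post(seenByUser1.version, "edit from user 1");
const second = post(seenByUser2.version, "edit from user 2");

console.log(first.ok, second.ok, record.version); // true false 102
```

Without the version check, the second post would silently overwrite the first, which is the logical inconsistency the diagram illustrates.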

Replication Consistency
• Example: two people booking the same hotel room.

CAP Theorem
• Choose only two of:
  • Consistency
  • Availability
  • Partition tolerance
• There are levels of availability and consistency.
• (Diagram: CAP triangle with corners C, A, and P, and MongoDB marked on it.)

Generic MongoDB Query
• db.collection.find({query}, {projection})
• E.g. db.posts.find({"author" : "Dan Sullivan"}, {"title" : 1})
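What the two arguments do can be sketched with a small stand-in function in plain JavaScript. The `find` helper below is hypothetical and far narrower than the real `db.collection.find`: it supports only equality matching and inclusion projection, always keeps `_id`, and returns an array rather than a cursor. The `posts` array is made-up sample data.

```javascript
// Hypothetical sample data standing in for the posts collection.
const posts = [
  { _id: 1, author: "Dan Sullivan", title: "First post", body: "..." },
  { _id: 2, author: "Someone Else", title: "Second post", body: "..." },
  { _id: 3, author: "Dan Sullivan", title: "Third post", body: "..." },
];

// query: fields that must match exactly; projection: fields to return.
function find(docs, query = {}, projection = null) {
  const matches = docs.filter((doc) =>
    Object.entries(query).every(([field, value]) => doc[field] === value)
  );
  if (!projection) return matches;
  return matches.map((doc) => {
    const out = { _id: doc._id }; // _id is always kept in this sketch
    for (const [field, include] of Object.entries(projection)) {
      if (include && field in doc) out[field] = doc[field];
    }
    return out;
  });
}

const result = find(posts, { author: "Dan Sullivan" }, { title: 1 });
console.log(result);
```

The query argument narrows which documents come back, and the projection narrows which fields of each document are returned.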

Thank You!
