Crowford Crowd Investment Data Portal Group 13 LABBE, Kevin Patrick Joseph MARTYNAVA, Karina THOMPSON, Julien Edward
Topics • Crowdfunding basics • Schema Mapping / ER (Gathering Data) • Data Fusion (Data Analysis) • Data Portal
● Crowdfunding ● Schema mapping / ER ● Data fusion ● Data Portal Crowdfunding
● Crowdfunding ● Schema Crowdfunding ● Data fus ● Data Po • Fund a project by a large number of people • Start-up, Video games, charity…
● Crowdfunding ● Schema Crowdfunding ● Data fus ● Data Po • over 2 billions $ • 100K projects • 10 milion contributors
● Crowdfunding ● Schema Crowford ● Data fus ● Data Po • Gather projects from different sources • Predict if a project will be successful or not
● Crowdfunding ● Schema mapping / ER ● Data Fusion ● Data Portal Schema Mapping / ER Gathering data
● Crowdfunding ● Schema mapping / ER Data Sources ● Data Fusion ● Data Portal
● Crowdfunding ● Schema mapping / ER Data Sources ● Data Fusion ● Data Portal • Same structure • Same theme (fund projects)
Crowdfund data Author(s) Project Idea Packages / Perks (Pen, Video game, Product…) What you get / Money
● Crowdfunding ● Schema mapping / ER Crawling ● Data Fusion ● Data Portal
● Crowdfunding ● Schema mapping / ER Crawling ● Data Fusion ● Data Portal • Use JavaScript to generate project page • Private API that generate JSON • Python script w/ http2 that generate requests
https://www.indiegogo.com/private_api/explore? filter_funding=&filter_percent_funded=&filter_q uick=new&filter_status=&pg_num=2
● Crowdfunding ● Schema mapping / ER Crawling ● Data Fusion ● Data Portal • Have to extract data from websites • 2 Spider bots (crawlers) using Scrapy
Crawling w/ Scrapy • Python Framework for extracting data • Write Spider (crawling bots) • Parse data and extract with xpath • Export data (Schema mapping)
• Initialize the spider
• Download and extract data • Export the item
• Export the item
● Crowdfunding ● Schema mapping / ER Data scheme ● Data Fusion ● Data Portal Author Project Perks / Packages
● Crowdfunding ● Schema mapping / ER Data scheme ● Data Fusion ● Data Portal Author Project Summary Perks / Packages Project
● Crowdfunding ● Schema mapping / ER Data scheme ● Data Fusion ● Data Portal • Project / Author • Project / Perks • Recommendation : Project / Related_Project • ER for multiple authors / perks / projects…
● Crowdfunding ● Schema mapping / ER Results ● Data Fusion ● Data Portal • Our working set • 36 000 projects • 65 000 authors • over 230 000 perks
● Crowdfunding ● Schema mapping / ● Data Fusion ● Data Portal Data fusion And other data analysis
● Crowdfu ● Schema Recommendation ● Data Fusion ● Data Por • Use buzzwords in project description • Use n-grams (word combination) • Similarity measures using Pairwise metrics • Linear kernels • Can be used for data fusion
● Crowdfu ● Schema Success Prediction ● Data Fusion ● Data Por • How much money has been collected • How much time • The average pledge • …
● Crowdfu ● Schema Success Prediction ● Data Fusion ● Data Por Random Forests Logistic regression RESULT 1 RESULT 2 COMPARISON RESULT
● Crowdfunding ● Schema mapping / ER ● Data fusion ● Data Portal Data Portal
● Crowdfunding ● Schema mappin Goal ● Data fusion ● Data Portal Browse successful projects
DataBase PostgreSQL
DataBase PostgreSQL Web Interface Django
DataBase PostgreSQL Web Interface Django Project List Project Info
● Crowdfunding ● Schema mappin Web Interface ● Data fusion ● Data Portal • Project list • Filter • Access project page • Allows you to download the datasets
Demo
Recommend
More recommend