Architecture of the Triposo travel guide Douwe Osinga (@dosinga) Thursday, 17 October 13
The Company
Founded by Ex Googlers
In Sydney
Financed in California
Headquartered in Berlin
Distributed Team
Quarterly Jamborees
Last one on a bus in Spain
Algorithms are King
No Human Editing in our Guides
The Product
Our Mission: Build the best travel guide
Mobile
Works offline
Covers the World
Smart
25,000 destinations around the world; 500,000 points of interest; ~5,000,000 downloads
The Platform
Big data
Companies with Big data
Do you have big data?
We didn’t.
Nice, but pricey
Our current server room
Coders are more expensive than Python code is slow
Our System Architecture
Building A database of the world
The Flow (diagram; labels as extracted: Google S3 Spreadsheet Wikipedia Beansie Buildguide OSM Dropbox Snapshot Potala Wikitravel Pipeline Uluru Berlin Crawlers)
Crawl 20 Sources
Split everything into POIs & Locs
Put everything back together
When are two things the same?
When they are: Similar in location and name
OSM Geohashing Wikitravel Wikipedia
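A minimal sketch of how geohashing can narrow down "similar in location": records from different sources that fall into the same geohash cell become candidates for a name comparison. The encoder follows the standard public geohash scheme; the record layout, the `bucket_by_cell` helper and the precision of 6 (cells of roughly a kilometre) are assumptions for illustration, not details from the talk.

```python
from collections import defaultdict

_BASE32 = "0123456789bcdefghjkmnpqrstuvwxyz"

def geohash(lat, lon, precision=6):
    """Encode a latitude/longitude pair as a standard geohash string."""
    lat_lo, lat_hi = -90.0, 90.0
    lon_lo, lon_hi = -180.0, 180.0
    bits = []
    use_lon = True  # geohash interleaves bits, starting with longitude
    while len(bits) < precision * 5:
        if use_lon:
            mid = (lon_lo + lon_hi) / 2
            if lon >= mid:
                bits.append(1)
                lon_lo = mid
            else:
                bits.append(0)
                lon_hi = mid
        else:
            mid = (lat_lo + lat_hi) / 2
            if lat >= mid:
                bits.append(1)
                lat_lo = mid
            else:
                bits.append(0)
                lat_hi = mid
        use_lon = not use_lon
    chars = []
    for i in range(0, len(bits), 5):
        index = 0
        for bit in bits[i:i + 5]:
            index = index * 2 + bit
        chars.append(_BASE32[index])
    return "".join(chars)

def bucket_by_cell(records, precision=6):
    """Group (source, name, lat, lon) records by geohash cell (hypothetical layout)."""
    cells = defaultdict(list)
    for source, name, lat, lon in records:
        cells[geohash(lat, lon, precision)].append((source, name))
    return dict(cells)
```

Two places just across a cell border end up in different buckets, so a real matcher would probably also look at neighbouring cells.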
Suroit Camping vs Camping le Suroite
Shingling! Suroit Camping vs Camping le Suroite
ampi, camp, itca, mpin, oitc, ping, roit, suro, tcam, uroi vs ampi, camp, esur, gles, ingl, lesu, mpin, ngle, oite, ping, roit, suro, uroi (60% Overlap)
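The shingle lists above can be reproduced with a few lines of Python: lowercase the name, drop everything that is not a letter or digit, and take every run of four characters. The slide does not say how the overlap is measured. The Dice coefficient used here is an assumption; it gives about 61% for this pair, close to the 60% on the slide, whereas Jaccard would give 7/16, about 44%.

```python
def shingles(name, k=4):
    """Return the set of k-character shingles of a name, ignoring case, spaces and punctuation."""
    text = "".join(ch for ch in name.lower() if ch.isalnum())
    return {text[i:i + k] for i in range(len(text) - k + 1)}

def dice(a, b):
    """Dice coefficient of two sets: 2 * |a & b| / (|a| + |b|)."""
    if not a and not b:
        return 0.0
    return 2 * len(a & b) / (len(a) + len(b))

left = shingles("Suroit Camping")       # 10 shingles
right = shingles("Camping le Suroite")  # 13 shingles
overlap = dice(left, right)             # 7 shared shingles: 14 / 23
```

Because word boundaries are ignored, word order barely matters: only the shingles that straddle a boundary differ.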
John’s Bar and Grill vs Restaurant John
Stop Shingles!
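One way to read "stop shingles" is as the shingle equivalent of stop words: shingles that show up in many names ("gril", "rest", ...) say little about identity and can be dropped before comparing. The sketch below derives them by counting in how many names each shingle occurs. The tiny corpus, the threshold of 3 and the Dice measure are all assumptions for illustration, not the talk's actual settings.

```python
from collections import Counter

def shingles(name, k=4):
    """Set of k-character shingles, ignoring case, spaces and punctuation."""
    text = "".join(ch for ch in name.lower() if ch.isalnum())
    return {text[i:i + k] for i in range(len(text) - k + 1)}

def dice(a, b):
    """Dice coefficient of two sets."""
    if not a and not b:
        return 0.0
    return 2 * len(a & b) / (len(a) + len(b))

def stop_shingles(names, min_names=3):
    """Shingles occurring in at least min_names different names."""
    counts = Counter()
    for name in names:
        counts.update(shingles(name))  # each name counts a shingle once
    return {s for s, n in counts.items() if n >= min_names}

# Hypothetical corpus; a real one would be every name in the database.
corpus = [
    "John's Bar and Grill", "Mary's Bar and Grill", "Pete's Bar and Grill",
    "Restaurant John", "Restaurant Luna", "Restaurant Roma",
]
stops = stop_shingles(corpus)

a = shingles("John's Bar and Grill")
b = shingles("Restaurant John")
before = dice(a, b)                  # only "john" is shared: 2 / 24
after = dice(a - stops, b - stops)   # generic shingles removed: 2 / 8
```

The score is still modest because shingles that straddle word boundaries differ, but the generic words no longer drown out the shared "john".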
Haarlem Library vs Haarlem City Hall
Location Stop Shingles
Van Gogh Museum vs Van Gogh Hotel
Types from names
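A minimal sketch of "types from names": look for a type word in each name and refuse to merge two candidates whose guessed types conflict. The keyword table and both helper functions are hypothetical; the talk does not show how types are actually derived.

```python
import re

# Hypothetical keyword table; a real one would be much larger and multilingual.
TYPE_KEYWORDS = {
    "museum": "museum",
    "hotel": "hotel",
    "hostel": "hotel",
    "cafe": "cafe",
    "library": "library",
    "camping": "campsite",
}

def guess_type(name):
    """Return the type of the first known keyword in the name, or None."""
    for word in re.findall(r"[a-z]+", name.lower()):
        if word in TYPE_KEYWORDS:
            return TYPE_KEYWORDS[word]
    return None

def types_compatible(name_a, name_b):
    """Two names conflict only if both yield a type and the types differ."""
    type_a, type_b = guess_type(name_a), guess_type(name_b)
    return type_a is None or type_b is None or type_a == type_b

same_place = types_compatible("Van Gogh Museum", "Van Gogh Hotel")  # False
```

When one of the names gives no type at all, this check cannot tell the two apart and the decision falls back to the name and location similarity.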
Cafe Sydney Opera House vs Sydney Opera House
Hmm
Side effect: Cuisine guesser
Making data useful
Some further processing...
Learning from pictures
Learning from Wikipedia
Wikipedia article distribution
Learning from ratings?
No Soup for you
Opinion Mining
Natural Language Processing
If you count RegExps...
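To illustrate what regex-based opinion mining can look like, here is a small sketch that pulls (opinion word, subject) pairs out of free text. The word list, the pattern and the `extract_opinions` helper are invented for this example and are not Triposo's actual expressions.

```python
import re

# Hypothetical opinion vocabulary; the real rules are not shown in the talk.
OPINION = re.compile(
    r"\b(great|excellent|amazing|terrible|awful|overpriced)\s+(\w+)",
    re.IGNORECASE,
)

def extract_opinions(text):
    """Return (opinion word, following word) pairs found in the text."""
    return [(m.group(1).lower(), m.group(2).lower()) for m in OPINION.finditer(text)]

pairs = extract_opinions("Great coffee but terrible service.")
```

A pattern like this misses negation ("not great") and longer phrases, which is roughly where regular expressions stop and proper language processing starts.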
Further experiments http://labs.triposo.com
Can I suggest something?
The Weather
Text Your location
Time
Personalization
Personalization
Personalization
Conclusion
Conclusion If you can’t solve a problem in 50 lines of Python running on a server in the kitchen, it must be really hard.
Thanks! Douwe Osinga (@dosinga)