effective localization crowdsourcing
play

Effective Localization Crowdsourcing Ratnadeep Debnath - PowerPoint PPT Presentation

Effective Localization Crowdsourcing Ratnadeep Debnath rtnpro@indifex.com Who? @rtnpro Languages L anguages L anguages ......... ..... I personally believe we developed language because of our deep inner need to complain. Jane


  1. Effective Localization Crowdsourcing Ratnadeep Debnath rtnpro@indifex.com

  2. Who? @rtnpro

  3. Languages L anguages L anguages ......... .....

  4. “I personally believe we developed language because of our deep inner need to complain.” —Jane Wagner

  5. 95% People on Earth with native Language other than English

  6. 50% Internet users speaking no English at all

  7. 120 Published languages on Wordpress.com

  8. 150 Wikipedia languages with > 1500 articles

  9. 4x Chances to buy something from websites in native language

  10. $25 Additional revenue per $1 spent on localization

  11. Develop for an international audience

  12. Usability Accessibility Developers

  13. 13% American websites available in multiple languages

  14. What's the solution?

  15. Internationalization , i18n & Localization , L10n

  16. I18n & L10n

  17. Scope ● Language ● Culture ● Writing Conventions ● Subject to regulatory compliance

  18. Why localize? ● Reach to a larger number of users ● Users are more comfortable in native language ● Language is a key factor to develop relations ● Localization is essential for global operations ● Achieve higher company revenues

  19. How does L10n work (from the gettext manual) Original C Sources ───> Preparation ───> Marked C Sources ───╮ │ ╭─────────<─── GNU gettext Library │ ╭─── make <───┤ │ │ ╰─────────<────────────────────┬───────────────╯ │ │ │ ╭─────<─── PACKAGE.pot <─── xgettext <───╯ ╭───<─── PO Compendium │ │ │ ↑ │ │ ╰───╮ │ │ ╰───╮ ├───> PO editor ───╮ │ ├────> msgmerge ──────> LANG.po ────>────────╯ │ │ ╭───╯ │ │ │ │ │ ╰─────────────<───────────────╮ │ │ ├─── New LANG.po <────────────────────╯ │ ╭─── LANG.gmo <─── msgfmt <───╯ │ │ │ ╰───> install ───> /.../LANG/PACKAGE.mo ───╮ │ ├───> "Hello world!" ╰───────> install ───> /.../bin/PROGRAM ───────╯

  20. Localization is a pain

  21. A sample L10n use case

  22. Workflow ● Mark, export strings (PO format) ● Release string freeze ● Translator: VCS checkout ● Translate w/ specialized tools ● Get 'em files back ● SSH ☠ ☢ ☹ , Email , Tickets ● For every friggin release

  23. Challenges ● Too darn hard ● Community isolation ● Quality ● Scalability ● Always more languages, more users

  24. Why Transifex? ● Abstracts various VCS systems ● Easy to use ● Better coordination ● Upstream friendly ● Prevents duplication of work ● Automate L10n workflow ● Open source

  25. History ● 2007: Initial development of Transifex sponsored by Google under its "Google Summer of Code" program ● 2008: Fedora adopts Transifex as its official Localization platform ● 2010: Indifex is chosen by Intel and Nokia to manage the localization of MeeGo ● 2011: Tranisfex is used by 2,000 open source projects and 10,000 users. More than 5 million words have been translated, reaching an audience of more than 30 million people! ● Late 2011: Indifex offers support for the localization of proprietary projects too. Release of an Enterprise-level product, both in self- hosted and managed solutions.

  26. 10K foot view of Transifex's features www.transifex.net/tour/features/overview/

  27. Current selected features of Transifex ● L10n workflow automation ● Workflow control / Project management ● Team management / Communication ● Effective crowdsourcing ● Rich User Interface, web based application ● Translator tools: TM, suggestions, etc ● Quality control / assurance ● Scalable, accessible, open-source ● SaaS, fremium plans ● Social features

  28. Upcoming selected features of Transifex ● Translator marketplace ● Data-mining, automatic rating of translators ● Integrate L10n workflow with any type of content

  29. Transifex Versions ● Transifex.net ● SaaS, plug-n-play, batteries-included ● 1.8K projects, 10.7K users, 50M words ● Transifex Enterprise Edition ● Robust, high-performance, intranet ● Enterprise modules (TM, mgmt, QΑ) ● Transifex Community Edition (open-source)

  30. Transifex Versions

  31. Overview of Transifex Team ● 7-strong global team ● Selected clients: Intel, Nokia, Mozilla, Red Hat ● Selected open-source projects/partners: Fedora, MeeGo, Firefox, Django, Creative Commons, Joomla ● 2 years of specialized L10n services ● Profitable, 100% yearly growth ● Long open source involvement

  32. Overview of Transifex Team

  33. How does L10n work with Transifex

  34. Under the Hood

  35. Django

  36. Our technologies Lotte Rich user interface – Python, Django (MVC) ● ● Many file formats / types of content Advanced web design (Django ● ● Teams Persmissions management – templates-javascript-AJAX) ● Translation history/memory Postgres (RDBMS) ● ● Quality control & assurance MongoDB, memcached (NonRel) ● ● Project/Content management Full-text indexing / search ● ● 3 rd party web services NginX Scalable deployment – ● ● Message queues/server-side Social authentication & features ● ● “Middle level” Transifex apps asynchronous workers ● Unit testing Community features ● ● – WebAPI CLI client – Time Release management ● ● Django addons Data-mining/translator auto-rating ● ● – Logging time based statistics ●

  37. Project / Content management Meet any project's needs ● Project ● Resources ● Categories ● Releases ● Hubs ● Outsourced access

  38. 3 rd Party Web services ● GoogleChart ● Recaptcha ● Gravatar ● Getsatisfaction ● Yahoo pipes

  39. Social authentication & features (includes coming up too) ● Google, Facebook, Twitter, etc ● Tweet my week's translation work ● Tweet my project's progress ● Show tweets that have the '#tx' hashtag in my Transifex public profile ● Show me translation projects my friends like

  40. “Middle level” Transifex apps

  41. – Python Django (MVC) ● MVC ● Template system ● Reusability, plugability ● Forms ● Rapid development ● Caching framework ● DRY ● Extension to python's ● Admin interface testing framework ● ORM ● Ready-to-go authentication ● Regular expression URL ● Cross site request forgery dispatcher protection

  42. Advanced web-design ● Django templating system ● Javascript ● AJAX ● HTML5 ● CSS3

  43. PostgreSQL (Relational database) ● Procedural ● Data types ● User-defined languages ● Indexes objects ● Triggers ● Inheritance ● Multi-version ● Replication ● Add-ons Concurrency Control ● Rules

  44. Non-relational Databases ● MongoDB ● memcached ● Redis

  45. Full-text indexing / search ● Levenshtein algorithm ● Haystack ● Solr

  46. Scalable deployment ● NginX ● Load balancer ● Webserver ● Database server ● Static page server

  47. Scalable deployment 1

  48. Message queues ● Asynchronous workers ● Don't interrupt user-experience ● Run in the background ● Can run distributively

  49. Unit-testing ● Test-suite that simulates all use-cases ● Ensure new changes don't “break” old functionality ● Continuous testing ● Browser test-suite: Selenium

  50. Django-addons ● Open-source project, contributed by us to the community ● Take Django's reusable applications to the next level ● Activate/deactivate addons with a simple command ● Easily “assemble” enterprise Transifex edition

  51. A ride through Transifex

  52. Get started

  53. Plans

  54. Signup

  55. Signin

  56. Dashboard

  57. Add new project - Basic

  58. Add new project - Advanced

  59. Project details

  60. – Project details Upload resource

Recommend


More recommend