Linguistic Research Infrastructure - LiRI Linguistic Research Infrastructure Information event October 11, 2019 LiRI team members 10/18/2019 Title of the presentation, Author Page 1
Linguistic Research Infrastructure - LiRI Introduction Elisabeth Stark (project leader, member of LiRI board)
Linguistic Research Infrastructure - LiRI Overall idea: The LiRI Architecture Page 3
Linguistic Research Infrastructure - LiRI LiRI mission and strategy We build a new laboratory (collection of devices and facilities) for linguistics and language and speech sciences, plus data storage/processing/science via a group of experts (“ LiRI staff”) . We aim at enabling scientific cooperation by providing access to state-of-the-art research infrastructures providing access to shared research resources providing ample data science support participating in teaching and advising graduate students (MA, PhD level) providing compatibility to the major European and international research infrastructure standards (e.g. CLARIN, FAIR principles) enabling access to both national and international funding sources for research projects that require an excellent digital research infrastructure Our vision: LiRI as a starting point for mid- and large-scale collaborative national and international third- party funded research projects. 18.10.2019 Seite 4
Linguistic Research Infrastructure - LiRI Context: What happened until now • March 2017: Invitation to submit short proposals (> 5 mio CHF) for the „Swiss Roadmap for Research Infrastructures 2021-2024 “ Co-application by two overarching linguistic units ( ZüKL and URPP „Language and Space“) • January 2018: Successful evaluation, invitation to submit long proposal • July 2018: „A“ evaluation by SNSF (three external experts) • October 2018: Decision of board of UZH to establish LiRI with local funding (= continuous internal applications for each funding year plus SNSF applications for larger devices, R‘Equip , by LiRI team) • 17.04.2019 Integration in the Swiss Roadmap • First large-scale research infrastructure in linguistics in Switzerland Page 5
Linguistic Research Infrastructure - LiRI Facilities and devices at LiRI
Linguistic Research Infrastructure - LiRI LiRI staff • (Administration/coordination) • System administrator • Technician (for lab/devices) • Data acquisition expert • Data processing expert / software development • Data scientists (from 2021 onwards) Plus ‘LIS team’ until July 2020: cl specialists plus software developer to set up the “Linguistic Information System”, in collaboration with local IT unit S3IT
Linguistic Research Infrastructure - LiRI Examples of possible projects in LiRI and users of LiRI • Interaction in larger groups : eye-tracking devices, software, computing power (3d model) • The role of input and intake in language acquisition : Videocameras, eye-tracking devices, LENA devices, esp. for fieldwork • Language and communication skills of the elderly people : EEG systems (stationary and mobile), ABR, NIRS • Speaker recognition and understanding of speech articulation : sound-proof cabins, articulatograph, measurement devices User groups : - researchers at universities (local, national, international); - specialized research institutions; - partners beyond academia, also industry ( Forensisches Institut , speech pathologists, etc.). Page 8
Linguistic Research Infrastructure - LiRI What has been done so far - Constitution of the LiRI Board (Sabine Stoll, Martin Volk, Elisabeth Stark) plus LiRI Team (Volker Dellwo, Martin Meyer, Wolfgang Kesselheim; coordination: Agnes Kolmer). – Today: Constitution of LiRI SAB: Prof Shanley E.M. Allen , TU Kaiserslautern; Prof Lars Borin , University of Gothenburg; Prof Anne-Lise Giraud , Université de Genève; Prof Stuart Rosen , University College London UCL; Prof Lukas Rosenthaler , University of Basel.b - Launch of a dedicated website, some interviews - Submission of proposals (SNSF, local funding schemes) to finance the LiRI devices - Hiring of one data scientist, specialist on Data Acquisition (in the field): Dagmar Jung - Constitution of a team to set up the Linguistic Information system in collaboration with S3IT (head: Marcel Riedi): Taras Zakharko, Gerold Schneider, Stefan Vrankovic - Collaboration with the Data Services Team for interface to SwissUbase (Andrea Malits, Florian Steurer) - Ongoing work on rules of procedures / price list for fees - Advertisement of a second job position (focus on data management/processing, data bases, software development) plus a technician, see website Page 9
Linguistic Research Infrastructure - LiRI Next steps - Finalizing rules of procedures / price list for fees, implement a charter of access recognition of LiRI as a local UZH technology platform - Hire staff (data processing specialist/software developer and technician) - Acquire and install devices and facilities - Set up the laboratory - Start of LiRI on July 1 2020. Page 10
Linguistic Research Infrastructure - LiRI Data Acquisition Units @ LiRI and LiRI research environment Presenters: Sabine Stoll, Dagmar Jung, Wolfgang Kesselheim, Martin Meyer, Volker Dellwo 10/12/19 LiRI Page 1
Linguistic Research Infrastructure - LiRI Interest groups • Neurolinguistics Language Language • Language Development Interaction Development • Phonetics • Interaction Studies Language Language Production Processing • Digital Linguistics 10/12/19 LiRI Page 2
Linguistic Research Infrastructure - LiRI Why LiRI? Evolving new methods in all disciplines of Language Sciences: Equipment and support Data collection and preparation Enabling data reproducibility 10/12/19 LiRI Page 3
Linguistic Research Infrastructure - LiRI LiRI data acquisition unit What we do Provide linguistic data acquisition devices • Support for linguistic research and a matching • Evidence-based linguistics and big data research environment • Help with Laboratory and Field set-up scenarios • Help with acquiring and • LiRI-LAB infrastructure and processing textual data portable devices • How to design a corpus – experience based on Best • Localized software for data Practices processing and analysis • Ensure sustainability of data 10/12/19 LiRI Page 4
Linguistic Research Infrastructure - LiRI Language Data Scientists and LAB technician Which equipment for the research question? How are the devices employed successfully? Which workflow ? How to go about data management , data files and metadata? What about ethics , informed consent? Where should the data be archived , in which format, what about access rights…? 10/12/19 LiRI Page 5
Linguistic Research Infrastructure - LiRI Primary data: Workflow management Field Written Lab Audiovisual data Textual data Experimental data Data Annotation -> Corpus compilation 10/12/19 LiRI Page 6
�� ������� �� �������� ���� ������� ���� ������ �������� ����������� ����� ������ ���� ������ ������� ���������� Linguistic Research Infrastructure - LiRI How to design a corpus – weekly record video monthly yearly experience based on Best Practices: corrections on demand collect metadata store media send ELAN Workflow management CRDN enter metadata translation report recordings transcription Recording media to FNUNIV integrate media process media media to CRDN Metadata create backup FNUNIV media to UZH Annotation update database integrate media assign TA Analysis process metadata check ELAN UZH integrate ELAN archive Publication (versioning) assign glosser glossing release answer questions send questions export to R Archiving check TBX update/send TBX anonymize integrate TBX corrections 10/12/19 LiRI Page 7
Linguistic Research Infrastructure - LiRI Primary data: written/textual • Collecting digital text data (e.g. by web crawling) • Scanning and OCR of printed texts or handwritten manuscripts • Crowd sourcing and Citizen Science • Optical Character Recognition ( OCR ): • which equipment and settings • which software to use involving complex page layout • which export formats depending on the further needs for analysis or editing 10/12/19 LiRI Page 8
Linguistic Research Infrastructure - LiRI Available equipment in the field and in the lab LENA speech analysis devices High resolution video cameras (4K) Eye-tracking (LAB and portable) Time-of-Flight Cameras Hi-end acoustic recording equipment Electroencephalogram (EEG) Electromagnetic Articulograph (EMA) Functional Near Infrared Spectroscopy (fNIRS) 10/12/19 LiRI Page 9
Linguistic Research Infrastructure - LiRI Language Development – Acquisition devices How much input? LENA technology is standard for measuring talk with children. LENA uses a small wearable audio recorder that is combined with a speech recognition algorithm : it automatically analyzes and segments the audio data in different time frames 10/12/19 LiRI Page 10
Linguistic Research Infrastructure - LiRI LENA 10/12/19 LiRI Page 11
Linguistic Research Infrastructure - LiRI LENA in the field 10/12/19 LiRI Page 12
Linguistic Research Infrastructure - LiRI LENA: automatic analysis of audio environment 10/12/19 LiRI Page 13
Recommend
More recommend