Grants:Programs/Wikimedia Community Fund/Rapid Fund/WikiTutur 2.0:Expanding Open-Access Lexicographic Materials for Languages in Western Indonesia (ID: 22733637)

statusUnder review
WikiTutur 2.0: Expanding Open-Access Lexicographic Materials for Languages in Western Indonesia
proposed start date2024-10-01
proposed end date2025-02-01
budget (local currency)81185000 IDR
budget (USD)4908.02 USD
grant typeIndividual
funding regionunknown region
decision fiscal year2023-24
applicant• Ardzun
organization (if applicable)• N/A

This is an automatically generated Meta-Wiki page. The page was copied from Fluxx, the grantmaking web service of Wikimedia Foundation where the user has submitted their application. Please do not make any changes to this page because all changes will be removed after the next update. Use the discussion page for your feedback. The page was created by CR-FluxxBot.

Applicant Details

edit
Main Wikimedia username. (required)

Ardzun

Organization

N/A

If you are a group or organization leader, board member, president, executive director, or staff member at any Wikimedia group, affiliate, or Wikimedia Foundation, you are required to self-identify and present all roles. (required)

N/A

Describe all relevant roles with the name of the group or organization and description of the role. (required)


Main Proposal

edit
1. Please state the title of your proposal. This will also be the Meta-Wiki page title.

WikiTutur 2.0: Expanding Open-Access Lexicographic Materials for Languages in Western Indonesia

2. and 3. Proposed start and end dates for the proposal.

2024-10-01 - 2025-02-01

4. Where will this proposal be implemented? (required)

Indonesia

5. Are your activities part of a Wikimedia movement campaign, project, or event? If so, please select the relevant project or campaign. (required)

Not applicable

6. What is the change you are trying to bring? What are the main challenges or problems you are trying to solve? Describe this change or challenges, as well as main approaches to achieve it. (required)

Challanges: This edition of WikiTutur is a continuation of the first edition, which has been met with enthusiastic responses. Through record-a-thons using Lingua Libre and edit-a-thons in Indonesian Wiktionary, the first edition has collected lexicographic materials from a total of 59 distinctly named lects (regardless of dialect/language status), or 39 languages by ISO 639-3 codes. However, out of these languages, only 6 languages have more than 1,000 audio files, with even fewer unique lexical items. Most contributions from the first edition were limited to basic vocabularies, with little-to-no complex and derivational forms.

As with the previous edition, Indonesian Wiktionary is chosen as the primary Wikimedia project to contribute in this program because it has the potential to reach the most audience, and bridge the communication gap between speakers of different indigenous languages of Indonesia. Currently, most lexical entries from these languages in Indonesian Wiktionary only consist of simple one-word glosses with occasional audio examples, much of which were from the previous edition of WikiTutur. The usability of these entries are limited, as they rarely provide cross-reference to derived lexical items, synonyms, and spelling variants.

Goals: With this second edition, we are hoping to expand audio and textual lexicographic materials in these languages, not only in terms of breadth but also the depth. For this initial phase, we are planning to focus on the languages of Western Indonesia (Greater Sunda Islands). We will collect data from languages in the area, increasing the number of their Lingua Libre audio recordings and improving the quality of their entries in Indonesian Wiktionary. We will add more details in the definitions for each entry, especially those with multiple senses of meaning. We will organize entries around root words by interlinking synonyms and spelling variants, as well as derived words and phrases to improve their usability. Lastly, we will also record audio for example sentences to inform the users on the words’ variations in pronunciations (including suprasegmental elements such as intonations) in different contexts.

Approaches: There are two main approaches that we employ to solve these challenges and achieve the changes we hoped for, which are 1) collaboration and 2) community development.

  1. Collaboration

For this edition, we seek collaboration with Wikimedia Indonesia to engage with local governmental agencies, educational institutions, and communities of native speakers for their expertise and resources. Through these engagements, we are also hoping to attract a diverse array of participants to contribute media content in Wikimedia Commons (through Lingua Libre), textual content in Indonesian Wiktionary, or both. By involving these groups, we hope to achieve a comprehensive coverage of lexicographic data in the targeted languages.

  1. Community development

We will engage with local communities to raise awareness about the ways that Wikimedia projects can be utilized in efforts for language documentation. We will conduct workshops for participants in order to train them both in recording with Lingua Libre and contributing to Wiktionary. We hope that these educational initiatives will empower community members to take an active role in improving the digital presence of their languages. In addition, by building strong relationships with these communities, we can encourage their active participation and support for language documentation and the Wikimedia movement.

7. What are the planned activities? (required) Please provide a list of main activities. You can also add a link to the public page for your project where details about your project can be found. Alternatively, you can upload a timeline document. When the activities include partnerships, include details about your partners and planned partnerships.

Two main activities are planned:

  1. Lokakarya (workshops)
    Lokakarya or workshops are face-to-face events conducted with direct participation from the local groups and native speaker communities across different regions. The events are divided into three sessions, which are 1) introduction to Lingua Libre and Wiktionary and how they can be used to support language documentation, 2) structured record-a-thon with premade lists of lexical items in the target language(s), and 3) structured edit-a-thon in the Indonesian Wiktionary to improve the entries by inputting the audio files, main senses of meaning, and other lexicographic information.
  2. Kopdar (meet-ups & edit-a-thons)
    Kopdar are regular meet-up events in between the Lokakarya, during which the participants from Wikimedia Jakarta will tidy up the entries in Indonesian Wiktionary by interlinking synonyms and spelling variants, cross-referencing the derived and complex forms, as well as inputting more audio files, among others.

These activities will include collaboration and engagement with local agencies, institutions and communities, the full list of which can be found in the answer Quetion 9. The complete timeline of planned activities throughout the program is attached here.


8. Describe your team. Please provide their roles, Wikimedia Usernames and other details. (required) Include more details of the team, including their roles, usernames, Wikimedia group, and whether they are salaried, volunteers, consultants/contractors, etc. Team members involved in the grant application need to be aware of their involvement in the project.

The team for this program consists of 16 people, with 3 staff and 13 non-staff members. Staff (3 people)

  • Ardzun: Project manager. Responsible for managing the whole project and is the primary contact person. Represents the team in communication with the local groups and native speaker communities. Manages financial affairs. Trains participants during the workshop events. Receives salary. Previously also a project manager for the first edition of WikiTutur.
  • Bangrapip: Project assistant. Prepares and processes data from the registration and survey forms. Designs and manages digital content for social media publication. Trains participants during the workshop events. Organizes and analyzes lexicographic materials collected during the program. Previously a non-staff trainer for the first edition of WikiTutur, also a linguist by training.
  • Swarabakti: Project assistant. Creates slide decks to be presented during the workshop events. Prepares guides for workshops and trains the trainers. Trains participants during the workshop events. Organizes and analyzes lexicographic materials collected during the program. Previously a non-staff trainer for the first edition of WikiTutur, also a junior researcher in linguistics.

Non-staff trainers from Wikimedia Jakarta community (6 people)

Train participants during the workshop events along with the staff trainers. Organize lexicographic materials collected during the program.

Non-staff assistant trainers (7 people) Consist of volunteers from:

Wikimedia Medan (2 people)
Wikimedia Banjar (1 person)
Wikimedia Bandar Lampung (1 person)
Wikimedia Padang (1 person)
Wikimedia Pontianak (1 person)
KlubWiki Universitas Brawijaya (1 person)
Non-group local Wikimedian from Bengkulu (1 person)

Help the team to prepare the sites and logistics for the workshop events. Assist staff and non-staff trainers in communicating with the participants throughout the workshop events, especially during the structured record-a-thon and edit-a-thon sessions, in which the participants will be separated into groups.

9. Who are the target participants and from which community? How will you engage participants before and during the activities? How will you follow up with participants after the activities? (required)

As mentioned above, this program will be done in collaboration with Wikimedia Indonesia to reach out and engage with local governmental agencies, educational institutions, and communities of native speakers. These include:

  1. Balai Bahasa Sumatera Barat (West Sumatra Language Agency), a regional governmental agency responsible for the management, development, and cultivation of indigenous languages in the province of West Sumatra, based in Padang, the provincial seat of government.
  2. Balai Bahasa Sumatera Utara (North Sumatra Language Agency), a regional governmental agency responsible for the management, development, and cultivation of indigenous languages in the province of North Sumatra, based in Medan, the provincial seat of government.
  3. Institut Dayakologi, a private institution focusing on research, development, and advocacy of the indigenous Dayak peoples of Kalimantan, based in Pontianak, West Kalimantan.
  4. Wikimedia Banjar, a local community focusing on Wikimedia projects and Wikimedia-related programs in Banjar language, based in Banjarmasin, South Kalimantan.
  5. Wikimedia Bandar Lampung, a local community focusing on Wikimedia projects and Wikimedia-related programs in Bandar Lampung metropolitan area, Lampung.
  6. Wikimedia Medan, a local community focusing on Wikimedia projects and Wikimedia-related programs in Medan metropolitan area.
  7. Wikimedia Padang, a local community focusing on Wikimedia projects and Wikimedia-related programs in Padang metropolitan area.
  8. Wikimedia Pontianak, a local community focusing on Wikimedia projects and Wikimedia-related programs in Pontianak metropolitan area.
  9. KlubWiki Universitas Brawijaya, a student club focusing on Wikimedia projects managed by local Wikimedians in Brawijaya University, based in Malang, East Java.
  10. Non-group local Wikimedians, especially those based in the city of Bengkulu, Bengkulu.

By reaching out and engaging with these groups, we are hoping to involve diverse groups of participants, ranging from linguists and lexicographers from the regional language agencies, university students, active Wikimedians, and other native speakers from the local communities.

Engagement with the participants began by promoting information related to the programs to interested parties and providing them with registration forms. After reaching out to potential participants, they will be filtered based on their native language proficiency in the target languages (across different age groups and gender), and digital competency (for younger speakers specifically). The participants will need to consent to provide materials with irrevocable Creative Commons licenses; they may withdraw any time prior to the events if they wish. Then, the participants will be invited into online chat groups for further communication. Prior to workshop events, the team will work with the participants before workshop events in order to prepare lists for record-a-thons as well as other lexicographic information. Follow-ups in the form of surveys and other information related to the program will also be relayed through the chat groups.

10. Does your project involve work with children or youth? (required)

No

10.1. Please provide a link to your Youth Safety Policy. (required) If the proposal indicates direct contact with children or youth, you are required to outline compliance with international and local laws for working with children and youth, and provide a youth safety policy aligned with these laws. Read more here.

N/A

11. How did you discuss the idea of your project with your community members and/or any relevant groups? Please describe steps taken and provide links to any on-wiki community discussion(s) about the proposal. (required) You need to inform the community and/or group, discuss the project with them, and involve them in planning this proposal. You also need to align the activities with other projects happening in the planned area of implementation to ensure collaboration within the community.
12. Does your proposal aim to work to bridge any of the content knowledge gaps (Knowledge Inequity)? Select one option that most apply to your work. (required)

Language

13. Does your proposal include any of these areas or thematic focus? Select one option that most applies to your work. (required)

Culture, heritage or GLAM

14. Will your work focus on involving participants from any underrepresented communities? Select one option that most apply to your work. (required)

Linguistic / Language

15. In what ways do you think your proposal most contributes to the Movement Strategy 2030 recommendations. Select one that most applies. (required)

Innovate in Free Knowledge

Learning and metrics

edit
17. What do you hope to learn from your work in this project or proposal? (required)
  1. What is the percentage increase in the number of Lingua Libre audio files for Western Indonesian languages in Wikimedia Commons after the second edition of WikiTutur?
  2. What is the percentage increase in the number of lexical entries for the languages of Western Indonesia in Indonesian Wiktionary after the second edition of WikiTutur?
  3. What are the challenges in compiling lexicographic materials from the target languages for Lingua Libre recording and Wiktionary entries?
  4. How does the quality of Wiktionary entries in the second edition of WikiTutur compare to those from the first edition, particularly in terms of depth and complexity?
18. What are your Wikimedia project targets in numbers (metrics)? (required)
Number of participants, editors, and organizers
Other Metrics Target Optional description
Number of participants 98
Number of editors 80
Number of organizers 19
Number of content contributions to Wikimedia projects
Wikimedia project Number of content created or improved
Wikipedia
Wikimedia Commons 4000
Wikidata
Wiktionary 1500
Wikisource
Wikimedia Incubator
Translatewiki
MediaWiki
Wikiquote
Wikivoyage
Wikibooks
Wikiversity
Wikinews
Wikispecies
Wikifunctions or Abstract Wikipedia
Optional description for content contributions.

N/A

19. Do you have any other project targets in numbers (metrics)? (optional)

No

Main Open Metrics Data
Main Open Metrics Description Target
N/A N/A N/A
N/A N/A N/A
N/A N/A N/A
N/A N/A N/A
N/A N/A N/A
20. What tools would you use to measure each metrics? Please refer to the guide for a list of tools. You can also write that you are not sure and need support. (required)

We will use outreach dashboard tool for measuring and viewing participant’s contributions. We also use category tag for measuring the pages.

Financial proposal

edit
21. Please upload your budget for this proposal or indicate the link to it. (required)

https://docs.google.com/spreadsheets/d/16_2sJHNc63sSOhjlPUml_qcF7myjGOnYFquVV18S4qg/edit?gid=0#gid=0


22. and 22.1. What is the amount you are requesting for this proposal? Please provide the amount in your local currency. (required)

81185000 IDR

22.2. Convert the amount requested into USD using the Oanda converter. This is done only to help you assess the USD equivalent of the requested amount. Your request should be between 500 - 5,000 USD.

4908.02 USD

We/I have read the Application Privacy Statement, WMF Friendly Space Policy and Universal Code of Conduct.

Yes

Endorsements and Feedback

edit

Please add endorsements and feedback to the grant discussion page only. Endorsements added here will be removed automatically.

Community members are invited to share meaningful feedback on the proposal and include reasons why they endorse the proposal. Consider the following:

  • Stating why the proposal is important for the communities involved and why they think the strategies chosen will achieve the results that are expected.
  • Highlighting any aspects they think are particularly well developed: for instance, the strategies and activities proposed, the levels of community engagement, outreach to underrepresented groups, addressing knowledge gaps, partnerships, the overall budget and learning and evaluation section of the proposal, etc.
  • Highlighting if the proposal focuses on any interesting research, learning or innovation, etc. Also if it builds on learning from past proposals developed by the individual or organization, or other Wikimedia communities.
  • Analyzing if the proposal is going to contribute in any way to important developments around specific Wikimedia projects or Movement Strategy.
  • Analysing if the proposal is coherent in terms of the objectives, strategies, budget, and expected results (metrics).

Endorse