2019/Grants/A bot to add reference support to Wikidata statements
Title:
A bot to add reference support to Wikidata statements
Name:
Houcemeddine Turki
Wikimedia username:
Csisc
E-mail address:
turkiabdelwahebhotmail.fr
Resume:
Born in May 24, 1994, Houcemeddine Turki is a long-term Wikimedian and a medical student at University of Sfax, Tunisia. He is also a published researcher in Computational Linguistics, Scientometrics and Biomedical Informatics. My publications: https://scholar.google.ca/citations?user=u25grGjf85sC&hl=en My Wikimania 2019 presentations: https://commons.wikimedia.org/wiki/Category:Wikimania_2019_sessions_of_Houcemeddine_Turki
Geographical impact:
Worldwide
Type of project:
Technology
What is your idea?
My idea consists on creating a bot to process news feed and open source search engines to find references to unsupported statements in Wikidata.
Why is it important?
Wikidata statements that are not supported by references are not trustworthy enough to be considered. Adding accessible reference URLs to them will let possible to verify the accuracy of Wikidata statements and consequently to enhanced the quality of Wikidata database.
Is your project already in progress?
I already developed a Python code to retrieve references to biomedical statements from PubMed Central. The principle of the algorithm is explained in https://www.jclinepi.com/article/S0895-4356(17)31073-9/abstract and in https://linkinghub.elsevier.com/retrieve/pii/S1532-0464(18)30094-7
How is it relevant to credibility and Wikipedia? (max 500 words)
Finding references to Wikidata statements will ameliorate the quality of Wikidata-based bot-generated Wikipedia articles, particularly in the context of COVID-19 pandemic.
What is the ultimate impact of this project?
- Reducing the number of unsupported Wikidata statements
- Ameliorate the reference support for Wikipedia articles
Could it scale?
Of course, the bot can later evolve so that it can add references to Wikipedia articles.
Why are you the people to do it?
- I am an editor of Wikidata with over 100000 edits. https://xtools.wmflabs.org/ec/www.wikidata.org/Csisc
- I am a published author in Computational Linguistics and I have the required skills to build the bot
What is the impact of your idea on diversity and inclusiveness of the Wikimedia movement?
This bot can reduce deletion rates in Wikipedia and Wikidata. More new editors will be encouraged to contribute more to Wikipedia and Wikidata when they find their work fixed.
What are the challenges associated with this project and how you will overcome them?
- Internet connectivity matters: We will use a high-speed internet connection option (4G).
- Legal concerns: We will use open license tools and materials.
- High-scale data to process: We will buy a high performance personal computer.
How much money are you requesting?
7500 TND
How will you spend the money?
500 TND will be used to purchase high speed internet connection for the project. 7000 TND will be used to purchase a high performance personal computer
How long will your project take?
6 months
Have you worked on projects for previous grants before?