2019/Grants/A bot to add reference support to Wikidata statements

From WikiConference North America
< 2019‎ | Grants
Revision as of 17:10, 11 May 2020 by Csisc (talk | contribs) (Created page with "{{WCNA 2019 Grant Submission |name=Houcemeddine Turki |username=Csisc |email=turkiabdelwaheb{{@}}hotmail.fr |resume=Born in May 24, 1994, Houcemeddine Turki is a long-term Wik...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search


Title:

A bot to add reference support to Wikidata statements

Name:

Houcemeddine Turki

Wikimedia username:

Csisc

E-mail address:

turkiabdelwaheb@hotmail.fr

Resume:

Born in May 24, 1994, Houcemeddine Turki is a long-term Wikimedian and a medical student at University of Sfax, Tunisia. He is also a published researcher in Computational Linguistics, Scientometrics and Biomedical Informatics. My publications: https://scholar.google.ca/citations?user=u25grGjf85sC&hl=en My Wikimania 2019 presentations: https://commons.wikimedia.org/wiki/Category:Wikimania_2019_sessions_of_Houcemeddine_Turki

Geographical impact:

Worldwide

Type of project:

Technology

What is your idea?

My idea consists on creating a bot to process news feed and open source search engines to find references to unsupported statements in Wikidata.

Why is it important?

Wikidata statements that are not supported by references are not trustworthy enough to be considered. Adding accessible reference URLs to them will let possible to verify the accuracy of Wikidata statements and consequently to enhanced the quality of Wikidata database.

Is your project already in progress?

I already developed a Python code to retrieve references to biomedical statements from PubMed Central. The principle of the algorithm is explained in https://www.jclinepi.com/article/S0895-4356(17)31073-9/abstract and in https://linkinghub.elsevier.com/retrieve/pii/S1532-0464(18)30094-7

How is it relevant to credibility and Wikipedia? (max 500 words)

Finding references to Wikidata statements will ameliorate the quality of Wikidata-based bot-generated Wikipedia articles, particularly in the context of COVID-19 pandemic.

What is the ultimate impact of this project?

  • Reducing the number of unsupported Wikidata statements
  • Ameliorate the reference support for Wikipedia articles

Could it scale?

Of course, the bot can later evolve so that it can add references to Wikipedia articles.

Why are you the people to do it?

What is the impact of your idea on diversity and inclusiveness of the Wikimedia movement?

This bot can reduce deletion rates in Wikipedia and Wikidata. More new editors will be encouraged to contribute more to Wikipedia and Wikidata when they find their work fixed.

What are the challenges associated with this project and how you will overcome them?

  • Internet connectivity matters: We will use a high-speed internet connection option (4G).
  • Legal concerns: We will use open license tools and materials.
  • High-scale data to process: We will buy a high performance personal computer.

How much money are you requesting?

7500 TND

How will you spend the money?

500 TND will be used to purchase high speed internet connection for the project. 7000 TND will be used to purchase a high performance personal computer

How long will your project take?

6 months

Have you worked on projects for previous grants before?

https://meta.wikimedia.org/wiki/Grants:Project/Rapid/Csisc/SPARQL:_Be_connected_to_Wikidata