Difference between revisions of "Submissions:2014/Answering Big Questions With Wikidata"
Line 1: | Line 1: | ||
<!-- Simply provide information about your submission below and save the page. --> |
<!-- Simply provide information about your submission below and save the page. --> |
||
;Title of the submission: |
;Title of the submission: |
||
− | Answering Big Questions With Wikidata |
+ | Answering Big Questions With Wikidata |
;Themes ([[Submissions#Proposal Themes|Proposal Themes]] - Community, Tech, Outreach, GLAM, Education): |
;Themes ([[Submissions#Proposal Themes|Proposal Themes]] - Community, Tech, Outreach, GLAM, Education): |
||
Line 28: | Line 28: | ||
;Abstract ''(at least 300 words to describe your proposal)'': |
;Abstract ''(at least 300 words to describe your proposal)'': |
||
− | Wikidata is live |
+ | Wikidata is live and starting to fulfill its potential as a data repository, but its also becoming more than the sum of its data. Wikipedia in its early stages started to see success as an encyclopedia, but then also proved invaluable for researchers as a way to understand online collaboration, and how people interacted around free-text content. Likewise the aspects of what Wikidata can tell us as a corpus, not just its individual facts, is slowly emerging. From knowing which Wikipedias have the highest percentage of their Biography articles about women, to visualizing the planet with geodata, new world perspective is being uncovered. This workshop will be a hands-on showing researchers and programmers how to answer their big questions with Wikidata. Its core points will be: |
− | #What Wikidata is from a technical |
+ | #What Wikidata is from a technical viewpoint: |
##Native format |
##Native format |
||
##Structure of a Wikidata Item |
##Structure of a Wikidata Item |
||
Line 36: | Line 36: | ||
#Example Questions already answered: |
#Example Questions already answered: |
||
##Which language Wikipedias are the most unique? |
##Which language Wikipedias are the most unique? |
||
− | ##Which |
+ | ##Which language's Biography articles composition are most female? |
##What is the name of every language in every language? |
##What is the name of every language in every language? |
||
##What are the most popular book genres in each language Wikipedia? |
##What are the most popular book genres in each language Wikipedia? |
||
Line 42: | Line 42: | ||
#How to use Pywikipedia to get the data live. |
#How to use Pywikipedia to get the data live. |
||
##Code walkthrough. |
##Code walkthrough. |
||
− | ##Utilizing what links here along with |
+ | ###Utilizing what links here along with |
− | ## |
+ | ###new classes in the pywikibot library. |
#How to use WDA to work with the data offline. |
#How to use WDA to work with the data offline. |
||
##WDA is a python script |
##WDA is a python script |
||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
##More elegant solutions have already received funding. |
##More elegant solutions have already received funding. |
||
⚫ | |||
− | ##Code walkthrough. |
||
Revision as of 00:56, 21 March 2014
- Title of the submission
Answering Big Questions With Wikidata
- Themes (Proposal Themes - Community, Tech, Outreach, GLAM, Education)
Tech
- Type of submission (Presentation Types - Panel, Workshop, Presentation, etc)
Workshop
- Author of the submission
Max Klein
- E-mail address
isalix@gmail.com
- Username
- US state or country of origin
California
- Affiliation, if any (organization, company etc.)
None
- Personal homepage or blog
- Abstract (at least 300 words to describe your proposal)
Wikidata is live and starting to fulfill its potential as a data repository, but its also becoming more than the sum of its data. Wikipedia in its early stages started to see success as an encyclopedia, but then also proved invaluable for researchers as a way to understand online collaboration, and how people interacted around free-text content. Likewise the aspects of what Wikidata can tell us as a corpus, not just its individual facts, is slowly emerging. From knowing which Wikipedias have the highest percentage of their Biography articles about women, to visualizing the planet with geodata, new world perspective is being uncovered. This workshop will be a hands-on showing researchers and programmers how to answer their big questions with Wikidata. Its core points will be:
- What Wikidata is from a technical viewpoint:
- Native format
- Structure of a Wikidata Item
- Data Types
- Example Questions already answered:
- Which language Wikipedias are the most unique?
- Which language's Biography articles composition are most female?
- What is the name of every language in every language?
- What are the most popular book genres in each language Wikipedia?
- Denny's Coastline and Subway Maps
- How to use Pywikipedia to get the data live.
- Code walkthrough.
- Utilizing what links here along with
- new classes in the pywikibot library.
- Code walkthrough.
- How to use WDA to work with the data offline.
- WDA is a python script
- Downloads incremental dumps.
- At minimum you can do text parsing on a huge file
- More elegant solutions have already received funding.
- Demo of the Java library.
- WDA is a python script
- Length of presentation/talk (see Presentation Types for lengths of different presentation types)
- 75 Minutes
This workshop could be done in 60 minutes.
- Will you attend WikiConference USA if your submission is not accepted?
Yes if I also receive travel scholarship.
- Slides or further information (optional)
Have given a similar talk at Wikimedia Foundation Headquarters video here.
And blog posts [1], [2], [3], and [4].
- Special request as to time of presentations
Interested attendees
If you are interested in attending this session, please sign with your username below. This will help reviewers to decide which sessions are of high interest. Sign with four tildes. (~~~~).
- Add your username here.