Submissions:2016/Linked data in the hands of biological researchers: A model organism database powered by Wikidata
Jump to navigation Jump to search
Revision as of 23:17, 31 August 2016 by Putmantime (Created page with "<!-- Simply provide information about your submission below and save the page. --> ;Title: Linked data in the hands of biological researchers: A model organism database powere...")
- Linked data in the hands of biological researchers: A model organism database powered by Wikidata
- Health care and science
- Type of submission
- Timothy E. Putman
- E-mail address
- The Scripps Research Institute
- Rapid improvements in sequencing technologies have resulted in tens of thousands of sequenced bacterial genomes. For decades this type of data has been stored in government funded, large scale databases like the National Center for Biotechnology Information (NCBI) and in primary publications in Pubmed. That knowledge is then extracted and documented by professional curators. Major drawbacks to large scale, expert curated databases are the scope of data (rapidly becoming too large to curate professionally) and the expense of maintaining and extending the infrastructure required. Wikidata is an, openly editable, semantic web compatible framework for knowledge representation and its knowledge integration capabilities make it ideally suited to the challenge of representing the exploding body of information about microbial genomics. In addition to its its technical suitability, the diverse nature of its content goes well beyond life sciences creating a much larger user base with vested interest in seeing it endure. We are developing a microbial specific semantic data model in Wikidata modeling bacterial species, genes, proteins, diseases they cause, and drugs that treat them. The unique ability of Wikidata to allow users to enter data directly provides a central place for community curation, the only feasible way to handle the scope of this data. To get the community involved, we are building a web application that brings our data model of linked organisms, genes, proteins, diseases and drugs to the to the community in an intuitive and powerful interface. This community driven aggregation of knowledge represents a new way of curating massive datasets through community distribution of labor, collective contribution and collective gain.
- Length of presentation
- Special schedule requests
- Preferred room size
- Will you attend WikiConference North America if your submission is not accepted?
If you are interested in attending this session, please sign with your username below. This will help reviewers to decide which sessions are of high interest. Sign with four tildes. (~~~~).
- Add your username here.