Difference between revisions of "User:Econterms/WikiProject Patents"
Jump to navigation
Jump to search
(enriched and detailed) |
(→Lightning talk presentation to WikiConference North America 2018: for recent patent, other software is already specialized and good) |
||
(11 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
⚫ | |||
⚫ | |||
+ | {{TOCright}} |
||
+ | * Wikidata can record basic information (not detailed information) about tens of millions of patents, someday. Right now there are only a few hundred. |
||
+ | * Project's goal: set standards for patent data on Wikidata, and make it easy |
||
⚫ | |||
+ | * Focus: patents from before 1923, because |
||
⚫ | |||
+ | ** They're beyond copyright |
||
− | * I will start to upload new patent items using QuickStatements, still just a few |
||
+ | ** Their claims (almost?) never apply any more<ref>[https://commons.wikimedia.org/wiki/Commons:Village_pump/Copyright#Reproductions_of_patent_text_and_illustrations Commons Village pump discussion of possible copyrighting of later patent documents]</ref> |
||
− | * We have some basic standards on how to record a patent. Some things need fixing and new properties. |
||
+ | ** Patents were shorter and simpler back then |
||
− | * Here we are focused on recording patents from before 1923. Patents that old aren't copyrighted, aren't secret, and no longer have claims that still apply (to my knowledge -- there could be an exception). |
||
+ | ** There are not as many: Fewer than 100K annually worldwide before 1910. The numbers grew exponentially. Now, 3 million a year, on the order of 9,000 a day. |
||
+ | ** This is relevant to my off-wiki research, tracking aero technology back then |
||
+ | ** There exists a lot of specialized software to manage the most recent patents, which are relevant to industry today |
||
+ | * I've begun a conversation with WIPO (World Intellectual Property Organization, the UN unit that manages the more recent treaty's relationships) |
||
+ | = <center>'''Patent data elements'''</center> = |
||
⚫ | |||
+ | [[File:Otto Lilienthal patent DE-1893-84417.png|600px|right]] |
||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
− | ** Use country (P17) and then the Q-number of the national government/country. It does not need to be a country that still exists. This technique is perhaps more flexible, and it will be necessary to use this option if it is not known what bureau received the patent application. |
||
⚫ | |||
− | * Filing date: Formal date of submission of the patent application, and generally speaking the date on which the patent goes into force legally once it's approved |
+ | * '''Filing date''': Formal date of submission of the patent application, and generally speaking the date on which the patent goes into force legally once it's approved |
− | * Grant date: Certification by a government that the patent is accepted, and applies in the jurisdiction. |
+ | * '''Grant date''': Certification by a government that the patent is accepted, and applies in the jurisdiction. (Might be more complicated with later international treaties.) |
− | ** Filing and grant seem to be more complicated when there is an international phase, since the later Patent and Cooperation Treaty |
||
− | * Applicant(s) -- there's always at least one ; can include company or university or government lab |
+ | * '''Applicant(s)''' -- there's always at least one ; can include company or university or government lab |
− | * Inventors: Zero or more; Might like to mark their order |
+ | * '''Inventors''': Zero or more; Might like to mark their order for some we have "author name strings", for others Q-ids (same for scientific publications) |
− | * Title: |
+ | * '''Title''': Applicants give a title in the language of t |
+ | * '''Patent number''' -- inherited from years ago, e.g. US821393 -- works for those on google patents, and automatically links to that source |
||
− | * Patent number -- not all patents can use the current property "patentnumber" which has the format US###### |
||
+ | ** PROBLEM: too strict a format ; what to do for the ones that don't fit the format? |
||
− | * Page title on Wikidata |
||
⚫ | |||
− | * Assignee |
||
− | * Pointer to URL somewhere with more information, possibly the full text and diagrams -- THERE IS NO ONE PERFECT SITE FOR THIS |
||
+ | * link to Wikisource if patent document is there |
||
− | ; Possible good outcome of getting these basics into Wikidata -- We could add patent offices to the Authority Control line, maybe (?) |
||
⚫ | |||
⚫ | |||
+ | * Assignee? Important in industrialization |
||
+ | * Pointer to URL with more information, possibly the full text and diagrams -- '''There is not yet a site and covers the 19th century completely. Wikidata could be the best site for this, someday.''' |
||
+ | |||
+ | = <center>'''Possible good outcome from getting patents onto Wikdiata'''</center> = |
||
⚫ | |||
[[File:AGBell article lower section with authority control.png||right]] |
[[File:AGBell article lower section with authority control.png||right]] |
||
− | * |
+ | * Link together patents transcribed on Wikisource |
+ | * Chart patent counts by inventor, country, tech topic; Time lines |
||
+ | * Other insights? |
||
+ | |||
+ | = <center>'''Next steps'''</center> = |
||
+ | * There are a few hundred patents on Wikidata. I will upload more, probably QuickStatements (thanks to Jarekt's help), still just a few |
||
+ | * Here's the QuickStatements: https://tools.wmflabs.org/quickstatements/#/batch |
||
+ | <pre> |
||
+ | CREATE |
||
+ | LAST Len "Patent US-1906-827017" |
||
+ | LAST P31 Q253623 |
||
+ | LAST P1476 en:"Wing of flying machines" S1246 "US827017" S813 +2018-10-19T00:00:00Z/11 S248 Q3235742 |
||
+ | </pre> |
||
+ | * Any input? How should this be done? What would be useful to you? |
||
+ | |||
+ | === References === |
||
+ | <references /> |
Latest revision as of 18:06, 20 October 2018
Lightning talk presentation to WikiConference North America 2018
- Wikidata can record basic information (not detailed information) about tens of millions of patents, someday. Right now there are only a few hundred.
- Project's goal: set standards for patent data on Wikidata, and make it easy
- The WikiProject Patents page: Wikidata's WikiProject Patents
- Focus: patents from before 1923, because
- They're beyond copyright
- Their claims (almost?) never apply any more[1]
- Patents were shorter and simpler back then
- There are not as many: Fewer than 100K annually worldwide before 1910. The numbers grew exponentially. Now, 3 million a year, on the order of 9,000 a day.
- This is relevant to my off-wiki research, tracking aero technology back then
- There exists a lot of specialized software to manage the most recent patents, which are relevant to industry today
- I've begun a conversation with WIPO (World Intellectual Property Organization, the UN unit that manages the more recent treaty's relationships)
Patent data elements
- Instance of (P31) A patent item should be an instance of either patent (Q253623) or U.S. Patent (Q43305660), perhaps both. That property is the one to query (search) that is unique to patents.
- Page title -- one standard form: Patent US-1906-827017, Patent CA-1914-153820 -- different titles are fine too
- Country where filed: Here are three options; freely use any or all. They express slightly different things
- Use issued by (P2378) and identify the office to which the patent was filed -- e.g. US Patent and Trademark Office, Japan Patent office (JPO)
- Or, "applies to jurisdiction" (P1001) and then the Q-id of the government; or, country (P17) and then the Q-id of the national government/country. The country may not still exist.
- Filing date: Formal date of submission of the patent application, and generally speaking the date on which the patent goes into force legally once it's approved
- Grant date: Certification by a government that the patent is accepted, and applies in the jurisdiction. (Might be more complicated with later international treaties.)
- Applicant(s) -- there's always at least one ; can include company or university or government lab
- Inventors: Zero or more; Might like to mark their order for some we have "author name strings", for others Q-ids (same for scientific publications)
- Title: Applicants give a title in the language of t
- Patent number -- inherited from years ago, e.g. US821393 -- works for those on google patents, and automatically links to that source
- PROBLEM: too strict a format ; what to do for the ones that don't fit the format?
- link to Wikisource if patent document is there
- Link to Q-id or string of Parent patent or child patent ?
- Assignee? Important in industrialization
- Pointer to URL with more information, possibly the full text and diagrams -- There is not yet a site and covers the 19th century completely. Wikidata could be the best site for this, someday.
Possible good outcome from getting patents onto Wikdiata
- We could add patent offices to the Authority Control line, maybe -- like USPTO, or WIPO, and if user clicks can get to a list of patents on Wikidata
- Link together patents transcribed on Wikisource
- Chart patent counts by inventor, country, tech topic; Time lines
- Other insights?
Next steps
- There are a few hundred patents on Wikidata. I will upload more, probably QuickStatements (thanks to Jarekt's help), still just a few
- Here's the QuickStatements: https://tools.wmflabs.org/quickstatements/#/batch
CREATE LAST Len "Patent US-1906-827017" LAST P31 Q253623 LAST P1476 en:"Wing of flying machines" S1246 "US827017" S813 +2018-10-19T00:00:00Z/11 S248 Q3235742
- Any input? How should this be done? What would be useful to you?