Difference between revisions of "User:Econterms/WikiProject Patents"

From WikiConference North America
(footnote about copyrighting of patents)
(→‎Lightning talk presentation to WikiConference North America 2018: for recent patent, other software is already specialized and good)
 
(9 intermediate revisions by the same user not shown)
Line 1: Line 1:
; Lightning talk presentation to WikiConference North America 2018
+
= <center>'''Lightning talk presentation to WikiConference North America 2018'''</center> =
  +
{{TOCright}}
 
* WIkidata can record basic information (not detailed information) about tens of millions of patents, someday. Right now there are only a few hundred.
+
* Wikidata can record basic information (not detailed information) about tens of millions of patents, someday. Right now there are only a few hundred.
* We have some basic standards on how to record a patent. We discuss that below. Some things need fixing and new properties.
+
* Project's goal: set standards for patent data on Wikidata, and make it easy
 
* The WikiProject Patents page: [[d:Wikidata:WikiProject_Patents|Wikidata's WikiProject Patents]]
 
* The WikiProject Patents page: [[d:Wikidata:WikiProject_Patents|Wikidata's WikiProject Patents]]
   
  +
* Focus: patents from before 1923, because
* Here we'll focus on recording patents from before 1923. Patents that old aren't copyrighted, aren't secret, and no longer have claims that still apply (to my knowledge -- there could be an exception).<ref>[https://commons.wikimedia.org/wiki/Commons:Village_pump/Copyright#Reproductions_of_patent_text_and_illustrations Commons Village pump discussion of possible copyrighting of later patent documents]</ref>
 
  +
** They're beyond copyright
* I am familiar with historical European and North American patents, less so with modern ones.
 
 
** Their claims (almost?) never apply any more<ref>[https://commons.wikimedia.org/wiki/Commons:Village_pump/Copyright#Reproductions_of_patent_text_and_illustrations Commons Village pump discussion of possible copyrighting of later patent documents]</ref>
  +
** Patents were shorter and simpler back then
  +
** There are not as many: Fewer than 100K annually worldwide before 1910. The numbers grew exponentially. Now, 3 million a year, on the order of 9,000 a day.
  +
** This is relevant to my off-wiki research, tracking aero technology back then
  +
** There exists a lot of specialized software to manage the most recent patents, which are relevant to industry today
  +
* I've begun a conversation with WIPO (World Intellectual Property Organization, the UN unit that manages the more recent treaty's relationships)
  +
  +
= <center>'''Patent data elements'''</center> =
  +
[[File:Otto Lilienthal patent DE-1893-84417.png|600px|right]]
 
* '''Instance of''' (P31) A patent item should be an instance of either patent (Q253623) or U.S. Patent (Q43305660), perhaps both. That property is the one to query (search) that is unique to patents.
 
* '''Page title''' -- one standard form: '''Patent US-1906-827017''', '''Patent CA-1914-153820''' -- different titles are fine too
 
* Country where filed: Here are three options; freely use any or all. They express slightly different things
 
** Use '''issued by''' (P2378) and identify the office to which the patent was filed -- e.g. US Patent and Trademark Office, Japan Patent office (JPO)
 
** Or, "applies to jurisdiction" (P1001) and then the Q-id of the government; or, country (P17) and then the Q-id of the national government/country. The country may not still exist.
 
* '''Filing date''': Formal date of submission of the patent application, and generally speaking the date on which the patent goes into force legally once it's approved
 
* '''Grant date''': Certification by a government that the patent is accepted, and applies in the jurisdiction. (Might be more complicated with later international treaties.)
 
* '''Applicant(s)''' -- there's always at least one ; can include company or university or government lab
 
* '''Inventors''': Zero or more; Might like to mark their order for some we have "author name strings", for others Q-ids (same for scientific publications)
 
* '''Title''': Applicants give a title in the language of t
  +
* '''Patent number''' -- inherited from years ago, e.g. US821393 -- works for those on google patents, and automatically links to that source
  +
** PROBLEM: too strict a format ; what to do for the ones that don't fit the format?
   
  +
* link to Wikisource if patent document is there
* A patent item should be an instance of (P31) either patent (Q253623) or U.S. Patent (Q43305660), perhaps both. That property is the one to query (search) that is unique to patents.
 
 
* Link to Q-id or string of Parent patent or child patent ?
* Page title can be of this form: Patent US-1906-827017, Patent CA-1914-153820 -- or another form if the editor prefers
 
  +
* Assignee? Important in industrialization
* Country where filed: Here are three options; freely use any or all. They express slightly different things. Is one best?
 
 
* Pointer to URL with more information, possibly the full text and diagrams -- '''There is not yet a site that covers the 19th century completely. Wikidata could be the best site for this, someday.'''
** Use issued by (P2378) and identify the office with which the patent was filed -- generally a bureau that is an instance of patent office (Q1148446)) -- e.g. US Patent and Trademark Office, Japan Patent office (JPO)
 
** Use applies to jurisdiction (P1001) and then the Q-number of the national government/country.
 
** Use country (P17) and then the Q-number of the national government/country. It does not need to be a country that still exists. This technique is perhaps more flexible, and it will be necessary to use this option if it is not known what bureau received the patent application.
 
* Filing date: Formal date of submission of the patent application, and generally speaking the date on which the patent goes into force legally once it's approved
 
* Grant date: Certification by a government that the patent is accepted, and applies in the jurisdiction.
 
** Filing and grant seem to be more complicated when there is an international phase, since the later Patent and Cooperation Treaty
 
* Applicant(s) -- there's always at least one ; can include company or university or government lab
 
* Inventors: Zero or more; Might like to mark their order -- some are notable enough for wikidata, others just name strings
 
* Title: A string in the language of
 
* Patent number -- not all patents can use the current property "patentnumber" which has the format US###### -- and seems to require that the patent is on google patents -- what number do we use if patentnumber doesn't work?
 
* Page title on Wikidata
 
* Parent patent or child patent
 
* Assignee
 
* Pointer to URL somewhere with more information, possibly the full text and diagrams -- THERE IS NO ONE PERFECT SITE FOR THIS. Wikidata could be the best site for this, someday.
 
   
; Possible good outcome of getting these basics into Wikidata -- We could add patent offices to the Authority Control line, maybe (?)
+
= <center>'''Possible good outcome from getting patents onto Wikidata'''</center> =
; like USPTO, or WIPO, and if the user clicks there would get an automatic list of patents from Wikidata
+
* We could add patent offices to the Authority Control line, maybe -- like USPTO, or WIPO, and if user clicks can get to a list of patents on Wikidata
 
[[File:AGBell article lower section with authority control.png||right]]
 
[[File:AGBell article lower section with authority control.png||right]]
* some patents could/should be transcribed onto Wikisource
+
* Link together patents transcribed on Wikisource
  +
* Chart patent counts by inventor, country, tech topic; Time lines
  +
* Other insights?
   
; Next steps
+
= <center>'''Next steps'''</center> =
* I will start to upload new patent items using QuickStatements, still just a few
+
* There are a few hundred patents on Wikidata. I will upload more, probably QuickStatements (thanks to Jarekt's help), still just a few
 
* Here's the QuickStatements: https://tools.wmflabs.org/quickstatements/#/batch
 
* Here's the QuickStatements: https://tools.wmflabs.org/quickstatements/#/batch
  +
<pre>
  +
CREATE
  +
LAST Len "Patent US-1906-827017"
  +
LAST P31 Q253623
  +
LAST P1476 en:"Wing of flying machines" S1246 "US827017" S813 +2018-10-19T00:00:00Z/11 S248 Q3235742
  +
</pre>
 
* Any input? How should this be done? What would be useful to you?
 
* Any input? How should this be done? What would be useful to you?
   

Latest revision as of 18:06, 20 October 2018

Lightning talk presentation to WikiConference North America 2018

  • Wikidata could someday record basic (not detailed) information about tens of millions of patents. Right now it has only a few hundred.
  • Project's goal: set standards for patent data on Wikidata, and make entering that data easy
  • The WikiProject Patents page: Wikidata's WikiProject Patents
  • Focus: patents from before 1923, because
    • They're beyond copyright
    • Their claims (almost?) never apply any more[1]
    • Patents were shorter and simpler back then
    • There were not as many: fewer than 100K annually worldwide before 1910. The numbers grew exponentially. Now there are 3 million a year, on the order of 9,000 a day.
    • This is relevant to my off-wiki research, tracking aero technology back then
    • There exists a lot of specialized software to manage the most recent patents, which are relevant to industry today
  • I've begun a conversation with WIPO (World Intellectual Property Organization, the UN unit that manages relationships under the more recent treaty)

Patent data elements

Otto Lilienthal patent DE-1893-84417.png
  • Instance of (P31): A patent item should be an instance of either patent (Q253623) or U.S. Patent (Q43305660), perhaps both. This is the property to query (search) to pick out patents specifically.
  • Page title -- one standard form: Patent US-1906-827017, Patent CA-1914-153820 -- different titles are fine too
  • Country where filed: Here are three options; freely use any or all. They express slightly different things.
    • Use issued by (P2378) and identify the office with which the patent was filed -- e.g. US Patent and Trademark Office, Japan Patent Office (JPO)
    • Or, "applies to jurisdiction" (P1001) and then the Q-id of the government; or, country (P17) and then the Q-id of the national government/country. The country need not still exist.
  • Filing date: Formal date of submission of the patent application, and generally speaking the date on which the patent goes into force legally once it's approved
  • Grant date: Certification by a government that the patent is accepted, and applies in the jurisdiction. (Might be more complicated with later international treaties.)
  • Applicant(s) -- there's always at least one; can include a company, university, or government lab
  • Inventors: Zero or more. We might like to mark their order. For some we have "author name strings", for others Q-ids (the same as for scientific publications)
  • Title: Applicants give a title in the language of t
  • Patent number -- inherited from years ago, e.g. US821393 -- works for those on Google Patents, and automatically links to that source
    • PROBLEM: too strict a format; what to do for the ones that don't fit the format?
  • Link to Wikisource if the patent document is there
  • Link to the Q-id or string of a parent patent or child patent?
  • Assignee? Important in industrialization
  • Pointer to URL with more information, possibly the full text and diagrams -- There is not yet a site that covers the 19th century completely. Wikidata could be the best site for this, someday.
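
The page-title convention and the patent-number problem listed above can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and the "strict" pattern (two capital letters followed by digits only) is an assumption standing in for whatever format the patent number property actually enforces.

```python
import re


def page_title(country: str, year: int, number: str) -> str:
    """Build a title in the form 'Patent US-1906-827017'."""
    return f"Patent {country.upper()}-{year}-{number}"


# Assumed stand-in for the strict format: two capital letters, then digits only.
STRICT_NUMBER = re.compile(r"[A-Z]{2}\d+")


def fits_strict_format(patent_number: str) -> bool:
    """Report whether a patent number matches the assumed strict pattern."""
    return STRICT_NUMBER.fullmatch(patent_number) is not None


if __name__ == "__main__":
    print(page_title("US", 1906, "827017"))
    print(fits_strict_format("US821393"))
    # A number written with separators would need some other treatment.
    print(fits_strict_format("DE-1893-84417"))
```

A check like this could flag, before upload, which patents need a fallback such as a plain-string identifier.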

Possible good outcome from getting patents onto Wikidata

  • We could add patent offices to the Authority Control line, maybe -- like USPTO, or WIPO -- so that a user who clicks there gets a list of patents on Wikidata
AGBell article lower section with authority control.png
  • Link together patents transcribed on Wikisource
  • Chart patent counts by inventor, country, tech topic; timelines
  • Other insights?

Next steps

  • There are a few hundred patents on Wikidata. I will upload more, probably with QuickStatements (thanks to Jarekt's help), still just a few
  • Here's the QuickStatements: https://tools.wmflabs.org/quickstatements/#/batch

 CREATE
 LAST	Len	"Patent US-1906-827017"
 LAST	P31	Q253623
 LAST	P1476	en:"Wing of flying machines"	S1246	"US827017"	S813	+2018-10-19T00:00:00Z/11	S248	Q3235742
  • Any input? How should this be done? What would be useful to you?
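
As one possible way to prepare more than a few items, the batch above could be generated from a small script. The sketch below is not an established tool: the function name `patent_batch` is hypothetical, and it simply reproduces the tab-separated rows of the example, reusing the same properties and Q-ids (P31, P1476, S1246, S813, S248, Q253623, Q3235742).

```python
from datetime import date

PATENT_QID = "Q253623"   # "patent", as used in the batch above
SOURCE_QID = "Q3235742"  # "stated in" value copied from the batch above


def patent_batch(label: str, title: str, patent_number: str,
                 retrieved: date, lang: str = "en") -> str:
    """Return tab-separated QuickStatements text that creates one patent item."""
    # "/11" marks day precision, matching the retrieved date in the example.
    stamp = f"+{retrieved.isoformat()}T00:00:00Z/11"
    rows = [
        ["CREATE"],
        ["LAST", f"L{lang}", f'"{label}"'],
        ["LAST", "P31", PATENT_QID],
        ["LAST", "P1476", f'{lang}:"{title}"',
         "S1246", f'"{patent_number}"', "S813", stamp, "S248", SOURCE_QID],
    ]
    return "\n".join("\t".join(row) for row in rows)


if __name__ == "__main__":
    print(patent_batch("Patent US-1906-827017", "Wing of flying machines",
                       "US827017", date(2018, 10, 19)))
```

Looping this over a spreadsheet of patents would give text to paste into the QuickStatements batch page; filing date, grant date, applicants, and inventors are left out here because the properties to use for them are still under discussion.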

References