Patent dataset

Description:

This dataset spans 37 years (January 1, 1963 to December 30, 1999) and includes all utility patents granted during that period, totaling 2,923,922 patents. The citation graph includes all citations made by patents granted between 1975 and 1999, totaling 16,522,438 citations. The data was originally released by NBER (http://www.nber.org/patents/) and was then processed into five relational tables (patent, inventor, assignee, class, and category) and one citation graph.

Complete Data Statistics:

Relational Tables:

            Patent     Inventor   Assignee  Class  Category
Rows        2,923,922  4,301,229  175,115   426    36
Attributes  23         11         7         4      5

Directed Graph:

            Citation
Nodes       3,774,768
Edges       16,518,948

Sample Data:

patent.table:

"PATENT","GYEAR","GDATE","APPYEAR","COUNTRY","POSTATE","ASSIGNEE","ASSCODE","CLAIMS","NCLASS",...
3070801,1963,1096,,"BE","",,1,,269,...
3070802,1963,1096,,"US","TX",,1,,2,...
3070803,1963,1096,,"US","IL",,1,,2,...
3070804,1963,1096,,"US","OH",,1,,2,...
3070805,1963,1096,,"US","CA",,1,,2,...

where

  • "PATENT" is "Patent number"
  • "GYEAR" is "Grant year"
  • "GDATE" is "Grant date"
  • "APPYEAR" is "Application year"
  • "COUNTRY" is "Country of first inventor"
  • "POSTATE" is "State of first inventor (if US)"
  • "ASSIGNEE" is "Assignee identifier (missing 1963-1967)"
  • "ASSCODE" is "Assignee type"
  • "CLAIMS" is "Number of claims"
  • "NCLASS" is "Main patent class (3 digit)"
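
The GDATE values in the sample rows are day counts, not calendar dates. The sketch below assumes that GDATE is the number of days elapsed since January 1, 1960 (the SAS date convention); under that assumption 1096 decodes to January 1, 1963, which agrees with GYEAR in the sample. The names `gdate_to_date` and `read_patents` are illustrative only, and the inline sample keeps just the ten columns shown above because the remaining columns are elided in the listing.

```python
import csv
import io
from datetime import date, timedelta

# Assumption: GDATE counts days since 1960-01-01 (SAS date epoch).
SAS_EPOCH = date(1960, 1, 1)


def gdate_to_date(gdate: int) -> date:
    """Convert a GDATE day count into a calendar date."""
    return SAS_EPOCH + timedelta(days=gdate)


# First ten columns of two sample rows; the real file has 23 columns.
SAMPLE = '''"PATENT","GYEAR","GDATE","APPYEAR","COUNTRY","POSTATE","ASSIGNEE","ASSCODE","CLAIMS","NCLASS"
3070801,1963,1096,,"BE","",,1,,269
3070802,1963,1096,,"US","TX",,1,,2
'''


def read_patents(text: str) -> list:
    """Parse patent rows, mapping empty numeric fields to None."""
    rows = []
    for row in csv.DictReader(io.StringIO(text)):
        rows.append({
            "PATENT": int(row["PATENT"]),
            "GYEAR": int(row["GYEAR"]),
            "GRANTED": gdate_to_date(int(row["GDATE"])),
            "COUNTRY": row["COUNTRY"],
            "CLAIMS": int(row["CLAIMS"]) if row["CLAIMS"] else None,
            "NCLASS": int(row["NCLASS"]),
        })
    return rows


patents = read_patents(SAMPLE)
```

Empty fields such as APPYEAR and CLAIMS in the 1963 rows are kept as missing values rather than coerced to zero, so later aggregates are not skewed.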

inventor.table:

"PATENT","LASTNAM","FIRSTNAM","MIDNAM","MODIFNAM","STREET","CITY","POSTATE","COUNTRY","ZIP","INVSEQ"
3858241,"Durand","Philip","E.","","","Hudson","MA","US","",1
3858241,"Norris","Lonnie","H.","","","Milford","MA","US","",2
3858242,"Gooding","Elwyn","R.","","120 Darwin Rd.","Pinckney","MI","US","48169",1
3858243,"Pierron","Claude","Raymond","","","Epinal","","FR","",1
3858243,"Jenny","Jean","Paul","","","Decines","","FR","",2

where

  • "PATENT" is "Patent number"
  • "LASTNAM" is "Last name of inventor"
  • "FIRSTNAM" is "First name of inventor"
  • "MIDNAM" is "Middle name of inventor"
  • "MODIFNAM" is "Surname modifier"
  • "STREET" is "Street address"
  • "CITY" is "City"
  • "POSTATE" is "State code"
  • "COUNTRY" is "Country code"
  • "ZIP" is "Zip code"
  • "INVSEQ" is "Inventor sequence number"

assignee.table:

"ASSIGNEE","ASSNAME","CNAME","CUSIP","OWN","PNAME","SNAME"
5,"AAA ASSOCIATES, INC.",,,,,
10,"AAA OFFICE COFFEE SERVICE, INC.",,,,,
15,"AAA PIPE CLEANING CORPORATION",,,,,
20,"AAA PRODUCTS INTERNATIONAL INC.",,,,,
25,"AAA SALES & ENGINEERING, INC.",,,,,

where

  • "ASSIGNEE" is "Assignee identifier"
  • "ASSNAME" is "Patent assignee name"
  • "CNAME" is "Compustat name"
  • "CUSIP" is "Compustat CUSIP"
  • "OWN" is "Percent of ownership"
  • "PNAME" is "Parent name"
  • "SNAME" is "Subsidiary name"

category.table:

"CAT","SUBCAT","SUBCATNAME","CATNAMESHORT","CATENAMELONG"
1,11,Agriculture & Food & Textiles,Chemical,Chemical
1,12,Coating,Chemical,Chemical
1,13,Gas,Chemical,Chemical
1,14,Organic Compounds,Chemical,Chemical
1,15,Resins,Chemical,Chemical

where

  • "CAT" is "Technological category"
  • "SUBCAT" is "Technological sub-category"
  • "SUBCATNAME" is "Sub-category name"
  • "CATNAMESHORT" is "Category short name"
  • "CATENAMELONG" is "Category long name"

class.table:

"CLASS","CNAME","SUBCAT","CAT"
1,** Classification Undetermined **,69,6
2,Apparel,63,6
4,Baths, Closets, Sinks, and Spittoons,65,6
5,Beds,65,6
7,Compound Tools,59,5

where

  • "CLASS" is "Class identifier"
  • "CNAME" is "Class name"
  • "SUBCAT" is "Technological sub-category"
  • "CAT" is "Technological category"
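
In the sample above, the class name "Baths, Closets, Sinks, and Spittoons" contains commas and is not quoted, so a plain comma split would yield too many fields. The following is a minimal sketch that assumes the file really looks like the sample: the first field is CLASS, the last two are SUBCAT and CAT, and everything in between is CNAME. The function name `parse_class_line` is hypothetical. If the actual file quotes such names, Python's standard `csv.reader` is enough and this workaround is unnecessary.

```python
def parse_class_line(line: str) -> dict:
    """Split one class.table data line into its four fields.

    Splits once from the left (CLASS) and twice from the right
    (SUBCAT, CAT), so commas inside CNAME are preserved.
    """
    class_id, rest = line.rstrip("\r\n").split(",", 1)
    cname, subcat, cat = rest.rsplit(",", 2)
    return {
        "CLASS": int(class_id),
        "CNAME": cname.strip().strip('"'),
        "SUBCAT": int(subcat),
        "CAT": int(cat),
    }


# The sample lines exactly as listed above.
SAMPLE_LINES = [
    '"CLASS","CNAME","SUBCAT","CAT"',
    "1,** Classification Undetermined **,69,6",
    "2,Apparel,63,6",
    "4,Baths, Closets, Sinks, and Spittoons,65,6",
    "5,Beds,65,6",
    "7,Compound Tools,59,5",
]

classes = [parse_class_line(line) for line in SAMPLE_LINES[1:]]  # skip header
class_to_cat = {row["CLASS"]: row["CAT"] for row in classes}
```

The resulting `class_to_cat` mapping is the piece needed to go from a patent's NCLASS to its technological category.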

citation.graph:

"CITING","CITED"
3858241,956203
3858241,1324234
3858241,3398406
3858241,3557384
3858241,3634889
3858242,1515701
3858242,3319261
3858242,3668705
3858242,3707004
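
Each line of citation.graph is a directed edge from the citing patent to the cited patent. As a rough illustration, the sketch below loads the sample edges and counts, for each patent, how often it is cited (in-degree) and how many citations it makes (out-degree). The helper name `load_edges` is an assumption for this example. Nothing else is assumed about the file beyond the two-column layout shown above.

```python
import csv
import io
from collections import Counter

# The sample edges exactly as listed above.
SAMPLE = '''"CITING","CITED"
3858241,956203
3858241,1324234
3858241,3398406
3858241,3557384
3858241,3634889
3858242,1515701
3858242,3319261
3858242,3668705
3858242,3707004
'''


def load_edges(text: str) -> list:
    """Return (citing, cited) pairs, skipping the header and blank lines."""
    reader = csv.reader(io.StringIO(text))
    next(reader)  # header row
    return [(int(row[0]), int(row[1])) for row in reader if row]


edges = load_edges(SAMPLE)

# In-degree: how many times each patent is cited.
times_cited = Counter(cited for _citing, cited in edges)
# Out-degree: how many citations each patent makes.
citations_made = Counter(citing for citing, _cited in edges)
```

For the full graph, the same counting can be done while streaming the file line by line, so the edge list does not have to be held in memory.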

Example Queries:

Query     Description
Query 01  Find all patents that have been cited more than 5 times.
Query 02  Find all patents that have been cited more than 5 times and belong to the Chemical category.
Query 03  Find all patents invented by influential inventors (inventors whose patents have been cited more than 10 times).
Query 04  Find all assignees that hold patents cited more than 5 times and that have a CUSIP.
Query 05  Find all assignees that hold patents cited by Chemical-category patents.
Query 06  Find the inventor with the most citations in the Chemical category.
Query 07  Compute the average number of citations for each inventor.
Query 08  Find all inventors who have been cited more often than the average.
Query 09  Find all inventors who have invented patents, each cited at least 3 times, in more than 3 categories.
Query 10  Find all inventors who have collaborated with influential inventors but are not themselves influential inventors.
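
Query 01 and Query 02 can be sketched as a citation count followed by a join from patent to class to category. This is one possible reading, not a reference implementation: it assumes that a patent's citation count is its number of incoming edges in citation.graph, and that a patent's category is found by looking up its NCLASS in class.table. The function names and all the toy data below (patent numbers 101 to 103 and class 900) are invented for illustration and do not come from the dataset. Only the fact that CAT 1 is "Chemical" is taken from the category.table sample.

```python
from collections import Counter

CHEMICAL_CAT = 1  # category.table sample: CAT 1 has short name "Chemical"


def cited_more_than(edges, threshold):
    """Patents whose in-degree in the citation graph exceeds threshold."""
    counts = Counter(cited for _citing, cited in edges)
    return {patent for patent, n in counts.items() if n > threshold}


def query_01(edges, threshold=5):
    """Query 01: patents cited more than `threshold` times."""
    return cited_more_than(edges, threshold)


def query_02(edges, patent_nclass, class_cat, threshold=5):
    """Query 02: Query 01 restricted to Chemical-category patents.

    patent_nclass maps PATENT -> NCLASS (from patent.table);
    class_cat maps CLASS -> CAT (from class.table).
    Patents with an unknown class are excluded.
    """
    return {
        patent
        for patent in cited_more_than(edges, threshold)
        if class_cat.get(patent_nclass.get(patent)) == CHEMICAL_CAT
    }


# Hypothetical toy data, NOT taken from the dataset.
edges = (
    [(200 + i, 101) for i in range(6)]    # patent 101 cited 6 times
    + [(300 + i, 102) for i in range(6)]  # patent 102 cited 6 times
    + [(400, 103)]                        # patent 103 cited once
)
patent_nclass = {101: 900, 102: 2, 103: 900}
class_cat = {900: 1, 2: 6}  # class 900 is a made-up Chemical class

frequently_cited = query_01(edges)
frequently_cited_chemical = query_02(edges, patent_nclass, class_cat)
```

The remaining queries follow the same pattern with additional joins, for example through inventor.table on PATENT for the inventor-level queries.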

Download Files: Patent.zip