count forward citations

Here you can post your opinions, ask questions and share experiences on the PATSTAT product line. Please always indicate the PATSTAT edition (e.g. 2015 Autumn Edition) and the database (e.g. PATSTAT Online, MySQL, MS SQL Server, ...) you are using.
Post Reply

a497652958
Posts: 8
Joined: Tue Jun 26, 2018 6:39 pm

count forward citations

Post by a497652958 » Thu Nov 11, 2021 11:19 am

hi,

I am trying to the count total number of forward citations of US patents that have certain IPC code. Then I group the result by publication year. Here is my code:

Code: Select all

select count (distinct(a212.pat_publn_id)) as number , year(a211.publn_date) AS date

from patstat2021a.dbo.tls201_appln a1


join patstat2021a.dbo.tls209_appln_ipc a209 on a1.appln_id = a209.appln_id
join patstat2021a.dbo.tls207_pers_appln a207 on a1.appln_id = a207.appln_id 
join patstat2021a.dbo.tls206_person a206 on a207.person_id = a206.person_id

join patstat2021a.dbo.tls211_pat_publn a211 on a211.appln_id = a1.appln_id
inner join patstat2021a.dbo.tls212_citation a212 on a211.pat_publn_id = a212.cited_pat_publn_id


where (a209.ipc_class_symbol like 'G06F%17/20%' 
or a209.ipc_class_symbol like 'G06F%17/21%'
or a209.ipc_class_symbol like 'G06F%17/22%'
or a209.ipc_class_symbol like 'G06F%17/23%'
or a209.ipc_class_symbol like 'G06F%17/24%'
or a209.ipc_class_symbol like 'G06F%17/25%'
or a209.ipc_class_symbol like 'G06F%17/26%'
or a209.ipc_class_symbol like 'G06F%17/27%'
or a209.ipc_class_symbol like 'G06F%17/28%')
or a209.ipc_class_symbol like 'G06K%9%'
or a209.ipc_class_symbol like 'G06T%7%'
or a209.ipc_class_symbol like 'G10L%13/027%'
or a209.ipc_class_symbol like 'G10L%15%'
or a209.ipc_class_symbol like 'G10L%17%'
or a209.ipc_class_symbol like 'G10L%25/66%'
or a209.ipc_class_symbol like 'G06F%15/18%')


and year(a211.publn_date) between 2000 and 2020
and a1.granted= 'Y' 
and a211.publn_first_grant= 'Y'

and a1.appln_auth='US'



and a206.person_ctry_code ='US'
and a1.ipr_type = 'PI' 
and a212.cited_pat_publn_id <> 0

group by year(a211.publn_date)
order by date
The displayed result significantly larger than I expected, and I am wondering if it is because my code is wrong. Could you please check my code?

Many thanks,
Muyang


EPO / PATSTAT Support
Posts: 440
Joined: Thu Feb 22, 2007 5:33 pm
Contact:

Re: count forward citations

Post by EPO / PATSTAT Support » Thu Nov 11, 2021 12:16 pm

Hello Muyang,
as far as I can see, your query is correct, apart from a mistake in one of the ipc_class_symbol and a bracket that is not needed. Observe that I removed the embedded wild card in the classification codes;it can lead to errors because of the way the IPC classification hierarchy is formed. The number of characters before the / should always be 8, and therefore blanks should be added as illustrated below.

When doing this kind of analysis, one can always write a query to double check one specific application in detail (or publication in your case), and see if the numbers add up.
Here is an example of your query, but just looking at 1 publication:

Code: Select all

select   count (distinct(a212.pat_publn_id)) as number , year(a211.publn_date) AS date
from patstat2021a.dbo.tls201_appln a1
join patstat2021a.dbo.tls209_appln_ipc a209 on a1.appln_id = a209.appln_id
join patstat2021a.dbo.tls207_pers_appln a207 on a1.appln_id = a207.appln_id 
join patstat2021a.dbo.tls206_person a206 on a207.person_id = a206.person_id
join patstat2021a.dbo.tls211_pat_publn a211 on a211.appln_id = a1.appln_id
join patstat2021a.dbo.tls212_citation a212 on a211.pat_publn_id = a212.cited_pat_publn_id
where (a209.ipc_class_symbol like 'G06F  17/20%' 
or a209.ipc_class_symbol like 'G06F  17/21%'
or a209.ipc_class_symbol like 'G06F  17/22%'
or a209.ipc_class_symbol like 'G06F  17/23%'
or a209.ipc_class_symbol like 'G06F  17/24%'
or a209.ipc_class_symbol like 'G06F  17/25%'
or a209.ipc_class_symbol like 'G06F  17/26%'
or a209.ipc_class_symbol like 'G06F  17/27%'
or a209.ipc_class_symbol like 'G06F  17/28%'
or a209.ipc_class_symbol like 'G06K  9%'
or a209.ipc_class_symbol like 'G06T  7%'
or a209.ipc_class_symbol like 'G10L  13/027%'
or a209.ipc_class_symbol like 'G10L  15%'
or a209.ipc_class_symbol like 'G10L  17%'
or a209.ipc_class_symbol like 'G10L  25/66%'
or a209.ipc_class_symbol like 'G06F  15/18%')
and year(a211.publn_date) between 2000 and 2020
and a1.granted= 'Y' 
and a211.publn_first_grant= 'Y'
and a1.appln_auth='US' 
and a206.person_ctry_code ='US'
and a1.ipr_type = 'PI' 
and a212.cited_pat_publn_id <> 0
and publn_nr = '7437338' and publn_auth = 'US' and publn_kind = 'B1'
group by year(a211.publn_date)
order by date
The result is 30, meaning that the US publication (B1) was cited 30 times.
If you now double check in ESPACENET, you will see that there are only 20 forward citations.
https://worldwide.espacenet.com/patent/ ... S7437338B1
But ESPACENET only shows 1 record per family, so 20 citing families.
We can then double check this by querying a list with all the citing publications and add the citing (docdb) family.

Code: Select all

SELECT DISTINCT  
cited.pat_publn_id
,cited.publn_auth +cited.publn_nr+ ' '+cited.publn_kind cited_pub
,cited.publn_date pubilcation_date_cited
,citing.publn_auth + citing.publn_nr +' '+ citing.publn_kind citing_pub
,citing.publn_date date_citing
,citing_app.docdb_family_id

FROM tls211_pat_publn as citing 
JOIN tls212_citation as j ON citing.pat_publn_id = j.pat_publn_id
JOIN tls211_pat_publn AS cited ON j.cited_pat_publn_id = cited.pat_publn_id
join tls201_appln citing_app on citing_app.appln_id = citing.appln_id
WHERE 
cited.publn_auth = 'US' and cited.publn_nr = '7437338' and cited.publn_kind = 'B1'
order by cited.pat_publn_id, citing.publn_date
The result is 31 citing publications, but one has the publication year 2021 (US10965622 B2--> 2021-03-30) which is outside the range you specified. If you then count the number of distinct families you come up to 20, which is exactly the same as what we see in Espacenet.

Use the same approach to double check a couple of publication, and you hopefully comes to similar results.
PATSTAT Support Team
EPO - Vienna
patstat @ epo.org


a497652958
Posts: 8
Joined: Tue Jun 26, 2018 6:39 pm

Re: count forward citations

Post by a497652958 » Fri Nov 12, 2021 2:25 am

Thank you very much


EPO / PATSTAT Support
Posts: 440
Joined: Thu Feb 22, 2007 5:33 pm
Contact:

Re: count forward citations

Post by EPO / PATSTAT Support » Fri Nov 12, 2021 4:05 pm

Keep in mind that your query ONLY looks at forward citations linked to the publication of the grant - the so called B document-. Examiner praxis is to rather cite the earliest publication, which is the A document (or even a priority filing). Researchers who use citations to measure for example technology flows will normally use family-family citations in order to eliminate examiner bias and double counting.
PATSTAT Support Team
EPO - Vienna
patstat @ epo.org


a497652958
Posts: 8
Joined: Tue Jun 26, 2018 6:39 pm

Re: count forward citations

Post by a497652958 » Mon Nov 15, 2021 1:36 pm

Hi,

Thanks for helping out.

Actually, would you please show me a sample code for counting the family-family citations?


EPO / PATSTAT Support
Posts: 440
Joined: Thu Feb 22, 2007 5:33 pm
Contact:

Re: count forward citations

Post by EPO / PATSTAT Support » Mon Nov 15, 2021 4:22 pm

Family-family citations count is a pre-aggregated value in the tls201_appn table with the name: nb_citing_docdb_fam . So you don't need to do any calculations.
This is the query that shows you the number for the same application as above.

Code: Select all

SELECT app.appln_auth, app.appln_nr
,app.appln_kind, app.appln_filing_date
,app.nb_citing_docdb_fam
FROM tls201_appln app
join tls211_pat_publn pub on app.appln_id =pub.appln_id
WHERE
publn_auth = 'US' and publn_nr = '7437338'
or more to what your selection was based on:

Code: Select all

SELECT distinct app.docdb_family_id,app.appln_auth, app.appln_nr
,app.appln_kind, app.appln_filing_date, app.granted
,app.nb_citing_docdb_fam
FROM tls201_appln app
join tls211_pat_publn pub on app.appln_id =pub.appln_id
join tls209_appln_ipc a209 on app.appln_id = a209.appln_id
where (a209.ipc_class_symbol like 'G06F  17/20%' 
or a209.ipc_class_symbol like 'G06F  17/21%'
or a209.ipc_class_symbol like 'G06F  17/22%'
or a209.ipc_class_symbol like 'G06F  17/23%'
or a209.ipc_class_symbol like 'G06F  17/24%'
or a209.ipc_class_symbol like 'G06F  17/25%'
or a209.ipc_class_symbol like 'G06F  17/26%'
or a209.ipc_class_symbol like 'G06F  17/27%'
or a209.ipc_class_symbol like 'G06F  17/28%'
or a209.ipc_class_symbol like 'G06K  9%'
or a209.ipc_class_symbol like 'G06T  7%'
or a209.ipc_class_symbol like 'G10L  13/027%'
or a209.ipc_class_symbol like 'G10L  15%'
or a209.ipc_class_symbol like 'G10L  17%'
or a209.ipc_class_symbol like 'G10L  25/66%'
or a209.ipc_class_symbol like 'G06F  15/18%')
and year(pub.publn_date) between 2000 and 2020
and app.granted= 'Y' 
and app.appln_auth='US' 
and app.ipr_type = 'PI' 
order by app.docdb_family_id
PATSTAT Support Team
EPO - Vienna
patstat @ epo.org


a497652958
Posts: 8
Joined: Tue Jun 26, 2018 6:39 pm

Re: count forward citations

Post by a497652958 » Tue Nov 16, 2021 5:49 am

Thanks!!


Post Reply