different number country-specific applications
Posted: Mon Jun 14, 2021 11:18 am
Hi,
I previously worked with PATSTAT 2016 and have recently switched to the latest version, PATSTAT Spring 2021. I only extract data for patent applications filed by applicants with Swedish address.
When I compared data extracted from PATSTAT 2016 and from PATSTAT 2021 for some overlapping years (2000-2015), I noticed that there are some observations that were previously identified as Swedish (i.e., PERSON_CTRY_CODE=="SE" from TLS206) have any other country code but not Swedish in PATSTAT 2021. The number of such applications, however, is relatively low, appr. 4580 of 288386, or 1,6%.
I tried to investigate the issue and found two reasons why that might occur:
1. there was a Swedish applicant when the application was filed but there is a new applicant with an address in another country in the latest application. For example, in the case of change in patent ownership;
2. it seems that in PATSTAT 2016, one application could have multiple observations. For example, for appln_nr==10530001, there are at least two different appln_id and appln_nr_epodoc. appln_nr_epodoc are almost identical but one has the letter "D" at the end. It seems that in PATSTAT 2021, there are no such duplicates, i.e., there is only one unique applications id for one real patent application.
Do my findings make sense? Are there any other reasons why the number of "Swedish applications" in PATSTAT 2016 might be larger than in PATSTAT 2021 or why they don't match exactly? Is there anything else I should be aware of?
I previously worked with PATSTAT 2016 and have recently switched to the latest version, PATSTAT Spring 2021. I only extract data for patent applications filed by applicants with Swedish address.
When I compared data extracted from PATSTAT 2016 and from PATSTAT 2021 for some overlapping years (2000-2015), I noticed that there are some observations that were previously identified as Swedish (i.e., PERSON_CTRY_CODE=="SE" from TLS206) have any other country code but not Swedish in PATSTAT 2021. The number of such applications, however, is relatively low, appr. 4580 of 288386, or 1,6%.
I tried to investigate the issue and found two reasons why that might occur:
1. there was a Swedish applicant when the application was filed but there is a new applicant with an address in another country in the latest application. For example, in the case of change in patent ownership;
2. it seems that in PATSTAT 2016, one application could have multiple observations. For example, for appln_nr==10530001, there are at least two different appln_id and appln_nr_epodoc. appln_nr_epodoc are almost identical but one has the letter "D" at the end. It seems that in PATSTAT 2021, there are no such duplicates, i.e., there is only one unique applications id for one real patent application.
Do my findings make sense? Are there any other reasons why the number of "Swedish applications" in PATSTAT 2016 might be larger than in PATSTAT 2021 or why they don't match exactly? Is there anything else I should be aware of?