Thanks for your help.
I adjusted the query to the following code:
Code: Select all
SELECT DISTINCT *
FROM tls201_appln a
JOIN tls207_pers_appln pa on a.appln_id = pa.appln_id
JOIN tls206_person p on pa.person_id = p.person_id
WHERE a.appln_id < 900000000 -- exclude artificial applications (see PATSTAT Data Catalog for details)
AND appln_filing_year = 2013
AND applt_seq_nr > 0 -- consider only applicants
AND psn_name like 'Siemens%' -- do not require a blank after SIEMENS, otherwise you would miss the plain name "SIEMENS"
AND granted=1
Order by psn_name
Like you mentioned in your last post, I get some unexpected results.
(e.g. Siemens, SIEMENS SHANGHAI MEDICAL EQUIPMENT, SIEMENS SCHWEIZ etc.)
Now I should be able to spot outliners and delete them.
I have two more questions:
First: Why do I get different results using this two search requests?
First request:
Code: Select all
SELECT DISTINCT *
FROM tls201_appln a
JOIN tls207_pers_appln pa on a.appln_id = pa.appln_id
JOIN tls206_person p on pa.person_id = p.person_id
WHERE a.appln_id < 900000000 -- exclude artificial applications (see PATSTAT Data Catalog for details)
AND appln_filing_year = 2013
AND applt_seq_nr > 0 -- consider only applicants
AND psn_name like 'Siemens%' -- do not require a blank after SIEMENS, otherwise you would miss the plain name "SIEMENS"
AND granted=1
Order by psn_name
Result: 2279
Second request:
Code: Select all
SELECT DISTINCT a.*
FROM tls201_appln a
JOIN tls207_pers_appln pa on a.appln_id = pa.appln_id
JOIN tls206_person p on pa.person_id = p.person_id
WHERE a.appln_id < 900000000 -- exclude artificial applications (see PATSTAT Data Catalog for details)
AND appln_filing_year = 2013
AND applt_seq_nr > 0 -- consider only applicants
AND psn_name like 'Siemens%' -- do not require a blank after SIEMENS, otherwise you would miss the plain name "SIEMENS"
AND granted=1
Result: 2222
According to the first request I select all columns and with the second request I select all columns of table "tls201_appln".
Why is there a difference in the two results when I only search for "Siemens" in the column psn_name?
Second: What happens when there is more than one applicant per application?
Are these patents counted twice?
I think after these questions everything should be clarified.
Thanks for you help! The support of the EPO is awesome.