How to remove duplicate publications in the family. ?

Tips and tricks on how to get the most out of Espacenet
Post Reply

sujith3g
Posts: 9
Joined: Fri Jul 10, 2015 10:32 am

How to remove duplicate publications in the family. ?

Post by sujith3g » Fri Jul 10, 2015 10:50 am

I have used http://ops.epo.org/3.1/rest-services/fa ... /EP1981983 to get the inpadoc family members. It gives me 10 results. But actually there are only 5 inpadoc family members listed espacenet(http://worldwide.espacenet.com/publicat ... cale=en_EP).

I have noticed that most of them are different publications of the same patent. So is there any query/argument can be used with ops family service to remove these duplicates ?

Thank you.


sujith3g
Posts: 9
Joined: Fri Jul 10, 2015 10:32 am

Re: How to remove duplicate publications in the family. ?

Post by sujith3g » Mon Jul 13, 2015 10:47 am

Can you explain ? Here http://ops.epo.org/3.1/rest-services/fa ... /EP1981983 all the 10 results have the same Family ID, So how do I remove the duplicate publications with same Family ID?


sujith3g
Posts: 9
Joined: Fri Jul 10, 2015 10:32 am

Re: How to remove duplicate publications in the family. ?

Post by sujith3g » Mon Jul 13, 2015 5:11 pm

I have checked the XML for EP1981983, http://ops.epo.org/3.1/rest-services/fa ... /EP1981983, And every result has the same Family ID:"39268929".
But I want to filter 5 results from these 10 patents.
In this case how do I filter it out ?
Is there any way to identify the duplicates if every result has same Family ID ?

This is the 5 family members for EP1981983 http://worldwide.espacenet.com/publicat ... cale=en_EP


EPO / OPS Support
Posts: 1298
Joined: Thu Feb 22, 2007 5:32 pm

Re: How to remove duplicate publications in the family. ?

Post by EPO / OPS Support » Tue Jul 14, 2015 8:09 am

Sorry, I need to apologise for misleading you with my answer, I only noticed it today that I was suppose to ask you to filter out Application reference DOC ID and not family ID.

So, if you have a look at the list you will see that all publications listed in Espacenet are also listed in OPS except that Espacenet has an ability to gather all publication steps for one application in one record (1 member) whereas OPS is only indicating to you via the Application reference DOC ID that certain publications belong to one and the same application. Bu

If you want to de duplicate records in OPS you need to write your script in such way that it will only present to you one document stage per application. Application reference DOC ID is unique information that can combine all documents resulting from that one single application:
family member (7):

Family ID :39268929
country:US
doc-number:2009197243
kind:A1
date:20090806

Application reference:Doc ID:58077725

family member (8):

Family ID :39268929
country:US
doc-number:8364409
kind:B2
date:20130129
Application reference:Doc ID :58077725
Espacenet uses this same information to provide you with what you see in their system. I hope this is useful and sorry again for me not noticing my mistake before.

Kind regards,

OPS support


sujith3g
Posts: 9
Joined: Fri Jul 10, 2015 10:32 am

Re: How to remove duplicate publications in the family. ?

Post by sujith3g » Tue Jul 14, 2015 8:52 am

Thanks, for your support.


sujith3g
Posts: 9
Joined: Fri Jul 10, 2015 10:32 am

Re: How to remove duplicate publications in the family. ?

Post by sujith3g » Tue Jul 14, 2015 1:04 pm

I have tried what you mentioned above in EP1981982, there are 12 results in ops family service http://ops.epo.org/3.1/rest-services/fa ... /EP1981982.

I have filtered the results with unique Application Reference ID, then I got 11 results.
But there are only 10 results listed in Espacenet http://worldwide.espacenet.com/publicat ... cale=en_EP.

Espacenet is not showing the 7th result obtained from OPS Family Service, But it has a unique Application Ref ID.

I think I'm still missing one more condition to filter the results given by OPS Family service.

Is there anything(other than Application_Reference_ID) to be considered while filtering out results obtained using OPS Family service ?

Thanks in advance.


look4answer
Posts: 20
Joined: Wed Jan 16, 2013 3:45 pm

Re: How to remove duplicate publications in the family. ?

Post by look4answer » Tue Jul 14, 2015 1:22 pm

I am not sure what you mean by removing them. If you look closely, even though they all have the same family ID, they are different publication.

(1) EP1981982A1
(2) EP1981982B1
(3) AT507301T
...

Therefore, they all are different information.


EPO / OPS Support
Posts: 1298
Joined: Thu Feb 22, 2007 5:32 pm

Re: How to remove duplicate publications in the family. ?

Post by EPO / OPS Support » Tue Jul 14, 2015 1:32 pm

Hi

This is a separate issue, nothing to do with de-duplicating,

OPS is giving you a family member no.7. which is only available in OPS database and not in Espacenet:
family member (7):


Family ID : 38089148
country:DE
doc-number:602007014183
kind:D1
date:20110609

Application reference:Doc ID :334021908
You need to distinguished between two databases you are trying to compare:
- Espacenet is a database that includes only real publications and therefore does not show any documents with kind codes such as D1 or A0 (those are announcements of filling).

- OPS family service takes data from Docdb database, which is our biblio master database and that one includes all available documents, also kind codes D1 and A0 est.

If you want a result list the same as in Espacenet (but please consider that Espacenet is no professional database and usually patent searches would only use it for quick first overview) you have to also filter our publications with kind codes such as D1 or A0 and others not available in Espacenet.

But for completes and correctness I would suggest that you rather use results from OPS family service/DOCDB instead.

To know which kind codes are not available in Espacenet look at the concordance list (first table) available here: http://www.epo.org/searching/data/data/ ... gular.html.

If the kind code has empty field under Epodoc column that means that such document will not be found in Espacenet.

Kind regards,

OPS support


sgattamaus
Posts: 1
Joined: Fri Oct 23, 2015 8:44 am

Re: How to remove duplicate publications in the family. ?

Post by sgattamaus » Fri Oct 23, 2015 8:52 am

Hi.

Question:

Is there a way in Espacenet to make an "applicant search" and retrieve ONLY ONE document from a patent family (to avoid atons of hits....)?
Or, in general, how can we filter out from a list of result those which have the same priority or are a national phase of the same PCT?

Regards,

MC


EPO / OPS Support
Posts: 1298
Joined: Thu Feb 22, 2007 5:32 pm

Re: How to remove duplicate publications in the family. ?

Post by EPO / OPS Support » Tue Oct 27, 2015 9:32 am

Hi,

As this is Espacenet related question we have moved it into Espacenet forum. Our colleagues will reply as soon as they can.

OPS support


Post Reply