I'm trying to get all patents where the tile and abstract (both in English) as well as any kind of citations and IPC codes are available.
Currently I'm trying to get this data through linked-data's SPARQL from the available samples and other resources I constructed something like this:
Code: Select all
prefix cpc: <http://data.epo.org/linked-data/def/cpc/>
prefix dcterms: <http://purl.org/dc/terms/>
prefix ipc: <http://data.epo.org/linked-data/def/ipc/>
prefix mads: <http://www.loc.gov/standards/mads/rdf/v1.rdf>
prefix owl: <http://www.w3.org/2002/07/owl#>
prefix patent: <http://data.epo.org/linked-data/def/patent/>
prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#>
prefix skos: <http://www.w3.org/2004/02/skos/core#>
prefix st3: <http://data.epo.org/linked-data/def/st3/>
prefix text: <http://jena.apache.org/text#>
prefix vcard: <http://www.w3.org/2006/vcard/ns#>
prefix xsd: <http://www.w3.org/2001/XMLSchema#>
SELECT DISTINCT * {
?publication rdf:type patent:Publication ;
patent:titleOfInvention ?title ;
dcterms:abstract ?abstract ;
patent:classificationIPCInventive ?ipc;
(patent:citesPatentPublication+ | patent:citesPatentPublication*/patent:citationNPL) ?citedDocument
.
} LIMIT 10
I believe this should be solvable through GROUP BY, group_concat or the like, however if I try any of that, the request just times out. No error.
I noticed that in json view (which I prefer), the title comes with an xml:lang tag, and in text view has the format "text"@en, but I was unable to achieve any results with these either.