extraction-framework icon indicating copy to clipboard operation
extraction-framework copied to clipboard

The dbo:spouse / dbp:spouse information should be extracted as an array

Open pkleef opened this issue 4 years ago • 3 comments

Issue validity

See: http://dief.tools.dbpedia.org/server/extraction/en/extract?title=Joe+Biden&revid=&format=trix&extractors=custom and http://dbpedia.org/resource/Joe_Biden

Error Description

Looking at http://dbpedia.org/resource/Joe_Biden we can see several bad triple patterns:

dbo:spouse
    dbr:Jill_Biden
    dbr:1972_United_States_Senate_election_in_Delaware
    dbr:Neilia_Hunter_Biden
    
dbp:spouse
    1966-08-27 (xsd:date)
    1972-12-18 (xsd:date)
    1977-06-17 (xsd:date)
    dbr:Jill_Biden
    dbr:Neilia_Hunter_Biden(en)
    died (en)

It looks like the extractor cartridge for Person does not parse the spouse information as an array.

Also the dbr:1972_United_States_Senate_election_in_Delaware also indicates bad parsing.

Pinpointing the source of the error

Details

I believe the code should be changed to use the same pattern as for the dbo:termPeriod e.g.

dbo:spouse 
     dbr:Joe_Biden__Spouse__1
     dbr:Joe_Biden__Spouse__2
     dbr:Joe_Biden__Spouse__3
      

pkleef avatar Sep 21 '21 12:09 pkleef