extraction-framework
The dbo:spouse / dbp:spouse information should be extracted as an array
Issue validity
See: http://dief.tools.dbpedia.org/server/extraction/en/extract?title=Joe+Biden&revid=&format=trix&extractors=custom and http://dbpedia.org/resource/Joe_Biden
Error Description
Looking at http://dbpedia.org/resource/Joe_Biden we can see several bad triple patterns:
dbo:spouse
    dbr:Jill_Biden
    dbr:1972_United_States_Senate_election_in_Delaware
    dbr:Neilia_Hunter_Biden
dbp:spouse
    1966-08-27 (xsd:date)
    1972-12-18 (xsd:date)
    1977-06-17 (xsd:date)
    dbr:Jill_Biden
    dbr:Neilia_Hunter_Biden(en)
    died (en)
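It looks like the extractor cartridge for Person does not parse the spouse information as an array: the dates, the partners, and the "died" literal all end up as flat, unrelated values of dbp:spouse, so it is no longer clear which date belongs to which spouse.
The presence of dbr:1972_United_States_Senate_election_in_Delaware among the dbo:spouse values also indicates bad parsing.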
Pinpointing the source of the error
Details
I believe the code should be changed to use the same pattern as for dbo:termPeriod, e.g.:
dbo:spouse
    dbr:Joe_Biden__Spouse__1
    dbr:Joe_Biden__Spouse__2
    dbr:Joe_Biden__Spouse__3
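
To make the proposal a bit more concrete, here is a minimal Python sketch of the intermediate-node pattern. It is not the extraction framework's code (which is written in Scala), and it assumes the spouse entries have already been parsed into separate records. The `SpouseEntry` type, the `spouse_triples` function, and the `ex:partner`, `ex:startDate` and `ex:endDate` property names are all hypothetical placeholders; the real ontology properties would have to be chosen by the maintainers.

```python
from typing import List, NamedTuple, Optional, Tuple

Triple = Tuple[str, str, str]


class SpouseEntry(NamedTuple):
    """One parsed spouse record (hypothetical structure, not a framework type)."""
    partner: str                  # local name of the partner resource
    start: Optional[str] = None   # ISO date the marriage began, if known
    end: Optional[str] = None     # ISO date the marriage ended, if known


def spouse_triples(subject: str, entries: List[SpouseEntry]) -> List[Triple]:
    """Emit one intermediate node per spouse entry instead of flat values."""
    triples: List[Triple] = []
    for index, entry in enumerate(entries, start=1):
        # Same naming scheme as in the proposal: <Subject>__Spouse__<n>
        node = f"dbr:{subject}__Spouse__{index}"
        triples.append((f"dbr:{subject}", "dbo:spouse", node))
        # The ex: properties below are placeholders, not real DBpedia properties.
        triples.append((node, "ex:partner", f"dbr:{entry.partner}"))
        if entry.start is not None:
            triples.append((node, "ex:startDate", entry.start))
        if entry.end is not None:
            triples.append((node, "ex:endDate", entry.end))
    return triples


# Illustrative input only, built from the values visible on the resource page.
entries = [
    SpouseEntry("Neilia_Hunter_Biden", start="1966-08-27", end="1972-12-18"),
    SpouseEntry("Jill_Biden", start="1977-06-17"),
]
triples = spouse_triples("Joe_Biden", entries)
for s, p, o in triples:
    print(s, p, o)
```

With this shape each date stays attached to the spouse it belongs to, and an unrelated link such as the 1972 election page has no slot to land in unless the parser explicitly puts it there.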