SimonR
SimonR
I created a set of affiliation strings from Pubmed abstracts which all include 'Harvard' (University, medical school, etc) and ran them through match_affil, After downloading the most recent grid.csv dataset...
Why are the lines `affil_text = re.sub('2 ', ' ', affil_text)` `affil_text = re.sub('2. ', ' ', affil_text)` present ? They create incorrect zip code results with an afiiliation string...
I've moved our tagging server from a Solr 6.5.1 instance running the SolrTextTagger code on github to the built-in tagger handler in Solr 7.4.0. The metrics we collect for bulk...