affiliation_parser
Simple Python parser for MEDLINE and PubMed OA affiliation strings
Actually, I cannot find anything that imports nltk. Can we remove this requirement?
```python
from affiliation_parser import parse_affil
affil_text = "Department of gynecology and obstetrics, university hospital of South Reunion Island, BP 350, 97448 Saint-Pierre cedex, Reunion; Faculty of medicine, university of Reunion,...
```
```python
from affiliation_parser import parse_affil
parse_affil("School of Humanities and Social Science, The Chinese University of Hong Kong, Shenzhen, Longgang District, Shenzhen, P. R. China.")
{'full_text': 'School of Humanities and Social...
```
An example: 2 Department of Pediatrics, Med. Fac. Semmelweis Univ., Tüzoltó u. 7, H-1094 Budapest, Hungary. [email protected] https://semmelweis.hu/english/faculties/medicine/departments-aok/2nd-department-of-paediatrics/
According to NCBI's documentation, the email is always the last element of an affiliation string. Splitting on ' ' and checking whether the last token is an email could give a more accurate result...
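A minimal sketch of that suggestion, assuming the email (when present) is the final space-separated token. The function name `split_trailing_email`, the simplified email pattern, and the sample address are hypothetical and are not part of affiliation_parser:

```python
import re

# Deliberately simple pattern for illustration; not a full RFC 5322 validator.
EMAIL_RE = re.compile(r"^[\w.+-]+@[\w-]+(\.[\w-]+)+$")


def split_trailing_email(affil_text):
    """Return (affiliation_without_email, email).

    The email is "" when the last space-separated token
    does not look like an email address.
    """
    text = affil_text.strip()
    tokens = text.split(" ")
    # Affiliation strings often end with a period after the address.
    last = tokens[-1].rstrip(".")
    if EMAIL_RE.match(last):
        return " ".join(tokens[:-1]).strip(), last
    return text, ""


# Hypothetical input; the address is made up for the example.
affil, email = split_trailing_email(
    "Department of Pediatrics, Semmelweis Univ., Budapest, Hungary. jane.doe@example.org."
)
print(affil)
print(email)
```

A string that ends in something other than an email, such as a URL, would not match this check and would need separate handling.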
I created a set of affiliation strings from PubMed abstracts which all include 'Harvard' (University, medical school, etc.) and ran them through match_affil. After downloading the most recent grid.csv dataset...
Why are the lines `affil_text = re.sub('2 ', ' ', affil_text)` `affil_text = re.sub('2. ', ' ', affil_text)` present? They create incorrect zip code results with an affiliation string...
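A small reproduction of the problem, using only the two quoted `re.sub` calls. The sample strings are made up. The anchored alternative rests on an assumption that the original intent was to strip a leading footnote number such as "2 " or "2. "; the helper names are hypothetical:

```python
import re


def strip_marker_original(affil_text):
    """Apply the two substitutions quoted above, unchanged."""
    affil_text = re.sub('2 ', ' ', affil_text)
    # The unescaped '.' matches any character, not just a literal period.
    affil_text = re.sub('2. ', ' ', affil_text)
    return affil_text


def strip_leading_marker(affil_text):
    """Assumed intent: drop only a leading number marker like '2 ' or '2. '."""
    return re.sub(r'^\s*\d+\.?\s+', '', affil_text)


sample = "2 Department of Pediatrics, Cambridge, MA 02142 USA"
print(strip_marker_original(sample))            # the zip code loses its final '2'
print(strip_marker_original("H-1024 Budapest"))  # '24 ' is swallowed by '2. '
print(strip_leading_marker(sample))             # the zip code is left intact
```

Anchoring the pattern at the start of the string keeps digits elsewhere in the affiliation, including postal codes, untouched.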
When `distance > 0.1`, the following line is executed: `dist = affiliation_check(grid_data[i], affil)` However, `grid_data` is not defined. In fact, it is not mentioned anywhere else in the file. I...
I'm trying to import both parse_affil and match_affil and it shows the following error: ` File "C:\Users\Mine\Anaconda3\lib\subprocess.py", line 1224, in _execute_child startupinfo) FileNotFoundError: [WinError 2] The system cannot find the file...