ChemDataExtractor issues

Upgrade to Python > 3.6?

5

I know this package has not been updated for some time; however, is there any way to install it on more recent Python versions?

giordan12

online demo hangs (left running for an hour or more)

1

http://chemdataextractor.org/results/abb704de-ca52-4bc4-973b-d34ee8f1407a

sgbaird

Code for Demo Files

I have been working with this library to extract chemical information from HTML pages. I followed http://chemdataextractor.org/demo and saved https://pubs.rsc.org/en/content/articlelanding/2015/TC/C5TC02626A as an HTML file (input3.html). Below is my code. with open('input/input3.html',...

tarinidash

How to create a custom parser to get entities in a given text?

1

Can anyone let me know how to write a custom parser to fetch a chemical molecule name with its constituent details in the desired format [Chemical name + addition : Constituents],[Chemical name +...

shailavij

Unable to read in xml file

3

[USpatenttest.xml.zip](https://github.com/mcs07/ChemDataExtractor/files/5243401/USpatenttest.xml.zip) I am having trouble reading in this XML file with the generic XMLReader. It was downloaded from the WIPO PATENTSCOPE site. I run: `from chemdataextractor import Document` `f = open('USpatenttest.xml', 'rb')` `doc=Document.from_file(f)`...

sophiatabchouri

Extracting entities inside an entity

4

Does anyone know how to write a custom parser to extract a named entity inside an entity? For example, from the following sentence I want to extract 'boiling' which will...

gihanpanapitiya

Kindly share the sample/example that is hosted at http://chemdataextractor.org/demo

1

Kindly share or provide the sample that is showcased at http://chemdataextractor.org/demo. Thanks in advance.

nnarahari-tech

https breaks NlmXmlReader

In the NlmXmlReader class

```python
def detect(self, fstring, fname=None):
    """"""
    if fname and not (fname.endswith('.xml') or fname.endswith('.nxml')):
        return False
    if b'xmlns="http://jats.nlm.nih.gov/ns/archiving' in fstring:
        return True
    if b'JATS-archivearticle1.dtd' in fstring:
        return...
```
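The excerpt above only tests for the `http://` form of the JATS namespace, so a file declaring the same namespace with `https://` would presumably fall through that check. A minimal standalone sketch of a scheme-agnostic test follows. The function name `looks_like_jats` and the regular expression are assumptions made for illustration; this is not the library's code, and it covers only the namespace check, not the remaining checks that are cut off in the excerpt.

```python
import re

# Hypothetical pattern (not from the library): accept either http:// or
# https:// in front of the JATS archiving namespace.
_JATS_NS = re.compile(rb'xmlns="https?://jats\.nlm\.nih\.gov/ns/archiving')


def looks_like_jats(fstring, fname=None):
    """Return True if the raw bytes declare the JATS archiving namespace."""
    # Same filename filter as in the excerpt: only .xml and .nxml are considered.
    if fname and not fname.endswith(('.xml', '.nxml')):
        return False
    return bool(_JATS_NS.search(fstring))
```

With this sketch, `looks_like_jats(b'<article xmlns="https://jats.nlm.nih.gov/ns/archiving/1.0/">', 'paper.nxml')` is accepted in the same way as the `http://` variant.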

maddenfederico

Download too slow almost 0

4

China's access to the Internet is too slow

user-agent-eng

Regex expression of \S* is not recognized.

I am trying to create a custom parser to extract the boiling points from the following texts, so that the text between "boiling point" and "of" is optional.

```
Paragraph(u'The...
```
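For comparison, the "optional text between two anchors" idea can be sketched with Python's standard `re` module applied to a whole sentence. This is independent of ChemDataExtractor's parser elements and is not a fix for the issue itself. The function name, the pattern, and the sample sentences are all hypothetical, and the sketch assumes the value is written as a number followed by °C.

```python
import re

# Hypothetical pattern: "boiling point", then zero or more whitespace-separated
# words (matched lazily), then "of", a number, and a °C unit.
_BP = re.compile(
    r'boiling point(?:\s+\S+)*?\s+of\s+(-?\d+(?:\.\d+)?)\s*°\s*C'
)


def extract_boiling_point(sentence):
    """Return the boiling point in °C as a float, or None if nothing matches."""
    match = _BP.search(sentence)
    return float(match.group(1)) if match else None
```

Because the repeated group is lazy, the pattern first tries "boiling point of ..." with nothing in between and only then consumes intervening words one at a time, so both `'The boiling point of 78.4 °C was recorded.'` and `'The boiling point (measured at 1 atm) of 100 °C is well known.'` yield a value.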

gihanpanapitiya
