python-wordsegment icon indicating copy to clipboard operation
python-wordsegment copied to clipboard

RecursionError on segment call

Open irmo322 opened this issue 4 years ago • 6 comments

Hi,

I'm having trouble with following code:

import wordsegment

wordsegment.load()
text = "The article went on to say, “For in the pizza shops rich and poor harmoniously congregate; they are the only places where the members of Neapolitan aristocracy—far haughtier than those of any other part of Italy—may be seen (eating) their favorite delicacy side by side with their own coachmen and valets and barbers.”"
wordsegment.segment(text)

It fails with a RecursionError. RecursionError: maximum recursion depth exceeded while calling a Python object

I'm using python 3.8 on ubuntu 20.04.

irmo322 avatar May 31 '21 10:05 irmo322

Can you share more of the trace back?

grantjenks avatar May 31 '21 15:05 grantjenks

Trace back error : error.txt

irmo322 avatar May 31 '21 16:05 irmo322

Works for me:

$ ipython
Python 3.9.1 (v3.9.1:1e5d33e9b9, Dec  7 2020, 12:10:52) 
Type 'copyright', 'credits' or 'license' for more information
IPython 7.24.0 -- An enhanced Interactive Python. Type '?' for help.

In [1]: import wordsegment

In [2]: wordsegment.load()

In [3]: text = "The article went on to say, “For in the pizza shops rich and poor harmoniously congregate; they are the only places where 
   ...: the members of Neapolitan aristocracy—far haughtier than those of any other part of Italy—may be seen (eating) their favorite deli
   ...: cacy side by side with their own coachmen and valets and barbers.”"
   ...: 

In [4]: wordsegment.segment(text)
Out[4]: 
['the',
 'article',
 'went',
 'on',
 'to',
 'say',
 'for',
 'in',
 'the',
 'pizza',
 'shops',
 'rich',
 'and',
 'poor',
 'harmoniously',
 'congregate',
 'they',
 'are',
 'the',
 'only',
 'places',
 'where',
 'the',
 'members',
 'of',
 'neapolitan',
 'aristocracy',
 'far',
 'haugh',
 'tier',
 'than',
 'those',
 'of',
 'any',
 'other',
 'part',
 'of',
 'italy',
 'may',
 'be',
 'seen',
 'eating',
 'their',
 'favorite',
 'delicacy',
 'side',
 'by',
 'side',
 'with',
 'their',
 'own',
 'coachmen',
 'and',
 'valets',
 'and',
 'barbers']

grantjenks avatar Jun 01 '21 00:06 grantjenks

Weird, it doesn't work for me in Python 3.8.

grantjenks avatar Jun 01 '21 00:06 grantjenks

See the PR. If I set the CHUNK_SIZE to 200 then it works for me in 3.8

grantjenks avatar Jun 01 '21 00:06 grantjenks

Thank you for the support :)

irmo322 avatar Jun 01 '21 23:06 irmo322