
Non-breaking space character (\u00A0) causes AssertionError

Open lacymorrow opened this issue 4 years ago • 2 comments

Here is the problem string: Chatbot\u00a0\u2013

Traceback (most recent call last):
  File "<console>", line 5, in <module>
  File "/usr/local/lib/python3.6/site-packages/budou/parser.py", line 78, in parse
    chunks = self.segmenter.segment(source, language)
  File "/usr/local/lib/python3.6/site-packages/budou/tinysegmentersegmenter.py", line 94, in segment
    assert source[seek] == ' '
AssertionError

https://github.com/google/budou/blob/87d9b81bdd21d1a41436df140e1bc08d817119a3/budou/tinysegmentersegmenter.py#L94
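For anyone hitting the same error, here is a minimal sketch of one possible workaround, not a fix inside budou itself. It assumes the failure comes from the assertion comparing the character against a literal ASCII space, which U+00A0 does not match even though Python treats it as whitespace. The helper `normalize_spaces` is a hypothetical name, not part of budou's API, and replacing the character discards the non-breaking behaviour in the output text.

```python
# Hypothetical helper (not part of budou): swap non-breaking spaces for
# ordinary spaces before handing the text to the parser.
NBSP = "\u00a0"


def normalize_spaces(source: str) -> str:
    """Return source with every U+00A0 replaced by an ASCII space."""
    return source.replace(NBSP, " ")


problem = "Chatbot\u00a0\u2013"
seek = len("Chatbot")  # index of the non-breaking space

# The check from the traceback compares against a literal ' '.
print(problem[seek] == " ")     # False: U+00A0 is not U+0020
print(problem[seek].isspace())  # True: Python still counts it as whitespace

cleaned = normalize_spaces(problem)
print(cleaned[seek] == " ")     # True after normalization
```

The cleaned string can then be passed to the parser in place of the original. Whether this is acceptable depends on whether the non-breaking space needs to survive in the rendered output.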

lacymorrow avatar Mar 04 '21 22:03 lacymorrow

Absolutely 👍

On Thu, Aug 15, 2024 at 12:19 manojks1999 @.***> wrote:

@lacymorrow Can I work on this?


lacymorrow avatar Aug 15 '24 16:08 lacymorrow

@lacymorrow, I made the changes. Please take a look.

manojks1999 avatar Aug 21 '24 18:08 manojks1999