bug: maiyamok repeats the wrong word #955

@bact

Description

maiyamok(["ทำไม", "คน", "ดีๆ", " ", "ๆ", "ๆ", " ", "ถึง", "ทำ", "ไม่ได้"])

returns

['ทำไม', 'คน', 'คน', 'คน', 'คน', ' ', 'ถึง', 'ทำ', 'ไม่ได้']

Expected results

["ทำไม", "คน", "ดี", "ดี", "ดี", "ดี", " ", "ถึง", "ทำ", "ไม่ได้"]

Current results

https://github.com/PyThaiNLP/pythainlp/actions/runs/11627805664/job/32381817268

======================================================================
FAIL: test_normalize (tests.test_util.UtilTestCase)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/Users/runner/work/pythainlp/pythainlp/tests/test_util.py", line 555, in test_normalize
    self.assertEqual(
AssertionError: Lists differ: ['ทำไม', 'คน', 'คน', 'คน', 'คน', ' ', 'ถึง', 'ทำ', 'ไม่ได้'] != ['ทำไม', 'คน', 'ดี', 'ดี', 'ดี', 'ดี', ' ', 'ถึง', 'ทำ', 'ไม่ได้']

First differing element 2:
'คน'
'ดี'

Second list contains 1 additional elements.
First extra element 9:
'ไม่ได้'

- ['ทำไม', 'คน', 'คน', 'คน', 'คน', ' ', 'ถึง', 'ทำ', 'ไม่ได้']
?                 ^^    ^^    ^^    ^

+ ['ทำไม', 'คน', 'ดี', 'ดี', 'ดี', 'ดี', ' ', 'ถึง', 'ทำ', 'ไม่ได้']
?                 ^^    ^^    ^^    ^^  +++++

Steps to reproduce

        self.assertEqual(
            maiyamok(
                ["ทำไม", "คน", "ดีๆ", " ", "ๆ", "ๆ", " ", "ถึง", "ทำ", "ไม่ได้"]
            ),
            ["ทำไม", "คน", "ดี", "ดี", "ดี", "ดี", " ", "ถึง", "ทำ", "ไม่ได้"],
        )
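
For reference, the expected behaviour can be written down as a small standalone function. This is only a sketch of what the test above asks for, not PyThaiNLP's actual implementation; the name `expand_maiyamok_sketch` and the whitespace handling (a space directly before a bare "ๆ" is dropped, any other space is kept) are assumptions inferred from the expected list.

```python
MAIYAMOK = "ๆ"


def expand_maiyamok_sketch(tokens):
    """Hypothetical helper: expand each "ๆ" into a repeat of the nearest preceding word."""
    result = []
    pending_spaces = []  # whitespace held back until we know what follows it
    last_word = None     # most recent real word, i.e. what a "ๆ" should repeat

    for token in tokens:
        stem = token.rstrip(MAIYAMOK)
        repeats = len(token) - len(stem)  # number of trailing "ๆ" marks

        if repeats == 0 and not token.strip():
            # Pure whitespace: hold it, it is dropped if a bare "ๆ" follows.
            pending_spaces.append(token)
            continue

        if not stem.strip():
            # Token consists only of "ๆ" marks.
            if last_word is None:
                # Nothing to repeat, so pass the token through unchanged.
                result.extend(pending_spaces)
                result.append(token)
            else:
                result.extend([last_word] * repeats)
            pending_spaces = []
            continue

        # Ordinary word, possibly with "ๆ" attached (e.g. "ดีๆ").
        result.extend(pending_spaces)
        pending_spaces = []
        result.extend([stem] * (1 + repeats))
        last_word = stem

    result.extend(pending_spaces)
    return result
```

The key point is that `last_word` is updated from the stem of "ดีๆ", so the later bare "ๆ" tokens repeat "ดี" rather than the earlier "คน".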

PyThaiNLP version

5.0.4

Python version

All

Operating system and version

All

Labels

bug (bugs in the library)
