cleanco

add more test data (company names)

Open petri opened this issue 10 years ago • 6 comments

@psolin, would you have any lists of company names that you want to see tested?

petri avatar Aug 25 '15 05:08 petri

Hi, I have some company names such as:

  • xxx co ltd
  • xxx private limited
  • xxx pte limited
  • xxx co limited

Do you think it's a good idea to add these additional terms to termdata.py?

davidheryanto avatar Nov 11 '15 07:11 davidheryanto
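
For illustration, the four patterns above could be written down as a small table of input and expected output pairs. This is only a minimal sketch: `SG_TERMS` and `strip_suffix` are hypothetical stand-ins, not the contents of termdata.py or cleanco's actual API, and the toy matcher assumes ASCII suffixes.

```python
# Hypothetical term list for the names above; NOT taken from termdata.py.
SG_TERMS = ["co ltd", "private limited", "pte limited", "co limited"]


def strip_suffix(name, terms):
    """Toy stand-in for a cleaner: drop one matching trailing term."""
    trimmed = name.rstrip(" .")
    lowered = trimmed.lower()
    # Try longer terms first so the most specific suffix is removed.
    for term in sorted(terms, key=len, reverse=True):
        if lowered.endswith(" " + term):
            return trimmed[: -len(term)].rstrip(" ,.")
    return name


# (input name, expected base name) pairs; "Acme" is a made-up company.
CASES = [
    ("Acme Co Ltd", "Acme"),
    ("Acme Private Limited", "Acme"),
    ("Acme Pte Limited", "Acme"),
    ("Acme Co Limited", "Acme"),
    ("Acme", "Acme"),  # no suffix: the name is returned unchanged
]

mismatches = [(n, e) for n, e in CASES if strip_suffix(n, SG_TERMS) != e]
```

A table like `CASES` could be fed to whichever cleaning function is under test; an empty `mismatches` list means every pattern was handled.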

Could https://opencorporates.com be used for testing?

petri avatar Jan 11 '16 16:01 petri

@davidheryanto it depends. What countries are those for?

petri avatar Jan 03 '17 10:01 petri

I have added a companies.csv file to the tests directory, but unfortunately it seems we cannot really use bulk ASCII company names for testing: many international companies use common Anglo-American suffixes such as ltd. or inc. in their corporate names, which results in a lot of failures.

If we could get the Unicode versions of the national suffixes (i.e. in native Chinese or Russian characters), that would be useful. But I am not sure whether cleanco even supports that.

petri avatar Jan 03 '17 10:01 petri

Yes, agree with the Unicode approach. It will be applicable to company names in different countries.

The company names I gave are examples of companies in Singapore.

davidheryanto avatar Jan 03 '17 12:01 davidheryanto

We now have improved Unicode and non-Latin script support, so better test coverage would make sense too.

One option would be to use https://faker.readthedocs.io/en/master/ to generate fake test company names. Manual labour would still be needed to provide the expected base names that cleanco should be able to produce.

petri avatar Apr 26 '20 14:04 petri
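
One way to reduce the manual labour mentioned above might be to build each test name from a known base name plus a suffix, so the expected result is known by construction. The sketch below uses only the standard library rather than Faker; `BASE_NAMES`, `SUFFIXES`, `make_cases` and `failures` are all hypothetical names, and the sample data is invented.

```python
import random

# Invented sample data. In practice the base names might come from Faker and
# the suffixes from the project's own term lists. Real legal forms may also
# appear as prefixes or without a separating space; this sketch ignores that.
BASE_NAMES = ["Acme", "Nordic Timber", "Северный Лес", "东方茶业"]
SUFFIXES = ["Ltd.", "Inc.", "Pte Limited", "ООО", "有限公司"]


def make_cases(count, seed=0):
    """Return (full_name, expected_base) pairs, reproducible for a given seed."""
    rng = random.Random(seed)
    cases = []
    for _ in range(count):
        base = rng.choice(BASE_NAMES)
        suffix = rng.choice(SUFFIXES)
        cases.append((f"{base} {suffix}", base))
    return cases


def failures(cleaner, cases):
    """List (full_name, expected, actual) for every case the cleaner gets wrong."""
    wrong = []
    for full_name, expected in cases:
        actual = cleaner(full_name)
        if actual != expected:
            wrong.append((full_name, expected, actual))
    return wrong
```

Calling `failures(cleaner, make_cases(100))` with the cleaning function under test as `cleaner` would then list the generated names it does not reduce to their base name.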